publication . Preprint . 2016

Feature-Augmented Neural Networks for Patient Note De-identification

Lee, Ji Young; Dernoncourt, Franck; Uzuner, Ozlem; Szolovits, Peter;
Open Access English
  • Published: 30 Oct 2016
Abstract
Comment: Accepted as a conference paper at COLING ClinicalNLP 2016. The first two authors contributed equally to this work
Subjects
free text keywords: Computer Science - Computation and Language, Computer Science - Neural and Evolutionary Computing, Statistics - Machine Learning
Download from
34 references, page 1 of 3

[Aberdeen et al.2010] John Aberdeen, Samuel Bayer, Reyyan Yeniterzi, Ben Wellner, Cheryl Clark, David Hanauer, Bradley Malin, and Lynette Hirschman. 2010. The MITRE Identification Scrubber Toolkit: design, training, and assessment. International journal of medical informatics, 79(12):849-859. [OpenAIRE]

[Beckwith et al.2006] Bruce A Beckwith, Rajeshwarri Mahaadevan, Ulysses J Balis, and Frank Kuo. 2006. Development and evaluation of an open source software tool for deidentification of pathology reports. BMC medical informatics and decision making, 6(1):1.

[Berman2003] Jules J Berman. 2003. Concept-match medical data scrubbing: how pathology text can be used in research. Archives of pathology & laboratory medicine, 127(6):680-686.

[Dernoncourt et al.2016] Franck Dernoncourt, Ji Young Lee, Ozlem Uzuner, and Peter Szolovits. 2016. Deidentification of patient notes with recurrent neural networks. arXiv preprint arXiv:1606.03475. [OpenAIRE]

[DesRoches et al.2013] Catherine M DesRoches, Chantal Worzala, and Scott Bates. 2013. Some hospitals are falling behind in meeting meaningful use criteria and could be vulnerable to penalties in 2015. Health Affairs, 32(8):1355-1360.

[Douglas et al.2004] Margaret Douglas, Gari Clifford, Andrew Reisner, George Moody, and Roger Mark. 2004. Computer-assisted de-identification of free text in the mimic ii database. In Computers in Cardiology, 2004, pages 341-344. IEEE.

[Douglass et al.2005] Margaret Douglass, Gari Cliffford, Andrew Reisner, William Long, George Moody, and Roger Mark. 2005. De-identification algorithm for free-text nursing notes. In Computers in Cardiology, 2005, pages 331-334. IEEE. [OpenAIRE]

[Fielstein et al.2004] Elliot M. Fielstein, Steven H. Brown, and Theodore Speroff. 2004. Algorithmic deidentification of VA medical exam text for HIPAA privacy compliance: Preliminary findings. Medinfo, 1590.

[Filannino and Nenadic2015] Michele Filannino and Goran Nenadic. 2015. Temporal expression extraction with extensive feature type selection and a posteriori label adjustment. Data & Knowledge Engineering, 100:19-33.

[Friedlin and McDonald2008] Jeff Friedlin and Clement J McDonald. 2008. A software tool for removing patient identifying information from clinical documents. Journal of the American Medical Informatics Association, 15(5):601-610.

[Goldberger et al.2000] Ary L Goldberger, Luis AN Amaral, Leon Glass, Jeffrey M Hausdorff, Plamen Ch Ivanov, Roger G Mark, Joseph E Mietus, George B Moody, Chung-Kang Peng, and H Eugene Stanley. 2000. Physiobank, physiotoolkit, and physionet components of a new research resource for complex physiologic signals. Circulation, 101(23):e215-e220.

[Guo et al.2006] Yikun Guo, Robert Gaizauskas, Ian Roberts, George Demetriou, and Mark Hepple. 2006. Identifying personal health information using support vector machines. In i2b2 workshop on challenges in natural language processing for clinical data, pages 10-11.

[Gupta et al.2004] Dilip Gupta, Melissa Saul, and John Gilbertson. 2004. Evaluation of a deidentification (De-Id) software engine to share pathology reports and clinical documents for research. American journal of clinical pathology, 121(2):176-186.

[Hara2006] Kazuo Hara. 2006. Applying a SVM based chunker and a text classifier to the deid challenge. In i2b2 Workshop on challenges in natural language processing for clinical data, pages 10-11. Am Med Inform Assoc.

[Hochreiter and Schmidhuber1997] Sepp Hochreiter and Ju¨rgen Schmidhuber. 1997. Long short-term memory. Neural computation, 9(8):1735-1780.

34 references, page 1 of 3
Powered by OpenAIRE Open Research Graph
Any information missing or wrong?Report an Issue