
Abstract Summary Second use of clinical data commonly involves annotating biomedical text with terminologies and ontologies. The National Center for Biomedical Ontology Annotator is a frequently used annotation service, originally designed for biomedical data, but not very suitable for clinical text annotation. In order to add new functionalities to the NCBO Annotator without hosting or modifying the original Web service, we have designed a proxy architecture that enables seamless extensions by pre-processing of the input text and parameters, and post processing of the annotations. We have then implemented enhanced functionalities for annotating and indexing free text such as: scoring, detection of context (negation, experiencer, temporality), new output formats and coarse-grained concept recognition (with UMLS Semantic Groups). In this paper, we present the NCBO Annotator+, a Web service which incorporates these new functionalities as well as a small set of evaluation results for concept recognition and clinical context detection on two standard evaluation tasks (Clef eHealth 2017, SemEval 2014). Availability and implementation The Annotator+ has been successfully integrated into the SIFR BioPortal platform—an implementation of NCBO BioPortal for French biomedical terminologies and ontologies—to annotate English text. A Web user interface is available for testing and ontology selection (http://bioportal.lirmm.fr/ncbo_annotatorplus); however the Annotator+ is meant to be used through the Web service application programming interface (http://services.bioportal.lirmm.fr/ncbo_annotatorplus). The code is openly available, and we also provide a Docker packaging to enable easy local deployment to process sensitive (e.g. clinical) data in-house (https://github.com/sifrproject). Supplementary information Supplementary data are available at Bioinformatics online.
NCBO Annotator, [INFO.INFO-AI] Computer Science [cs]/Artificial Intelligence [cs.AI], Semantic annotation, Text mining, [INFO.INFO-WB] Computer Science [cs]/Web, [INFO.INFO-TT] Computer Science [cs]/Document and Text Processing, Information Storage and Retrieval, Biomedical ontologies, Applications Notes, Biological Ontologies, Ontologies, Humans, Software, [INFO.INFO-BI] Computer Science [cs]/Bioinformatics [q-bio.QM]
NCBO Annotator, [INFO.INFO-AI] Computer Science [cs]/Artificial Intelligence [cs.AI], Semantic annotation, Text mining, [INFO.INFO-WB] Computer Science [cs]/Web, [INFO.INFO-TT] Computer Science [cs]/Document and Text Processing, Information Storage and Retrieval, Biomedical ontologies, Applications Notes, Biological Ontologies, Ontologies, Humans, Software, [INFO.INFO-BI] Computer Science [cs]/Bioinformatics [q-bio.QM]
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 19 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
