A survey of named entity recognition and classification

descriptionPublicationkeyboard_double_arrow_right Part of book or chapter of book , Article , Conference object 10 Aug 2007 Canada English Publisher:John Benjamins Publishing CompanyJournal:Lingvisticae Investigationes, volume 30, pages 3-26 (issn: 0378-4169, eissn: 1569-9927,

Copyright policy )

Authors: Nadeau, David; Sekine, S.;

doi: 10.1075/bct.19.03nad , 10.1075/li.30.1.03nad

A survey of named entity recognition and classification

- Summary
- Metrics

Abstract

This survey covers fifteen years of research in the Named Entity Recognition and Classification (NERC) field, from 1991 to 2006. We report observations about languages, named entity types, domains and textual genres studied in the literature. From the start, NERC systems have been developed using hand-made rules, but now machine learning techniques are widely used. These techniques are surveyed along with other critical aspects of NERC such as features and evaluation methods. Features are word-level, dictionary-level and corpus-level representations of words in a document. Evaluation techniques, ranging from intuitive exact match to very complex matching techniques with adjustable cost of errors, are an indisputable key to progress.

Country

Canada

Related Organizations

New York University
United States
National Research Council Canada
Canada
National Academies of Sciences, Engineering, and Medicine
United States

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	2K
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 0.01%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 0.01%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 1%

Found an issue? Give us feedback

2K

Top 0.01%

Top 1%

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

Upload OA version

Are you the author of this publication? Upload your Open Access version to Zenodo!

It’s fast and easy, just two clicks!

uploadUpload now