Broad-coverage sense disambiguation and information extraction with a supersense sequence tagger

descriptionPublicationkeyboard_double_arrow_right Article , Conference object 01 Jan 2006Publisher:Association for Computational Linguistics (ACL)Journal:Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing - EMNLP '06

Authors: Massimiliano Ciaramita; Yasemin Altun;

doi: 10.3115/1610075.1610158

Broad-coverage sense disambiguation and information extraction with a supersense sequence tagger

- Summary
- Related research
  (2)
- Metrics

Abstract

In this paper we approach word sense disambiguation and information extraction as a unified tagging problem. The task consists of annotating text with the tagset defined by the 41 Wordnet supersense classes for nouns and verbs. Since the tagset is directly related to Wordnet synsets, the tagger returns partial word sense disambiguation. Furthermore, since the noun tags include the standard named entity detection classes -- person, location, organization, time, etc. -- the tagger, as a by-product, returns extended named entity information. We cast the problem of supersense tagging as a sequential labeling task and investigate it empirically with a discriminatively-trained Hidden Markov Model. Experimental evaluation on the main sense-annotated datasets available, i.e., Semcor and Senseval, shows considerable improvements over the best known "first-sense" baseline.

Related Organizations

Toyota Technological Institute at Chicago
United States
National Research Council
Sri Lanka
National Academies of Sciences, Engineering, and Medicine
United States

2 Research products, page 1 of 1

An Information Retrieval Approach to Sense Ranking.
2007IsAmongTopNSimilarDocuments
SenseLearner
2005IsAmongTopNSimilarDocuments

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	46
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average