The interaction of knowledge sources in word sense disambiguation

Article English OPEN
Stevenson, M. ; Wilks, Y. (2001)
  • Publisher: MIT Press

Word sense disambiguation (WSD) is a computational linguistics task likely to benefit from the tradition of combining different knowledge sources in artificial intelligence research. An important step in the exploration of this hypothesis is to determine which linguistic knowledge sources are most useful and whether their combination leads to improved results.

We present a sense tagger which uses several knowledge sources. Tested accuracy exceeds 94% on our evaluation corpus. Our system attempts to disambiguate all content words in running text rather than limiting itself to treating a restricted vocabulary of words. It is argued that this approach is more likely to assist the creation of practical systems.