Coreference resolution using clusterization

Anastasiya Bodrova; Natalia Grafeeva

Found an issue? Give us feedback

https://doi.org/10.1...arrow_drop_down

https://doi.org/10.1109/fruct....

Article . 2016 . Peer-reviewed

Data sources: Crossref

https://dx.doi.org/10.1109/fru...

Other literature type

Data sources: Microsoft Academic Graph

Coreference resolution using clusterization

descriptionPublicationkeyboard_double_arrow_right Article , Other literature type 04 Sep 2016Publisher:IEEEJournal:2016 International FRUCT Conference on Intelligence, Social Media and Web (ISMW FRUCT)

Authors: Anastasiya Bodrova; Natalia Grafeeva;

doi: 10.1109/fruct.2016.7584764

Coreference resolution using clusterization

- Summary
- Metrics

Abstract

This work deseribes the experience of ereating a corefarence resolution system for Russian language. Coreference resolution is a key subtask of Information Extraction, and aims to grouping mentions that refer to the same discourse entity. This work was aimed to applying a clusterization algorithm for Russian-language newswire texts. We narrowed the task to Person proper names clusterization. Our approach model included two steps: mention extraction and clusterization. Mention extraction was proceeded by manually-created grammars for Tomita-parser. For mention grouping, we used agglomerative clusterization on entity level with the help of weighted feature vectors. We run our experiments on newswire texts, annotated for factRuEval-2016 competition, organized by Dialogue Evaluation. We compare our results with competitors. As a baseline, we set built-in Tonuta-parser algorithms for name extraction and name clusterization. We got comparable results and outperformed the baseline.

Related Organizations

St Petersburg University
Russian Federation

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

0

Average

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

Upload OA version

Are you the author of this publication? Upload your Open Access version to Zenodo!

It’s fast and easy, just two clicks!

uploadUpload now