Extractive multi-document summarization using multilayer networks

Publication type: Article, Preprint

Publication date: 01 Aug 2018

Embargo end date: 01 Jan 2017

Language: English

Publisher: Elsevier BV

Journal: Physica A: Statistical Mechanics and its Applications, volume 503, pages 526-539 (ISSN: 0378-4371)

Authors: Jorge Valverde Tohalino; Diego R. Amancio

doi: 10.1016/j.physa.2018.03.013, 10.48550/arxiv.1711.02608

arXiv: 1711.02608


Abstract

Huge volumes of textual information are produced every single day. To organize and understand such large datasets, summarization techniques have become popular in recent years. These techniques aim to find relevant, concise and non-redundant content in such large volumes of data. While network methods have been adopted to model texts in some scenarios, a systematic evaluation of multilayer network models in the multi-document summarization task has been limited to a few studies. Here, we evaluate the performance of a multilayer-based method to select the most relevant sentences in the context of an extractive multi-document summarization (MDS) task. In the adopted model, nodes represent sentences and edges are created based on the number of shared words between sentences. Unlike previous studies in multi-document summarization, we make a distinction between edges linking sentences from different documents (inter-layer) and those connecting sentences from the same document (intra-layer). As a proof of principle, our results reveal that such a discrimination between intra- and inter-layer edges in a multilayered representation is able to improve the quality of the generated summaries. This finding could be used to improve current statistical methods and related textual models.
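
The model described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the crude tokenizer, the `inter_layer_factor` parameter used to treat inter-layer edges differently, and the choice of node strength as the ranking measure are all assumptions made for illustration. Only the core idea comes from the abstract: sentences are nodes, edge weights count shared words, and edges within a document (intra-layer) are distinguished from edges across documents (inter-layer).

```python
from itertools import combinations


def tokenize(sentence):
    """Crude tokenizer (assumption): lowercase words with punctuation stripped.

    Real preprocessing would likely also remove stopwords and lemmatize.
    """
    words = {w.strip(".,;:!?\"'()").lower() for w in sentence.split()}
    return words - {""}


def build_multilayer_network(documents, inter_layer_factor=0.5):
    """Build a sentence network where each document is one layer.

    documents: list of documents, each a list of sentence strings.
    inter_layer_factor: hypothetical scaling applied to edges that link
        sentences from different documents (inter-layer edges).
    Returns (nodes, edges): nodes are (doc_index, sentence_index) pairs,
    edges maps a node pair to its weight.
    """
    nodes = [(d, s) for d, doc in enumerate(documents) for s in range(len(doc))]
    words = {(d, s): tokenize(documents[d][s]) for d, s in nodes}
    edges = {}
    for u, v in combinations(nodes, 2):
        shared = len(words[u] & words[v])
        if shared == 0:
            continue  # no shared words, no edge
        weight = float(shared)
        if u[0] != v[0]:
            # Different documents: inter-layer edge, weighted separately.
            weight *= inter_layer_factor
        edges[(u, v)] = weight
    return nodes, edges


def rank_by_strength(nodes, edges):
    """Rank nodes by strength (sum of incident edge weights), highest first."""
    strength = {n: 0.0 for n in nodes}
    for (u, v), w in edges.items():
        strength[u] += w
        strength[v] += w
    # Ties are broken by document and sentence order for determinism.
    return sorted(nodes, key=lambda n: (-strength[n], n))


def summarize(documents, k=2, inter_layer_factor=0.5):
    """Return the k top-ranked sentences as an extractive summary."""
    nodes, edges = build_multilayer_network(documents, inter_layer_factor)
    ranked = rank_by_strength(nodes, edges)
    return [documents[d][s] for d, s in ranked[:k]]
```

Calling `summarize(documents, k=2)` on a list of tokenized-by-sentence documents returns the two sentences with the highest strength. Varying `inter_layer_factor` changes how much cross-document overlap contributes to the ranking, which is one simple way to explore the intra- versus inter-layer distinction the abstract highlights.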

Related Organizations

Indiana University
United States
Universidade de Sao Paulo/Instituto dos Estudos Avançados
Brazil
Indiana University Bloomington
United States
UNIVERSIDADE DE SAO PAULO
Brazil

Keywords

FOS: Computer and information sciences, Computer Science - Computation and Language, Computation and Language (cs.CL)

Impact by BIP!

- Selected citations: 55. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- Popularity: Top 1%. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
- Influence: Top 10%. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- Impulse: Top 10%. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.


Open access: Green, bronze

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering
