AUTOMATIC TOPIC SEGMENTATION AND LABELING IN MULTIPARTY DIALOGUE

Publication type: Article, Conference object, Contribution for newspaper or weekly magazine; Date: 01 Jan 2006; Country: United Kingdom; Publisher: IEEE; Journal: 2006 IEEE Spoken Language Technology Workshop

Authors: Hsueh, Pei-Yun; Moore, Johanna D.

doi: 10.1109/slt.2006.326826

handle: 20.500.11820/0b89d270-619a-48e1-a576-70858cc9a102 , 1842/4182

Abstract

This study concerns how to segment a scenario-driven multiparty dialogue and how to label the resulting segments automatically. We apply approaches that have been proposed for identifying topic boundaries at a coarser level to the problem of identifying agenda-based topic boundaries in scenario-based meetings. We also develop conditional models to classify segments into topic classes. Experiments in topic segmentation show that a supervised classification approach that combines lexical and conversational features outperforms the unsupervised lexical chain-based approach, achieving 20% and 12% improvements in segmenting top-level and sub-topic segments, respectively. Experiments in topic classification suggest that it is possible to automatically categorize segments into appropriate topic classes given only the transcripts. Training with features selected using the log-likelihood ratio improves the results by 13.3%.
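The abstract does not say how the log-likelihood ratio is computed or applied. As a rough illustration only, the sketch below assumes Dunning's G² statistic over a two-by-two contingency table (word vs. other words, inside vs. outside a topic class) and ranks words that are relatively more frequent inside the class. The function names, the `(label, tokens)` input layout, and the `top_k` cutoff are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def _cell(observed, expected):
    # Contribution of one contingency-table cell; a zero count contributes nothing.
    return observed * math.log(observed / expected) if observed > 0 else 0.0


def log_likelihood_ratio(k_in, n_in, k_out, n_out):
    """G^2 for a word seen k_in times among n_in tokens inside a class
    and k_out times among n_out tokens outside it (assumed formulation)."""
    total = n_in + n_out
    word_total = k_in + k_out
    other_total = total - word_total
    cells = [
        (k_in, n_in * word_total / total),
        (k_out, n_out * word_total / total),
        (n_in - k_in, n_in * other_total / total),
        (n_out - k_out, n_out * other_total / total),
    ]
    return 2.0 * sum(_cell(obs, exp) for obs, exp in cells)


def select_features(segments, target_class, top_k=10):
    """Rank words by G^2 for one topic class.

    `segments` is a list of (label, tokens) pairs; this layout is an
    assumption made for the example.
    """
    in_counts, out_counts = Counter(), Counter()
    for label, tokens in segments:
        (in_counts if label == target_class else out_counts).update(tokens)
    n_in, n_out = sum(in_counts.values()), sum(out_counts.values())
    if n_in == 0 or n_out == 0:
        return []
    scores = {}
    for word, k_in in in_counts.items():
        k_out = out_counts[word]
        # Keep only words that are relatively more frequent inside the class.
        if k_in / n_in > k_out / n_out:
            scores[word] = log_likelihood_ratio(k_in, n_in, k_out, n_out)
    return sorted(scores, key=scores.get, reverse=True)[:top_k]
```

Calling `select_features(segments, "budget", top_k=20)` on labeled transcript segments would return up to twenty candidate cue words for a hypothetical "budget" class; the selected words could then serve as lexical features for a segment classifier.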

Country

United Kingdom

Related Organizations

University of Edinburgh
United Kingdom

Keywords

pattern classification, speech processing

Impact by BIP!

selected citations: 9. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
popularity: Average. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
influence: Top 10%. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
impulse: Average. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.