On-line handwritten text categorization

descriptionPublicationkeyboard_double_arrow_right Article , Conference object 18 Jan 2009Publisher:SPIEJournal:SPIE Proceedings, volume 7,247, page 724,709 (issn: 0277-786X,

Copyright policy )

Authors: Peña Saldarriaga, Sebastián; Viard-Gaudin, Christian; Morin, Emmanuel;

doi: 10.1117/12.804355

On-line handwritten text categorization

- Summary
- Subjects
- Metrics

Abstract

As new innovative devices, accepting or producing on-line documents, emerge, managing facilities for these kinds of documents such as topic spotting are required. This means that we should be able to perform text categorization of on-line documents. The textual data available in on-line documents can be extracted through online recognition, a process which produces noise, i.e. errors, in the resulting text. This work reports experiments on categorization of on-line handwritten documents based on their textual contents. We analyze the effect of the word recognition rate on the categorization performances, by comparing the performances of a categorization system over the texts obtained through on-line handwriting recognition and the same texts available as ground truth. Two categorization algorithms (kNN and SVM) are compared in this work. A subset of the Reuters-21578 corpus consisting of more than 2000 handwritten documents has been collected for this study. Results show that accuracy loss is not significant, and precision loss is only significant for recall values of 60%-80% depending on the noise levels.

Related Organizations

French National Centre for Scientific Research
France
University of Nantes
France
Centre national de la recherche scientifique
France
École Centrale de Nantes
France

Keywords

[INFO.INFO-TT] Computer Science [cs]/Document and Text Processing

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	2
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

2

Average

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

Upload OA version

Are you the author of this publication? Upload your Open Access version to Zenodo!

It’s fast and easy, just two clicks!

uploadUpload now