Document image analysis: A primer

descriptionPublicationkeyboard_double_arrow_right Article 01 Feb 2002 English Publisher:Springer Science and Business Media LLCJournal:Sadhana, volume 27, pages 3-22 (issn: 0256-2499, eissn: 0973-7677,

Copyright policy )

Authors: Rangachar Kasturi; Lawrence O’Gorman; Venu Govindaraju;

doi: 10.1007/bf02703309

Document image analysis: A primer

- Summary
- Metrics

Abstract

Document image analysis refers to algorithms and techniques that are applied to images of documents to obtain a computer-readable description from pixel data. A well-known document image analysis product is the Optical Character Recognition (OCR) software that recognizes characters in a scanned document. OCR makes it possible for the user to edit or search the document’s contents. In this paper we briefly describe various components of a document analysis system. Many of these basic building blocks are found in most document analysis systems, irrespective of the particular domain or language to which they are applied. We hope that this paper will help the reader by providing the background necessary to understand the detailed descriptions of specific techniques presented in other papers in this issue.

Related Organizations

Pennsylvania State University
United States
State University of New York at Potsdam
United States
University at Buffalo, State University of New York
United States
Avaya (Bermuda)
Bermuda

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	77
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 1%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

77

Top 10%

Top 1%

Average

gold

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering