
This dataset is a selection of 500 pages from the Renovated District Court Records (19th century), one of the largest collections in the National Archives of Finland. The documents consist of records of deeds, mortgages, and traditional life annuities, among others. Each image contains one or two document pages and is annotated at image level with six region types, along with baselines and line-level transcriptions (Swedish). This mix of single-page and double-page images is a common complication in historical documents. The layout labels are: 1. page-number: the page number, commonly placed in the top-right corner of the image; 2. paragraph: a paragraph placed on a single-page image or on the left side of a double-page image; 3. paragraph_2: a paragraph placed on the right side of a double-page image; 4. marginalia: any annotation in the margin of the document; 5. table: a table placed on a single-page image or on the left side of a double-page image; 6. table_2: a table placed on the right side of a double-page image. The images and their ground truth were compiled in PAGE-compliant XML format by the National Archives of Finland and the HTR group of the Pattern Recognition and Human Language Technologies Research Center.
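As a rough illustration of how the annotations described above might be read, the following Python sketch extracts region labels, baselines, and line transcriptions from a PAGE XML string. It rests on several assumptions not stated in the dataset description: that the region label is stored either in a `custom="structure {type:...;}"` attribute or in a plain `type` attribute, that text lines are direct children of their region, and that the embedded sample (file name, coordinates, and text) is invented purely for demonstration. The helper names `read_page`, `region_label`, and `parse_points` are hypothetical.

```python
import re
import xml.etree.ElementTree as ET

# Invented sample for demonstration only; real files may differ in
# namespace version and in how the region label is stored.
SAMPLE = """<PcGts xmlns="http://schema.primaresearch.org/PAGE/gts/pagecontent/2013-07-15">
  <Page imageFilename="example.jpg" imageWidth="2000" imageHeight="1500">
    <TextRegion id="r1" custom="structure {type:paragraph_2;}">
      <Coords points="1050,100 1900,100 1900,400 1050,400"/>
      <TextLine id="r1l1">
        <Baseline points="1060,150 1890,152"/>
        <TextEquiv><Unicode>exempeltext</Unicode></TextEquiv>
      </TextLine>
    </TextRegion>
  </Page>
</PcGts>"""


def local(tag):
    """Strip the XML namespace so the code works across PAGE schema versions."""
    return tag.rsplit("}", 1)[-1]


def child(elem, name):
    """Return the first direct child with the given local tag name, or None."""
    for c in elem:
        if local(c.tag) == name:
            return c
    return None


def parse_points(points):
    """Turn 'x1,y1 x2,y2 ...' into a list of (x, y) integer tuples."""
    return [tuple(int(v) for v in p.split(",")) for p in points.split()]


def region_label(region):
    """Assumed convention: label in the 'custom' attribute, else in 'type'."""
    match = re.search(r"type:([^;}]+)", region.get("custom", ""))
    return match.group(1).strip() if match else region.get("type")


def read_page(xml_text):
    """Collect the label and text lines of every text or table region."""
    root = ET.fromstring(xml_text)
    regions = []
    for elem in root.iter():
        if local(elem.tag) not in ("TextRegion", "TableRegion"):
            continue
        lines = []
        for line in elem:
            if local(line.tag) != "TextLine":
                continue
            baseline = child(line, "Baseline")
            equiv = child(line, "TextEquiv")
            text = child(equiv, "Unicode") if equiv is not None else None
            lines.append({
                "baseline": parse_points(baseline.get("points")) if baseline is not None else [],
                "text": (text.text or "") if text is not None else "",
            })
        regions.append({"label": region_label(elem), "lines": lines})
    return regions


regions = read_page(SAMPLE)
```

Matching on local tag names rather than a fixed namespace is a deliberate choice here, since the exact PAGE schema version used by the dataset is not given in the description.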
handwritten text document, document layout analysis, XIX century documents