Downloads provided by UsageCounts
Page layout analysis is a fundamental step in document processing which enables to segment a page into regions of interest. With highly complex layouts and mixed scripts, scholarly commentaries are text-heavy documents which remain challenging for state-of-the-art models. Their layout considerably varies across editions and their most important regions are mainly defined by semantic rather than graphical characteristics such as position or appearance. This setting calls for a comparison between textual, visual and hybrid approaches. We therefore assess the performances of two transformers (LayoutLMv3 and RoBERTa) and an objection-detection network (YOLOv5). If results show a clear advantage in favor of the latter, we also list several caveats to this finding. In addition to our experiments, we release a dataset of ca. 300 annotated pages sampled from 19th century commentaries.
Same as https://ceur-ws.org/Vol-3290/long_paper8670.pdf
FOS: Computer and information sciences, Computer Science - Computation and Language, Artificial Intelligence (cs.AI), evaluation, Computer Science - Artificial Intelligence, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, historical classical commentaries, digital humanities, Computation and Language (cs.CL), Information Retrieval (cs.IR), Computer Science - Information Retrieval, page layout analysis
FOS: Computer and information sciences, Computer Science - Computation and Language, Artificial Intelligence (cs.AI), evaluation, Computer Science - Artificial Intelligence, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, historical classical commentaries, digital humanities, Computation and Language (cs.CL), Information Retrieval (cs.IR), Computer Science - Information Retrieval, page layout analysis
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
| views | 8 | |
| downloads | 6 |

Views provided by UsageCounts
Downloads provided by UsageCounts