
This deliverable presents three case studies involving digitisation and transformation processes. The studies are presented in order of the complexity of the research question, which is reflected in the difficulty of the corpus compilation task. Transformation processes seem to be inevitable in each case, but paradoxically the urgency of digitisation diminishes as the complexity of a task increases. The case studies described in this deliverable are:
1. Creation of an ELTeC-affine corpus of the Slovak novel (chapter 2)
2. Finding the haiku across multilingual corpora (chapter 3)
3. Measuring entropy and surprisal in the prose of the Tsarist Empire Devoted to Terrorism (Russian and Polish Texts) (chapter 4)
The first two case studies have already served as reference cases for the data landscape review (CLS INFRA Deliverable 5.1). This extended version, which conveys the experience of six months of research and is enriched by the third case study, highlights specific aspects of the multidimensional landscape of literary text collections. In Deliverable 5.1, the case studies served merely as illustrations and concretisations of general points; here they are the focus of attention. The third case has been designed with the most complex research questions in mind, to go even further in exploring what is available and what is possible in the digital humanities today.
data landscape, literary text collections, corpus architecture, topology of corpora, FAIR principles, surprisal