
This corpus was compiled by gathering texts published on the website https://hotline.org.tw/ from the 13th of March, 2011 to the 21st of February, 2024. It comprises 111,126 tokens, 90,273 words, and 4,370 sentences. Additionally, it encompasses 15,567 lemmas and 13,559 unique word forms (including non-words). The uploaded file contains both a plain text version (without POS tags or lemmas, but retaining all structures and structural attributes) and a vertical file (presenting the corpus in vertical format, including POS tags, lemmas, structures, and attributes). POS tagging and lemmatization were executed using the Sketch Engine platform (http://www.sketchengine.eu; Kilgarriff et al. 2004, Kilgarriff et al. 2014).
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
