Views provided by UsageCounts
Content: List of the 743 domains, their term vocabularies in 10 languages, and the Wikipedia articles associated with each domain, as extracted by the best model described in:

Cristina España-Bonet, Alberto Barrón-Cedeño and Lluís Màrquez. "Tailoring and Evaluating the Wikipedia for in-Domain Comparable Corpora Extraction." Knowledge and Information Systems, Volume 65, pages 1365-1397. 2023. Springer-Verlag, London Ltd. https://doi.org/10.1007/s10115-022-01767-5

https://github.com/cristinae/WikiTailor

Files Description:

commonCats2015.enesdefrcaareuelrooc.tsv
Multilingual domains, listed one per line. Languages are separated by a tab, in the order en, es, de, fr, ca, ar, eu, el, ro and oc. For each language, the field contains the pair "ID categoryName", separated by a blank space.

[LAN].0.tar.bz
One folder per domain for language [LAN], containing the vocabulary and the IDs of the articles extracted by the WikiTailor model 50-WT100.

extraction[LAN]0.tar.bz
One folder per domain for language [LAN], containing the text of the extracted articles. File names correspond to the IDs in [LAN].0.tar.bz.
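The line format described for commonCats2015.enesdefrcaareuelrooc.tsv can be read with a few lines of Python. The following is a minimal sketch, not code shipped with the dataset: the names `LANGS`, `parse_domain_line` and `read_domains` are hypothetical, and it assumes that a category name may itself contain spaces (so only the first space separates ID from name) and that a field may be empty when a language has no category for a domain.

```python
from typing import Dict, List, Optional, Tuple

# Column order as stated in the file description.
LANGS = ["en", "es", "de", "fr", "ca", "ar", "eu", "el", "ro", "oc"]

# One domain: language code -> (category ID, category name), or None if empty.
Domain = Dict[str, Optional[Tuple[str, str]]]


def parse_domain_line(line: str) -> Domain:
    """Parse one tab-separated line into a per-language mapping."""
    fields = line.rstrip("\n").split("\t")
    if len(fields) != len(LANGS):
        raise ValueError(f"expected {len(LANGS)} fields, got {len(fields)}")
    domain: Domain = {}
    for lang, field in zip(LANGS, fields):
        field = field.strip()
        if not field:
            # Assumption: an empty field means no category for this language.
            domain[lang] = None
            continue
        # Assumption: split on the first space only, since names may have spaces.
        cat_id, _, name = field.partition(" ")
        domain[lang] = (cat_id, name)
    return domain


def read_domains(path: str) -> List[Domain]:
    """Read the whole file, skipping blank lines."""
    with open(path, encoding="utf-8") as handle:
        return [parse_domain_line(line) for line in handle if line.strip()]
```

Calling `read_domains("commonCats2015.enesdefrcaareuelrooc.tsv")` would then return one dictionary per domain; whether the IDs should be kept as strings or converted to integers depends on how they are used downstream.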
comparable corpora, domain-specific corpora, Wikipedia
| Indicator | Description | Value |
| --- | --- | --- |
| selected citations | These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 |
| popularity | This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average |
| influence | This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average |
| impulse | This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
| views | | 3 |
