Downloads provided by UsageCounts
BasCrawl is a 186-million-token web corpus of Basque obtained by crawling over 12000 domains. We include the crawled domains. The corpus has been preprocessed and deduplicated as described in http://journal.sepln.org/sepln/ojs/ojs/index.php/pln/article/view/6405. It consists of 186.832.691 tokens, 12.303.132 sentences and 736.180 documents. Documents are separated by single new lines. We license the actual packaging of these data under a Creative Commons Attribution 4.0 International License. Copyright (c) 2022 Secretaría de Estado de Digitalización e Inteligencia Artificial
Funded by the Plan de Impulso de las Tecnologías del Lenguaje (Plan TL).
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
| views | 32 | |
| downloads | 360 |

Views provided by UsageCounts
Downloads provided by UsageCounts