
pmid: 35282246
pmc: PMC8907927
The present study analyzed the vocabulary profile of the News on the Web (NOW) corpus, which contained 12 billion words from online newspapers and magazines in 20 countries, to determine the vocabulary knowledge needed to reasonably understand online newspaper and magazine articles. The results showed that, in general, knowledge of the most frequent 4,000 word families in the British National Corpus/Corpus of Contemporary American English (BNC/COCA) wordlist, plus proper nouns, marginal words, transparent compounds, and acronyms, was necessary to reach 95% coverage of the NOW corpus. At the 98% coverage level, however, online newspaper and magazine articles from different countries had relatively distinct lexical demands. In-depth analyses were carried out, and implications for teaching and learning were provided.
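As a rough illustration of what a coverage figure such as 95% or 98% means, the sketch below computes cumulative lexical coverage for a tokenized text. It is not the study's procedure: the names `cumulative_coverage`, `bands_needed`, `band_of`, and `supplementary` are hypothetical, the band mapping is a toy stand-in for the BNC/COCA lists, and words are matched by surface form rather than grouped into word families.

```python
from collections import Counter


def cumulative_coverage(tokens, band_of, supplementary=frozenset()):
    """Return {band: cumulative % of tokens covered}.

    band_of maps a lowercase word to its 1,000-word frequency band
    (1 = most frequent). Items in `supplementary` (e.g. proper nouns,
    marginal words) are counted as known from the start.
    Assumption: matching is by lowercase surface form, not word family.
    """
    total = len(tokens)
    counts = Counter()
    for tok in tokens:
        word = tok.lower()
        if word in supplementary:
            counts[0] += 1          # band 0 holds supplementary items
        elif word in band_of:
            counts[band_of[word]] += 1
        # anything else is off-list and never counted as covered

    coverage = {}
    running = counts[0]
    for band in sorted(b for b in counts if b > 0):
        running += counts[band]
        coverage[band] = 100.0 * running / total
    return coverage


def bands_needed(coverage, target):
    """Smallest band whose cumulative coverage reaches `target` percent."""
    for band in sorted(coverage):
        if coverage[band] >= target:
            return band
    return None


# Toy data, invented purely for illustration.
tokens = "The minister told London reporters the economy will recover".split()
band_of = {"the": 1, "told": 1, "will": 1,
           "minister": 2, "reporters": 2, "economy": 2,
           "recover": 3}
supplementary = frozenset({"london"})

coverage = cumulative_coverage(tokens, band_of, supplementary)
needed_95 = bands_needed(coverage, 95.0)
```

On this toy sentence, the first two bands plus the one supplementary item cover 8 of 9 tokens, so a 95% target is only reached once the third band is included.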
Artificial intelligence, Social Sciences, Vocabulary, News on the Web, Language and Linguistics, Vocabulary Acquisition in Second Language Learning, Sociology, Vocabulary Acquisition, Developmental and Educational Psychology, Psychology, Psychiatry, COCA, Corpus linguistics, Natural language processing, Media studies, vocabulary profile, Linguistics, BNC, Statistical Machine Translation and Natural Language Processing, Computer science, Newspaper, BF1-990, Lexicography and Dictionary Development, FOS: Sociology, FOS: Philosophy, ethics and religion, FOS: Psychology, Philosophy, Noun, Physical Sciences, FOS: Languages and literature, Arts and Humanities, lexical coverage
| selected citations | 12 |
| popularity | Top 10% |
| influence | Average |
| impulse | Top 10% |
