Downloads provided by UsageCounts
Acronym Disambiguation (AD) is crucial for natural language understanding on various sources, including biomedical reports, scientific papers, and search engine queries. However, existing acronym disambiguation benchmarks and tools are limited to specific domains, and the size of prior benchmarks is rather small. To accelerate the research on acronym disambiguation, we construct a new benchmark named GLADIS with three components: (1) a much larger acronym dictionary with 1.5M acronyms and 6.4M long forms; (2) a pre-training corpus with 160 million sentences; (3) three datasets that cover the general, scientific, and biomedical domains. We then pre-train a language model, \emph{AcroBERT}, on our constructed corpus for general acronym disambiguation, and show the challenges and values of our new benchmark.
Long paper at EACL 23
FOS: Computer and information sciences, Computer Science - Computation and Language, [INFO.INFO-CL] Computer Science [cs]/Computation and Language [cs.CL], Acronym Disambiguation, Entity Linking, [INFO] Computer Science [cs], Benchmark, Computation and Language (cs.CL)
FOS: Computer and information sciences, Computer Science - Computation and Language, [INFO.INFO-CL] Computer Science [cs]/Computation and Language [cs.CL], Acronym Disambiguation, Entity Linking, [INFO] Computer Science [cs], Benchmark, Computation and Language (cs.CL)
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 1 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
| views | 236 | |
| downloads | 54 |

Views provided by UsageCounts
Downloads provided by UsageCounts