Effective Unsupervised Author Disambiguation with Relative Frequencies

Conference object, Preprint English OPEN
Backes, Tobias;
(2018)
  • Related identifiers: doi: 10.1145/3197026.3197036
  • Subject: Agglomerative Clustering | Computer Science - Computation and Language | Statistics - Machine Learning | Computer Science - Machine Learning | Author Disambiguation | Probabilities | Computer Science - Information Retrieval

This work addresses the problem of author name homonymy in the Web of Science. Aiming for an efficient, simple and straightforward solution, we introduce a novel probabilistic similarity measure for author name disambiguation based on feature overlap. Using the research... View more
  • References (16)
    16 references, page 1 of 2

    [1] Aron Culotta, Pallika Kanani, Robert Hall, Michael Wick, and Andrew McCallum. 2007. Author disambiguation using error-driven machine learning with a ranking loss function. In Sixth International Workshop on Information Integration on the Web (IIWeb-07), Vancouver, Canada.

    [2] Anderson A Ferreira, Marcos André Gonçalves, and Alberto HF Laender. 2012. A brief survey of automatic methods for author name disambiguation. Acm Sigmod Record 41, 2 (2012), 15-26.

    [3] Anderson A Ferreira, Adriano Veloso, Marcos André Gonçalves, and Alberto HF Laender. 2010. Efective self-training author name disambiguation in scholarly digital libraries. In Proceedings of the 10th annual joint conference on Digital libraries. ACM, 39-48.

    [4] Thomas Gurney, Edwin Horlings, and Peter Van Den Besselaar. 2012. Author disambiguation using multi-aspect similarity indicators. Scientometrics 91, 2 (2012), 435-449.

    [5] Hui Han, Wei Xu, Hongyuan Zha, and C Lee Giles. 2005. A hierarchical naive Bayes mixture model for name disambiguation in author citations. In Proceedings of the 2005 ACM symposium on Applied computing. ACM, 1065-1069.

    [6] Anne-Wil Harzing. 2015. Health warning: might contain multiple personalities - the problem of homonyms in Thomson Reuters Essential Science Indicators. Scientometrics 105, 3 (2015), 2259-2270.

    [7] T. Kramer, F. Momeni, and P. Mayr. 2017. Coverage of Author Identifiers in Web of Science and Scopus. ArXiv e-prints (March 2017). arXiv:cs.DL/1703.01319

    [8] Michael Levin, Stefan Krawczyk, Steven Bethard, and Dan Jurafsky. 2012. Citationbased bootstrapping for large-scale author disambiguation. Journal of the American Society for Information Science and Technology 63, 5 (2012), 1030-1047.

    [9] Staša Milojević. 2013. Accuracy of simple, initials-based methods for author name disambiguation. Journal of Informetrics 7, 4 (2013), 767-773.

    [10] Alan Filipe Santana, Marcos André Gonçalves, Alberto HF Laender, and Anderson A Ferreira. 2017. Incremental author name disambiguation by exploiting domain-specific heuristics. Journal of the Association for Information Science and Technology 68, 4 (2017), 931-945.

  • Metrics
Share - Bookmark