Using Deep Learning for Title-Based Semantic Subject Indexing to Reach Competitive Performance to Full-Text

Conference object, Preprint English OPEN
Mai, Florian; Galke, Lukas; Scherp, Ansgar;
  • Publisher: ACM
  • Related identifiers: doi: 10.1145/3197026.3197039
  • Subject: Deep Learning | Digital Libraries | Computer Science - Digital Libraries | Text Classification

For (semi-)automated subject indexing systems in digital libraries, it is often more practical to use metadata such as the title of a publication instead of the full-text or the abstract. Therefore, it is desirable to have good text mining and text classification algori... View more
  • References (44)
    44 references, page 1 of 5

    [1] Damien Brain and G Webb. 1999. On the efect of data set size on bias and variance in classification learning. In Australian Knowledge Acquisition Workshop, AI'99. 117-128.

    [2] Junyoung Chung, Caglar Gulcehre, KyungHyun Cho, and Yoshua Bengio. 2014. Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv:1412.3555 (2014).

    [3] Alexis Conneau, Holger Schwenk, Loïc Barrault, and Yann Lecun. 2017. Very deep convolutional networks for text classification. In ECACL, Vol. 1. 1107-1116.

    [4] Yarin Gal and Zoubin Ghahramani. 2016. A Theoretically Grounded Application of Dropout in Recurrent Neural Networks. In NIPS. 1019-1027.

    [5] Lukas Galke, Florian Mai, Alan Schelten, Dennis Brunsch, and Ansgar Scherp. 2017. Using Titles vs. Full-text as Source for Automated Semantic Document Annotation. In K-CAP. 9.

    [6] Lukas Galke, Ahmed Saleh, and Ansgar Scherp. 2017. Word Embeddings for Practical Information Retrieval. In INFORMATIK. 2155-2167.

    [7] Edouard Grave, Tomas Mikolov, Armand Joulin, and Piotr Bojanowski. 2017. Bag of Tricks for Eficient Text Classification. In EACL. 427-431.

    [8] Klaus Gref, Rupesh K Srivastava, Jan Koutník, Bas R Steunebrink, and Jürgen Schmidhuber. 2017. LSTM: A search space odyssey. IEEE TNNLS (2017).

    [9] Gregor Große-Bölting, Chifumi Nishioka, and Ansgar Scherp. 2015. A comparison of diferent strategies for automated semantic document annotation. In K-CAP. ACM, 8.

    [10] Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In CVPR. 770-778.

  • Related Research Results (1)
  • Related Organizations (1)
  • Metrics
Share - Bookmark