64 Research products, page 1 of 7

  • 2017-2021
  • Open Access
  • Conference object
  • BG
  • Digital Humanities and Cultural Heritage

  • Open Access
    Authors: 
    Ming Jiang; Junjie Hu; Qiuyuan Huang; Lei Zhang; Jana Diesner; Jianfeng Gao;
    Publisher: Association for Computational Linguistics

    Popular metrics used for evaluating image captioning systems, such as BLEU and CIDEr, provide a single score to gauge the system’s overall effectiveness. This score is often not informative enough to indicate what specific errors are made by a given system. In this stud...

  • Open Access
    Authors: 
    Svetla Boytcheva; Boris Velichkov; Gerasim Velchev; Ivan Koychev;
    Publisher: IEEE

    We propose methods for automatic generation of corpora that contain descriptions of diagnoses in Bulgarian and their associated codes in ICD-10-CM (International Classification of Diseases, 10th revision, Clinical Modification). The proposed approach is based on the av...

  • Open Access
    Authors: 
    Boris Velichkov; Sylvia Vassileva; Simeon Gerginov; Boris Kraychev; Ivaylo Ivanov; Philip Ivanov; Ivan Koychev; Svetla Boytcheva;
    Publisher: INCOMA Ltd. Shoumen, BULGARIA
    Country: Bulgaria

    The task of automatic diagnosis encoding into standard medical classifications and ontologies is of great importance in medicine - both to support the daily tasks of physicians in the preparation and reporting of clinical documentation, and for automatic processing of ...

  • Open Access
    Authors: 
    Daniel Kopev; Atanas Atanasov; Dimitrina Zlatkova; Momchil Hardalov; Ivan Koychev; Ivelina Nikolova; Galia Angelova;
    Publisher: Association for Computational Linguistics

    We present the system built for SemEval-2018 Task 2 on Emoji Prediction. Although Twitter messages are very short, we managed to design a wide variety of features: textual, semantic, sentiment, emotion-, and color-related ones. We investigated different methods of text p...

  • Publication . Preprint . Part of book or chapter of book . Conference object . 2020
    Open Access
    Authors: 
    Alberto Barrón-Cedeño; Tamer Elsayed; Preslav Nakov; Giovanni Da San Martino; Maram Hasanain; Reem Suwaileh; Fatima Haouari; Nikolay Babulkov; Bayan Hamdan; Alex Nikolov; +2 more
    Publisher: Springer International Publishing
    Country: Italy

    We present an overview of the third edition of the CheckThat! Lab at CLEF 2020. The lab featured five tasks in two different languages: English and Arabic. The first four tasks compose the full pipeline of claim verification in social media: Task 1 on check-worthiness e...

  • Open Access English
    Authors: 
    Agata Savary; Carlos Ramisch; Silvio Cordeiro; Federico Sangati; Veronika Vincze; Behrang Qasemizadeh; Marie Candito; Fabienne Cap; Voula Giouli; Ivelina Stoyanova; +1 more
    Publisher: HAL CCSD
    Countries: France, Sweden
    Project: ANR | PARSEME-FR (ANR-14-CERA-0001)

    International audience; Multiword expressions (MWEs) are known as a "pain in the neck" for NLP due to their idiosyncratic behaviour. While some categories of MWEs have been addressed by many studies, verbal MWEs (VMWEs), such as to take a decision, to break one's heart ...

  • Publication . Preprint . Conference object . 2019
    Open Access
    Authors: 
    Yoan Dinkov; Ivan Koychev; Preslav Nakov;
    Publisher: Incoma Ltd., Shoumen, Bulgaria

    Online media aim to reach ever bigger audiences and to attract ever longer attention spans. This competition creates an environment that rewards sensational, fake, and toxic news. To help limit their spread and impact, we propose and develop a news toxicity detect...

  • Open Access
    Authors: 
    Kiril Simov; Svetla Boytcheva; Petya Osenova;
    Publisher: Incoma Ltd. Shoumen, Bulgaria

    Word vectors with varying dimensionalities and produced by different algorithms have been extensively used in NLP. The corpora that the algorithms are trained on can contain either natural language text (e.g. Wikipedia or newswire articles) or artificially-generated pse...

  • Open Access English
    Authors: 
    Krasimira Bozhanova; Yoan Dinkov; Ivan Koychev; Maria Castaldo; Tommaso Venturini; Preslav Nakov;

    We propose a novel framework for predicting the factuality of reporting of news media outlets by studying the user attention cycles in their YouTube channels. In particular, we design a rich set of features derived from the temporal evolution of the number of views, lik...

  • Open Access
    Authors: 
    Ming Jiang; Qiuyuan Huang; Lei Zhang; Xin Wang; Pengchuan Zhang; Zhe Gan; Jana Diesner; Jianfeng Gao;
    Publisher: Association for Computational Linguistics

    This paper presents a new metric called TIGEr for the automatic evaluation of image captioning systems. Popular metrics, such as BLEU and CIDEr, are based solely on text matching between reference captions and machine-generated captions, potentially leading to biased ev...