Filters (5)
Download Results
19 research outcomes, page 1 of 2
  • publication . Article . 2016
    Open Access English
    Authors:
    Piotr Przybyła; Matthew Shardlow; Sophie Aubin; Robert Bossy; Richard Eckart de Castilho; Stelios Piperidis; John McNaught; Sophia Ananiadou;
    Persistent Identifiers
    Project: UKRI | Enriching Metabolic PATHw... (BB/M006891/1), EC | OpenMinTeD (654021)

    Text mining is a powerful technology for quickly distilling key information from vast quantities of biomedical literature. However, to harness this power the researcher must be well versed in the availability, suitability, adaptability, interoperability and comparative ...

    Add to ORCIDorcid
  • publication . Conference object . 2016
    Open Access English
    Authors:
    Zhao, Zhiming; Martin, Paul; de Laat, Cees; Jones, Andrew; Taylor, Ian; Hardisty, Alex; Atkinson, Malcolm; Jeffery, Keith; Zuiderwijk-van Eijk, Anneke; Yin, Yi; ...
    Publisher: Zenodo
    Project: EC | VRE4EIC (676247), EC | SWITCH (643963), EC | ENVRI PLUS (654182)

    Data-centric approaches play an increasing role in many scientific domains, but in turn rely increasingly heavily on advanced research support environments for coordinating research activities, providing access to research data, and choreographing complex experiments. C...

  • other research product . Other ORP type . 2012
    Open Access English
    Authors:
    Zhang, J.; Wilson, M.L.; Russell-Rose, T; Larsen, B; Kalbach, J;

    People with complex information needs are for example Humanities researchers, who need advanced search engines to investigate their research questions. Much can be gained by combining research datasets, reusing tools and serendipitously discovering new insights for furt...

  • publication . Conference object . 2018
    English
    Authors:
    Magalie Ochs; Philippe Blache; Montcheuil, G.; Pergandi, J.; Roxane Bertrand; Saubesty, J.; Francon, D.; Mestre, D.;
    Publisher: HAL CCSD
    Project: ANR | Amidex (ANR-11-IDEX-0001), ANR | BLRI (ANR-11-LABX-0036), ANR | ILCB (ANR-16-CONV-0002)

    International audience; The paper aims at presenting the Acorformed corpus composed of human-human and human-machine interactions in French in the specific context of training doctors to break bad news to patients. In the context of human-human interaction, an audiovisu...

  • publication . Preprint . 2020
    Open Access English
    Authors:
    Kocmi, Tom; Limisiewicz, Tomasz; Stanovsky, Gabriel;
    Project: EC | Bergamot (825303)

    Gender bias in machine translation can manifest when choosing gender inflections based on spurious gender correlations. For example, always translating doctors as men and nurses as women. This can be particularly harmful as models become more popular and deployed within...

  • publication . Part of book or chapter of book . 2016
    Open Access English
    Authors:
    Bouillon, Pierrette; Spechbach, Hervé;
    Project: EC | EXPERT (317471), EC | CRACKER (645357), EC | IMTRAP (299251), EC | MMT (645487), EC | TraMOOC (644333), EC | HimL (644402)

    BabelDr (http://babeldr.unige.ch/) is a joint project of Geneva's Faculty of Translation and Interpretation (FTI) and University Hospitals (HUG), that has been active since July 2015. The goal is to develop methods that allow rapid prototyping of medium-vocabulary web-e...

  • publication . Article . 2011
    Open Access Spanish; Castilian
    Authors:
    Silvia Arano; Gemma Martínez; Marina Losada; Marta Villegas; Anna Casaldàliga; Núria Bel;
    Persistent Identifiers
    Publisher: Consejo Superior de Investigaciones Científicas
    Project: EC | CLARIN (212230)

    El artículo presenta una primera aproximación a la publicación en acceso abierto de datos resultantes de la investigación en el área de humanidades. Describe el estudio de caso implementado en el repositorio institucional de la Universitat Pompeu Fabra, a partir de la c...

    Add to ORCIDorcid
  • publication . Conference object . Article . 2012
    Open Access English
    Authors:
    Villegas, M.; Bel, N.; Gonzalo, C.; Moreno, A.; Nuria Simelio;
    Publisher: ACL (Association for Computational Linguistics)
    Project: EC | CLARIN (212230)

    Comunicació presentada a: Eighth International Conference on Language Resources and Evaluation, celebrada a Istanbul, Turkey, del 21 al 27 de maig de 2012.

  • publication . Article . 2014
    Open Access English
    Authors:
    Tressy Arts; Yonatan Belinkov; Nizar Habash; Adam Kilgarriff; Vít Suchomel;
    Persistent Identifiers
    Publisher: Elsevier

    AbstractWe present arTenTen, a web-crawled corpus of Arabic, gathered in 2012. arTenTen consists of 5.8-billion words. A chunk of it has been lemmatized and part-of-speech (POS) tagged with the MADA tool and subsequently loaded into Sketch Engine, a leading corpus query...

    Add to ORCIDorcid
  • publication . Conference object . 2016
    Open Access
    Authors:
    Otegi, A.; Aranberri, N.; Branco, A.; Hajič, J.; Neale, S.; Osenova, P.; Pereira, R.; Popel, M.; Silva, J.; Simov, K.; ...
    Project: EC | QTLEAP (610516)

    This work presents parallel corpora automatically annotated with several NLP tools, including lemma and part of-speech tagging, named-entity recognition and classification, named-entity disambiguation, word-sense disambiguation, and coreference. The corpora comprise bot...

    Add to ORCIDorcid
19 research outcomes, page 1 of 2