Filters (5)
Download Results
18 research outcomes, page 1 of 2
  • publication . Article . 2016
    Open Access English
    Authors:
    Piotr Przybyła; Matthew Shardlow; Sophie Aubin; Robert Bossy; Richard Eckart de Castilho; Stelios Piperidis; John McNaught; Sophia Ananiadou;
    Persistent Identifiers
    Project: UKRI | Enriching Metabolic PATHw... (BB/M006891/1), EC | OpenMinTeD (654021)

    Text mining is a powerful technology for quickly distilling key information from vast quantities of biomedical literature. However, to harness this power the researcher must be well versed in the availability, suitability, adaptability, interoperability and comparative ...

    Add to ORCIDorcid
  • publication . Conference object . 2016
    Open Access English
    Authors:
    Zhao, Zhiming; Martin, Paul; de Laat, Cees; Jones, Andrew; Taylor, Ian; Hardisty, Alex; Atkinson, Malcolm; Jeffery, Keith; Zuiderwijk-van Eijk, Anneke; Yin, Yi; ...
    Publisher: Zenodo
    Project: EC | VRE4EIC (676247), EC | SWITCH (643963), EC | ENVRI PLUS (654182)

    Data-centric approaches play an increasing role in many scientific domains, but in turn rely increasingly heavily on advanced research support environments for coordinating research activities, providing access to research data, and choreographing complex experiments. C...

  • other research product . Other ORP type . 2012
    Open Access English
    Authors:
    Zhang, J.; Wilson, M.L.; Russell-Rose, T; Larsen, B; Kalbach, J;

    People with complex information needs are for example Humanities researchers, who need advanced search engines to investigate their research questions. Much can be gained by combining research datasets, reusing tools and serendipitously discovering new insights for furt...

  • publication . Conference object . 2018
    English
    Authors:
    Magalie Ochs; Philippe Blache; Montcheuil, G.; Pergandi, J.; Roxane Bertrand; Saubesty, J.; Francon, D.; Mestre, D.;
    Publisher: HAL CCSD
    Project: ANR | Amidex (ANR-11-IDEX-0001), ANR | BLRI (ANR-11-LABX-0036), ANR | ILCB (ANR-16-CONV-0002)

    International audience; The paper aims at presenting the Acorformed corpus composed of human-human and human-machine interactions in French in the specific context of training doctors to break bad news to patients. In the context of human-human interaction, an audiovisu...

  • publication . Preprint . 2020
    Open Access English
    Authors:
    Kocmi, Tom; Limisiewicz, Tomasz; Stanovsky, Gabriel;
    Project: EC | Bergamot (825303)

    Gender bias in machine translation can manifest when choosing gender inflections based on spurious gender correlations. For example, always translating doctors as men and nurses as women. This can be particularly harmful as models become more popular and deployed within...

  • publication . Part of book or chapter of book . 2016
    Open Access English
    Authors:
    Bouillon, Pierrette; Spechbach, Hervé;
    Project: EC | EXPERT (317471), EC | CRACKER (645357), EC | IMTRAP (299251), EC | MMT (645487), EC | TraMOOC (644333), EC | HimL (644402)

    BabelDr (http://babeldr.unige.ch/) is a joint project of Geneva's Faculty of Translation and Interpretation (FTI) and University Hospitals (HUG), that has been active since July 2015. The goal is to develop methods that allow rapid prototyping of medium-vocabulary web-e...

  • publication . Article . 2014
    Open Access English
    Authors:
    Tressy Arts; Yonatan Belinkov; Nizar Habash; Adam Kilgarriff; Vít Suchomel;
    Persistent Identifiers
    Publisher: Elsevier

    AbstractWe present arTenTen, a web-crawled corpus of Arabic, gathered in 2012. arTenTen consists of 5.8-billion words. A chunk of it has been lemmatized and part-of-speech (POS) tagged with the MADA tool and subsequently loaded into Sketch Engine, a leading corpus query...

    Add to ORCIDorcid
  • publication . Part of book or chapter of book . Conference object . 2016
    Open Access
    Authors:
    Rinke Hoekstra; Albert Meroño-Peñuela; Kathrin Dentler; Auke Rijpma; Richard L. Zijdeman; Ivo Zandhuis;
    Persistent Identifiers
    Publisher: Springer International Publishing

    The main promise of the digital humanities is the ability to perform scholarly studies at a much broader scale, and in a much more reusable fashion. The key enabler for such studies is the availability of sufficiently well described data. For the field of socio-economic...

    Add to ORCIDorcid
  • publication . Conference object . Other literature type . 2016
    Open Access English
    Authors:
    Simon Clematide; Frick, Karina; Aepli, Noëmi; Goldman, Jean-Philippe;
    Persistent Identifiers
    Publisher: Sprachwissenschaftliches Institut, Ruhr-Universität Bochum

    In this paper, we systematically analyze writing variations of Swiss German in two existing corpora with standard German glosses, a corpus of 10,000 short text messages and a corpus of transcribed oral history recordings (90,000 tokens). We show that neither resource is...

    Add to ORCIDorcid
  • publication . Contribution for newspaper or weekly magazine . Conference object . Preprint . 2017
    Open Access English
    Authors:
    Jindřich Libovický; Jindřich Helcl;
    Project: EC | QT21 (645452)

    Comment: 7 pages; Accepted to ACL 2017

    Add to ORCIDorcid
18 research outcomes, page 1 of 2