15 research outcomes, page 1 of 2
  • publication . Article . 2016
    Open Access English
    Authors:
    Piotr Przybyła; Matthew Shardlow; Sophie Aubin; Robert Bossy; Richard Eckart de Castilho; Stelios Piperidis; John McNaught; Sophia Ananiadou;
    Publisher: Oxford University Press
    Project: EC | OpenMinTeD (654021), UKRI | Enriching Metabolic PATHw... (BB/M006891/1)

    Text mining is a powerful technology for quickly distilling key information from vast quantities of biomedical literature. However, to harness this power the researcher must be well versed in the availability, suitability, adaptability, interoperability and comparative ...

  • publication . Article . 2019
    Open Access English
    Authors:
    Bavel, Bas J. P.; Curtis, Daniel R.; Hannaford, Matthew J.; Moatsos, Michail; Roosen, Joris; Soens, Tim;
    Project: EC | COORDINATINGFORLIFE (339647), NWO | CLARIAH Common Lab Resear... (2300184354)

    Recent literature investigating long‐term interactions between climate and society increasingly utilizes historical big data. Too often this is done without applying historical criticism, which may lead to misguided narratives. We propose a set of interventions to avoid...

  • publication . Conference object . 2014
    Open Access English
    Authors:
    Aleksandr Chuklin; Ke Zhou; Anne Schuth; Floor Sietsma; Maarten de Rijke;
    Publisher: ACM
    Project: NWO | Digging archaeology data:... (2300186891), EC | VOX-POL (312827), NWO | Modeling and Learning fro... (2300171779), NWO | SPuDisc: Searching Public... (2300176811), NWO | Semantic Search in E-Disc... (2300168486), EC | LIMOSINE (288024)

    Modeling user behavior on a search engine result page is important for understanding the users and supporting simulation experiments. As result pages become more complex, click models evolve as well in order to capture additional aspects of user behavior in response to ...

  • other research product . Other ORP type . 2012
    Open Access English
    Authors:
    Zhang, J.; Wilson, M.L.; Russell-Rose, T; Larsen, B; Kalbach, J;

    People with complex information needs are for example Humanities researchers, who need advanced search engines to investigate their research questions. Much can be gained by combining research datasets, reusing tools and serendipitously discovering new insights for furt...

  • publication . Preprint . 2020
    Open Access English
    Authors:
    Kocmi, Tom; Limisiewicz, Tomasz; Stanovsky, Gabriel;
    Project: EC | Bergamot (825303)

    Gender bias in machine translation can manifest when choosing gender inflections based on spurious gender correlations. For example, always translating doctors as men and nurses as women. This can be particularly harmful as models become more popular and deployed within...

  • publication . Article . Preprint . 2019
    Open Access
    Authors:
    Anastasios Lytos; Thomas Lagkas; Panagiotis Sarigiannidis; Kalina Bontcheva;
    Publisher: Elsevier BV

    Comment: Journal of Information Processing & Management, Elsevier - Accepted Version

  • publication . Article . 2014
    Open Access English
    Authors:
    Tressy Arts; Yonatan Belinkov; Nizar Habash; Adam Kilgarriff; Vít Suchomel;
    Publisher: Elsevier B.V.

    We present arTenTen, a web-crawled corpus of Arabic, gathered in 2012. arTenTen consists of 5.8 billion words. A chunk of it has been lemmatized and part-of-speech (POS) tagged with the MADA tool and subsequently loaded into Sketch Engine, a leading corpus query...

  • publication . Conference object . Article . Part of book or chapter of book . 2016
    Open Access English
    Authors:
    Hoekstra, R.; Meroño-Peñuela, A.; Dentler, K.; Rijpma, A.; Zijdeman, R.; Zandhuis, I.; Sack, H.; Rizzo, G.; Steinmetz, N.; Mladenić, D.; ...

    The main promise of the digital humanities is the ability to perform scholarly studies at a much broader scale, and in a much more reusable fashion. The key enabler for such studies is the availability of sufficiently well described data. For the field of socio-economic...

  • publication . Conference object . Contribution for newspaper or weekly magazine . Preprint . 2017
    Open Access
    Authors:
    Jindřich Libovický; Jindřich Helcl;
    Publisher: Association for Computational Linguistics
    Project: EC | QT21 (645452)

    Comment: 7 pages; Accepted to ACL 2017

  • publication . Project deliverable . Report . 2019
    Open Access
    Authors:
    Tahko, Tuuli; Zehavi, Ora; Lhotak, Martin; Romanova, Natasha; Clivaz, Claire; Ros, Salvador; Raciti, Marco;
    Project: EC | DESIR (731081), EC | Locus Ludi (741520)

    The DESIR project sets out to strengthen the sustainability of DARIAH and firmly establish it as a long-term leader and partner within arts and humanities communities. The project was designed to address six core infrastructural sustainability dimensions and one of thes...
