7 Research products, page 1 of 1
Loading
- Publication . Conference object . 2020FrenchAuthors:Martin, Louis; Muller, Benjamin; Javier Ortiz Suárez, Pedro; Dupont, Yoan; Romary, Laurent; Villemonte de la Clergerie, Eric; Sagot, Benoît; Seddah, Djamé;Martin, Louis; Muller, Benjamin; Javier Ortiz Suárez, Pedro; Dupont, Yoan; Romary, Laurent; Villemonte de la Clergerie, Eric; Sagot, Benoît; Seddah, Djamé;Publisher: HAL CCSDProject: ANR | PRAIRIE (ANR-19-P3IA-0001), ANR | SoSweet (ANR-15-CE38-0011), ANR | BASNUM (ANR-18-CE38-0003), ANR | PARSITI (ANR-16-CE33-0021)
Les modèles de langue neuronaux contextuels sont désormais omniprésents en traitement automatique des langues. Jusqu’à récemment, la plupart des modèles disponibles ont été entraînés soit sur des données en anglais, soit sur la concaténation de données dans plusieurs la...
- Publication . Conference object . 2020FrenchAuthors:Martin, Louis; Muller, Benjamin; Ortiz Suárez, Pedro Javier; Dupont, Yoan; Romary, Laurent; Villemonte de la Clergerie, Eric; Sagot, Benoît; Seddah, Djamé;Martin, Louis; Muller, Benjamin; Ortiz Suárez, Pedro Javier; Dupont, Yoan; Romary, Laurent; Villemonte de la Clergerie, Eric; Sagot, Benoît; Seddah, Djamé;Publisher: HAL CCSDCountry: FranceProject: ANR | BASNUM (ANR-18-CE38-0003), ANR | PARSITI (ANR-16-CE33-0021), ANR | PRAIRIE (ANR-19-P3IA-0001), ANR | SoSweet (ANR-15-CE38-0011)
National audience; Contextual word embeddings have become ubiquitous in Natural Language Processing. Until recently,most available models were trained on English data or on the concatenation of corpora in multiplelanguages. This made the practical use of models in all l...
- Publication . Conference object . Preprint . 2020Open AccessAuthors:Pedro Javier Ortiz Suárez; Laurent Romary; Benoît Sagot;Pedro Javier Ortiz Suárez; Laurent Romary; Benoît Sagot;Publisher: Association for Computational LinguisticsCountry: FranceProject: ANR | PRAIRIE (ANR-19-P3IA-0001), ANR | BASNUM (ANR-18-CE38-0003)
International audience; We use the multilingual OSCAR corpus, extracted from Common Crawl via language classification, filtering and cleaning, to train monolingual contextualized word embeddings (ELMo) for several mid-resource languages. We then compare the performance ...
add Add to ORCIDPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product. - Publication . Doctoral thesis . 2020EnglishAuthors:Khemakhem, Mohamed;Khemakhem, Mohamed;Publisher: HAL CCSDCountry: FranceProject: ANR | BASNUM (ANR-18-CE38-0003), EC | PARTHENOS (654119)
Dictionaries could be considered as the most comprehensive reservoir of human knowledge, which carry not only the lexical description of words in one or more languages, but also the common awareness of a certain communityabout every known piece of knowledge in a time fr...
- Publication . Part of book or chapter of book . 2020EnglishAuthors:Williams, Geoffrey; ioana, galleron; Stincone, Clarissa;Williams, Geoffrey; ioana, galleron; Stincone, Clarissa;Publisher: HAL CCSDCountry: FranceProject: ANR | BASNUM (ANR-18-CE38-0003)
International audience
- Publication . Preprint . Conference object . 2020Open Access EnglishAuthors:Ortiz Suárez, Pedro Javier; Dupont, Yoann; Muller, Benjamin; Romary, Laurent; Sagot, Benoît;Ortiz Suárez, Pedro Javier; Dupont, Yoann; Muller, Benjamin; Romary, Laurent; Sagot, Benoît;Country: FranceProject: ANR | PRAIRIE (ANR-19-P3IA-0001), ANR | BASNUM (ANR-18-CE38-0003)
Due to COVID19 pandemic, the 12th edition is cancelled. The LREC 2020 Proceedings are available at http://www.lrec-conf.org/proceedings/lrec2020/index.html; International audience; The French TreeBank developed at the University Paris 7 is the main source of morphosynta...
- Publication . Conference object . Preprint . 2020Open Access EnglishAuthors:Louis Martin; Benjamin Muller; Pedro Javier Ortiz Suárez; Yoann Dupont; Laurent Romary; Éric Villemonte de la Clergerie; Djamé Seddah; Benoît Sagot;Louis Martin; Benjamin Muller; Pedro Javier Ortiz Suárez; Yoann Dupont; Laurent Romary; Éric Villemonte de la Clergerie; Djamé Seddah; Benoît Sagot;Publisher: HAL CCSDCountry: FranceProject: ANR | PRAIRIE (ANR-19-P3IA-0001), ANR | SoSweet (ANR-15-CE38-0011), ANR | BASNUM (ANR-18-CE38-0003), ANR | PARSITI (ANR-16-CE33-0021)
Pretrained language models are now ubiquitous in Natural Language Processing. Despite their success, most available models have either been trained on English data or on the concatenation of data in multiple languages. This makes practical use of such models --in all la...
add Add to ORCIDPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.