software . 2020

Transformers: State-of-the-Art Natural Language Processing

Wolf, Thomas; Debut, Lysandre; Sanh, Victor; Chaumond, Julien; Delangue, Clement; Moi, Anthony; Cistac, Perric; Ma, Clara; Jernite, Yacine; Plu, Julien; ...
Open Access
  • Published: 01 Oct 2020
  • Publisher: Zenodo
Abstract
v4.11.0: GPT-J, Speech2Text2, FNet, Pipeline GPU utilization, dynamic model code loading

GPT-J

Three new models are released as part of the GPT-J implementation: GPTJModel, GPTJForCausalLM, and GPTJForSequenceClassification, in PyTorch. The GPT-J model was released in the kingoflolz/mesh-transformer-jax repository by Ben Wang and Aran Komatsuzaki. It is a GPT-2-like causal language model trained on the Pile dataset. It was contributed by @StellaAthena, @kurumuz, @EricHallahan, and @leogao2.

  • GPT-J-6B #13022 (@StellaAthena)

Compatible checkpoints can be found on the Hub: https://huggingface.co/models?filter=gptj

SpeechEncoderDecoder & Speech2Text2

One new model is...
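As a minimal sketch of how the GPT-J classes named in the abstract might be used, the snippet below builds a very small, randomly initialized GPTJForCausalLM and runs one forward pass. The hyperparameter values are assumptions chosen only to keep the example fast; they do not correspond to GPT-J-6B or to any checkpoint on the Hub, and no weights are downloaded.

```python
import torch
from transformers import GPTJConfig, GPTJForCausalLM

# Tiny, hypothetical hyperparameters (assumptions for illustration only);
# they do not match any released GPT-J checkpoint.
config = GPTJConfig(
    vocab_size=100,   # size of the toy vocabulary
    n_positions=32,   # maximum sequence length
    n_embd=32,        # hidden size
    n_layer=2,        # number of transformer blocks
    n_head=4,         # attention heads (head dimension = 32 / 4 = 8)
    rotary_dim=4,     # rotary embedding dims, must not exceed the head dimension
)

# Randomly initialized model: nothing is fetched from the Hub.
model = GPTJForCausalLM(config)
model.eval()

# A batch of one sequence with 8 random token ids.
input_ids = torch.randint(0, config.vocab_size, (1, 8))

with torch.no_grad():
    logits = model(input_ids).logits

# One score per vocabulary entry for each position in the sequence.
print(logits.shape)
```

To work with a released checkpoint instead, the usual route would be `GPTJForCausalLM.from_pretrained(...)` with one of the compatible model identifiers listed on the Hub page linked in the abstract; that path downloads the weights, which are large for the 6B model.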
Open Access
Zenodo
Software . 2020
Provider: Datacite