Understanding BLOOM: An empirical study on diverse NLP tasks

descriptionPublicationkeyboard_double_arrow_right Article , Preprint 01 Jan 2022Embargo end date: 01 Jan 2022Publisher:arXivJournal:CoRR, volume abs/2211.14865

Authors: Parag Pravin Dakle; SaiKrishna Rallabandi; Preethi Raghavan;

doi: 10.48550/arxiv.2211.14865

arXiv: 2211.14865

Understanding BLOOM: An empirical study on diverse NLP tasks

- Summary
- Subjects
- Related research
  (8)
- Metrics

Abstract

We view the landscape of large language models (LLMs) through the lens of the recently released BLOOM model to understand the performance of BLOOM and other decoder-only LLMs compared to BERT-style encoder-only models. We achieve this by evaluating the smaller BLOOM model variants (\textit{350m/560m} and \textit{1b3/1b7}) on several NLP benchmark datasets and popular leaderboards. We make the following observations: (1) BLOOM performance does not scale with parameter size, unlike other LLMs like GPT and BERT. Experiments fine-tuning BLOOM models show that the 560m variant performs similarly to or better than the 1b7 variant, (2) Zero-shot cross-lingual and multi-lingual fine-tuning experiments show that BLOOM is at par or worse than monolingual GPT-2 models, and (3) Toxicity analysis of prompt-based text generation using the RealToxicityPrompts dataset shows that the text generated by BLOOM is at least 17\% less toxic than GPT-2 and GPT-3 models.

Keywords

FOS: Computer and information sciences, Computer Science - Computation and Language, Computation and Language (cs.CL)

8 Research products, page 1 of 1

Compare Encoder-Decoder, Encoder-Only, and Decoder-Only Architectures for Text Generation on Low-Resource Datasets
2021IsAmongTopNSimilarDocuments
Denoising based Sequence-to-Sequence Pre-training for Text Generation
2019IsAmongTopNSimilarDocuments
Source Coding With Encoder Side Information
2004IsAmongTopNSimilarDocuments
Source Coding With Distortion Side Information
2008IsAmongTopNSimilarDocuments
LightSeq2: Accelerated Training for Transformer-Based Models on GPUs
2022IsAmongTopNSimilarDocuments
GPT2sQA software on GitHub
IsRelatedTo
sagemaker-python-sdk software on GitHub
IsRelatedTo
detoxify software on GitHub
IsRelatedTo

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

0

Average

Green

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering