Noisy Text Data: Achilles’ Heel of BERT

Name: Noisy Text Data: Achilles’ Heel of BERT
Keywords: FOS: Computer and information sciences, Computer Science - Computation and Language, Computation and Language (cs.CL)

Ankit Kumar; Piyush Makhija; Anuj Gupta

Found an issue? Give us feedback

https://www.aclweb.o...arrow_drop_down

https://www.aclweb.org/antholo...

Article

License: CC BY

Data sources: UnpayWall

arXiv.org e-Print Archive

Preprint . 2020

Data sources: arXiv.org e-Print Archive

https://doi.org/10.18653/v1/20...

Article . 2020 . Peer-reviewed

Data sources: Crossref

https://dx.doi.org/10.48550/ar...

Article . 2020

License: CC BY SA

Data sources: Datacite

DBLP

Conference object

Data sources: DBLP

https://dx.doi.org/10.18653/v1...

Article

Data sources: Microsoft Academic Graph

Noisy Text Data: Achilles’ Heel of BERT

descriptionPublicationkeyboard_double_arrow_right Article , Preprint , Conference object 01 Jan 2020Embargo end date: 01 Jan 2020Publisher:Association for Computational Linguistics (ACL)Journal:Proceedings of the Sixth Workshop on Noisy User-generated Text (W-NUT 2020)

Authors: Ankit Kumar; Piyush Makhija; Anuj Gupta;

doi: 10.18653/v1/2020.wnut-1.3 , 10.48550/arxiv.2003.12932

arXiv: 2003.12932

Noisy Text Data: Achilles’ Heel of BERT

- Summary
- Subjects
- Related research
  (2)
- Metrics

Abstract

Owing to the phenomenal success of BERT on various NLP tasks and benchmark datasets, industry practitioners are actively experimenting with fine-tuning BERT to build NLP applications for solving industry use cases. For most datasets that are used by practitioners to build industrial NLP applications, it is hard to guarantee absence of any noise in the data. While BERT has performed exceedingly well for transferring the learnings from one use case to another, it remains unclear how BERT performs when fine-tuned on noisy text. In this work, we explore the sensitivity of BERT to noise in the data. We work with most commonly occurring noise (spelling mistakes, typos) and show that this results in significant degradation in the performance of BERT. We present experimental results to show that BERT's performance on fundamental NLP tasks like sentiment analysis and textual similarity drops significantly in the presence of (simulated) noise on benchmark datasets viz. IMDB Movie Review, STS-B, SST-2. Further, we identify shortcomings in the existing BERT pipeline that are responsible for this drop in performance. Our findings suggest that practitioners need to be vary of presence of noise in their datasets while fine-tuning BERT to solve industry use cases.

7 pages, 2 tables, 1 plot

Related Organizations

Salesforce.com
Australia
International Institute of Information Technology, Hyderabad
India

Keywords

FOS: Computer and information sciences, Computer Science - Computation and Language, Computation and Language (cs.CL)

2 Research products, page 1 of 1

transformers software on GitHub
IsRelatedTo
bert software on GitHub
IsRelatedTo

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	33
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%

Found an issue? Give us feedback

33

Top 10%

Green

hybrid

Noisy Text Data: Achilles’ Heel of BERT

Noisy Text Data: Achilles’ Heel of BERT

2 Research products, page 1 of 1

transformers software on GitHub

bert software on GitHub