Selective Word Substitution for Contextualized Data Augmentation

Name: Selective Word Substitution for Contextualized Data Augmentation
Keywords: Word substitution, Binary classification, Contextual data augmentation

Pantelidou, Kyriaki; Chatzakou, Despoina; Tsikrika, Theodora; Vrochidis, Stefanos; Kompatsiaris, Ioannis

ZENODO

Article . 2022

License: CC BY

Data sources: Datacite

https://doi.org/10.1007/978-3-...

Part of book or chapter of book . 2022 . Peer-reviewed

License: Springer TDM

Data sources: Crossref

http://dx.doi.org/10.1007/978-...

Part of book or chapter of book

License: Springer TDM

Full-Text: https://link.springer.com/content/pdf/10.1007/978-3-031-08473-7

Data sources: Sygma

http://dx.doi.org/10.5281/zeno...

Conference object . 2022

Data sources: European Union Open Data Portal

http://dx.doi.org/10.1007/978-...

Part of book or chapter of book . 2022

Data sources: European Union Open Data Portal

Publication: Part of book or chapter of book, Article, Conference object . 01 Jan 2022 . English . Publisher: Springer International Publishing . Funded by: EC | CONNEXIONs, EC | CREST, EC | STARLIGHT

doi: 10.1007/978-3-031-08473-7_47 , 10.5281/zenodo.6531616 , 10.5281/zenodo.6531617

Abstract

Deep learning models for NLP tasks typically need large amounts of training data to perform well, and such data is often unavailable, which has motivated the exploration of data augmentation techniques. Originally, such techniques mainly focused on rule-based methods (e.g. random insertion/deletion of words) or synonym replacement with the help of lexicons. More recently, model-based techniques that use non-contextual (e.g. Word2Vec, GloVe) or contextual (e.g. BERT) embeddings seem to be gaining ground as a more effective way of word replacement. For BERT in particular, which has been employed successfully in various NLP tasks, data augmentation is typically performed with a masking approach: an arbitrary number of word positions is selected, and the words at those positions are replaced with others of the same meaning. Since the words selected for substitution are bound to affect the final outcome, this work examines different ways of selecting the words to be replaced by emphasizing different parts of a sentence, namely specific parts of speech or words that carry more sentiment information. Our goal is to study how the choice of words substituted during data augmentation affects the final performance of a classification model. Evaluation experiments on binary classification tasks over two benchmark datasets indicate improved effectiveness over state-of-the-art baselines.
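The selection step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the toy sentiment lexicon, the toy part-of-speech table, and the `toy_fill` function are hypothetical stand-ins, and `toy_fill` only marks the place where a masked language model such as BERT would predict a context-appropriate replacement.

```python
from typing import Callable, Dict, List

MASK = "[MASK]"

# Hypothetical toy resources; a real setup would use a POS tagger and a sentiment lexicon.
TOY_SENTIMENT: Dict[str, float] = {"great": 0.9, "boring": -0.8, "good": 0.6}
TOY_POS: Dict[str, str] = {
    "the": "DET", "movie": "NOUN", "plot": "NOUN",
    "was": "VERB", "but": "CCONJ", "great": "ADJ", "boring": "ADJ",
}


def select_by_pos(tokens: List[str], target_pos: str) -> List[int]:
    """Return the positions of tokens whose (toy) POS tag equals target_pos."""
    return [i for i, t in enumerate(tokens) if TOY_POS.get(t.lower()) == target_pos]


def select_by_sentiment(tokens: List[str], k: int) -> List[int]:
    """Return the positions of the k tokens with the largest absolute sentiment score."""
    scored = [(abs(TOY_SENTIMENT.get(t.lower(), 0.0)), i) for i, t in enumerate(tokens)]
    scored = [(s, i) for s, i in scored if s > 0]
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return sorted(i for _, i in scored[:k])


def augment(tokens: List[str], positions: List[int],
            fill: Callable[[List[str], int], str]) -> List[str]:
    """Mask each selected position in turn and replace it with the filler's output."""
    out = list(tokens)
    for i in positions:
        masked = out[:i] + [MASK] + out[i + 1:]
        out[i] = fill(masked, i)
    return out


def toy_fill(masked_tokens: List[str], position: int) -> str:
    """Placeholder for a masked language model; always returns the same word."""
    return "decent"


tokens = "The movie was great but the plot was boring".split()
adjective_positions = select_by_pos(tokens, "ADJ")
top_sentiment_positions = select_by_sentiment(tokens, k=1)
augmented = augment(tokens, top_sentiment_positions, toy_fill)
print(adjective_positions, top_sentiment_positions, " ".join(augmented))
```

Keeping the selection strategy separate from the fill function means the same augmentation loop can be run with different selection criteria, which is the comparison the abstract describes.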

This preprint has not undergone peer review (when applicable) or any post-submission improvements or corrections. The Version of Record of this contribution is published in the 27th International Conference on Natural Language & Information Systems and is available online at http://dx.doi.org/10.1007/978-3-031-08473-7_47.

Related research products (2):

Selective Word Substitution for Contextualized Data Augmentation . 2022 . HasVersion
keras-adamw software on GitHub . IsRelatedTo

Impact by BIP!

selected citations: 1. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
popularity: Average. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
influence: Average. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
impulse: Average. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.