Abusive Text Detection Using Neural Networks

descriptionPublicationkeyboard_double_arrow_right Article , Conference object , Other literature type 01 Jan 2017 Ireland Publisher:Technological University DublinJournal:CEUR Workshop Proceedings, volume 2,086, pages 258-260 (issn: 1613-0073,

Copyright policy )Publicly funded

Authors: Chen, Hao; McKeever, Susan; Delany, Sarah Jane;

doi: 10.21427/d7pb64

Abusive Text Detection Using Neural Networks

- Summary
- Subjects
- Metrics

Abstract

Neural network models have become increasingly popular for text classification in recent years. In particular, the emergence of word embeddings within deep learning architectures has recently attracted a high level of attention amongst researchers. In this paper, we focus on how neural network models have been applied in text classification. Secondly, we extend our previous work [4, 3] using a neural network strategy for the task of abusive text detection. We compare word embedding features to the traditional feature representations such as n-grams and handcrafted features. In addition, we use an off-the-shelf neural network classifier, FastText[16]. Based on our results, the conclusions are: (1) Extracting selected manual features can increase abusive content detection over using basic ngrams; (2) Although averaging pre-trained word embeddings is a naive method, the distributed feature representation has better performance to ngrams in most of our datasets; (3) While the FastText classifier works efficiently with fast performance, the results are not remarkable as it is a shallow neural network with only one hidden layer; (4) Using pre-trained word embeddings does not guarantee better performance in the FastText classifier.

Country

Ireland

Related Organizations

Keywords

abusive, machine learning, feature selection, classification, Computer Sciences, detection, abusive detection, neural networks, labelling strategy, text, 004

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

0

Average

Green

gold

Related to Research communities

EUt+

TU-NET