Fiction vs Non-Fiction Genre Classification: Classical Readability Metrics vs BERT

Rajeshwari Satrasala; Kushal Shah

Found an issue? Give us feedback

https://doi.org/10.3...arrow_drop_down

https://doi.org/10.32388/9a17f...

Article . 2025 . Peer-reviewed

License: CC BY

Data sources: Crossref

Fiction vs Non-Fiction Genre Classification: Classical Readability Metrics vs BERT

descriptionPublicationkeyboard_double_arrow_right Article 11 Apr 2025Publisher:Qeios Ltd

Authors: Rajeshwari Satrasala; Kushal Shah;

doi: 10.32388/9a17f4

Fiction vs Non-Fiction Genre Classification: Classical Readability Metrics vs BERT

- Summary
- Metrics

Abstract

In this paper, we show that fiction vs non-fiction genre classification can be achieved with very high accuracy using simple readability metrics, which have been extensively studied by linguists for many decades. In addition, we explore the BERT model for this classification and find that, although it can also achieve very high accuracy with the same amount of training data, its results are very hard to understand. We tried many adversarial attacks to break the fine-tuned BERT model but found it to be quite resilient.

Related Organizations

Indian Institute of Science Education and Research, Bhopal
India

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

0

Average

hybrid