
handle: 20.500.12358/23866
Text genre is a type of written text. Arabic text genre classification predicts genre of specific text document written in Arabic independent of its topic. In this paper, an approach was proposed that takes an Arabic document and classify it into one of four genres which are advertisements, news, subjective and scientific documents. Since the frequency of words approach produces a low performance when used in the genre, an attempted was made to generate attributes based on the style of the text. This approach evaluated using corpus collected for this purpose. Using four machine learning methods, our approach compared with the word frequency approach, and it found that our approach is better than this mainstream approach. It, also, found that predicting subjectivity and scientific genre is more accurate than predicting advertisements and news.
arabic language processing, index terms-text genre, machine learning methods, text genre classification, text mining
arabic language processing, index terms-text genre, machine learning methods, text genre classification, text mining
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
