Powered by OpenAIRE graph
Found an issue? Give us feedback
addClaim

Chinese News Classification

Authors: David Amore Cecchini; Li Na;

Chinese News Classification

Abstract

We applied three different machine learning models to classify Chinese news into a group of classes in two schemes. The first scheme is to process the texts into TF-IDF matrices prior to running support vector machines (SVM) and maximum entropy (MAXENT) models, while the second scheme uses an embedding layer in a convolutional neural network (CNN) in order to learn features during the training process. We then compare the results obtained by all the models in terms of overall accuracy, precision, recall and F-scores. The MAXENT model showed the best performance, with an overall accuracy of 93.71%. The CNN model showed a lower performance in comparison with MAXENT and SVM models, with an overall accuracy around 73.58%. This result was not expected and we conclude with some considerations about the CNN design and possible future improvements.

Related Organizations
  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    8
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Top 10%
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Top 10%
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Top 10%
Powered by OpenAIRE graph
Found an issue? Give us feedback
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
8
Top 10%
Top 10%
Top 10%
Upload OA version
Are you the author of this publication? Upload your Open Access version to Zenodo!
It’s fast and easy, just two clicks!