
doi: 10.1007/11875581_7
This paper proposes a method for grouping similar words for document classification. Word grouping is shown to achieve classification accuracy equivalent to that of Latent Semantic Analysis (LSA). A combined method is also proposed for document classification, consisting of grouping, then LSA, followed by k-Nearest Neighbor (k-NN) classification. The combined method achieves higher classification accuracy than the conventional methods, namely k-NN alone and LSA followed by k-NN. Grouping is therefore effective as a preprocessing step before the conventional methods.
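The three-stage pipeline named in the abstract (grouping, then LSA, then k-NN) can be illustrated with a minimal sketch. The abstract does not say how similar words are grouped, so the `GROUPS` table, the toy corpus, and every function name below are hypothetical and chosen only for illustration. LSA is approximated here by power iteration on the term Gram matrix, and neighbors are ranked by cosine similarity; both are assumptions, not necessarily the authors' implementation.

```python
import math
from collections import Counter

# Hypothetical hand-made grouping of similar words (not taken from the paper).
GROUPS = {"car": "VEHICLE", "automobile": "VEHICLE", "truck": "VEHICLE",
          "engine": "MOTOR", "motor": "MOTOR",
          "apple": "FRUIT", "banana": "FRUIT", "orange": "FRUIT", "fruit": "FRUIT"}

# Toy training corpus, invented for this example.
DOCS = ["car engine wheel", "automobile motor wheel", "truck engine road",
        "apple banana fruit", "orange fruit juice", "banana juice sweet"]
LABELS = ["auto", "auto", "auto", "food", "food", "food"]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def to_vector(text, groups, vocab):
    """Grouping step: replace each word by its group label, then count terms."""
    counts = Counter(groups.get(w, w) for w in text.split())
    return [float(counts[t]) for t in vocab]

def fit_lsa(X, rank, iters=200):
    """Approximate the top right singular vectors of X by power iteration."""
    m = len(X[0])
    gram = [[sum(r[i] * r[j] for r in X) for j in range(m)] for i in range(m)]
    basis = []
    for _ in range(rank):
        v = [1.0] * m
        for _step in range(iters):
            for u in basis:  # deflation: stay orthogonal to earlier vectors
                d = dot(v, u)
                v = [vi - d * ui for vi, ui in zip(v, u)]
            w = [dot(row, v) for row in gram]
            norm = math.sqrt(dot(w, w))
            if norm < 1e-12:  # no remaining direction with nonzero variance
                break
            v = [wi / norm for wi in w]
        basis.append(v)
    return basis

def project(vec, basis):
    """Map a term-count vector into the reduced LSA space."""
    return [dot(vec, u) for u in basis]

def cosine(a, b):
    na, nb = math.sqrt(dot(a, a)), math.sqrt(dot(b, b))
    return dot(a, b) / (na * nb) if na and nb else 0.0

def train(docs, groups, rank):
    """Build the grouped vocabulary, fit LSA, and reduce the training docs."""
    vocab = sorted({groups.get(w, w) for d in docs for w in d.split()})
    X = [to_vector(d, groups, vocab) for d in docs]
    basis = fit_lsa(X, rank)
    return vocab, basis, [project(x, basis) for x in X]

def classify(text, groups, vocab, basis, reduced, labels, k=3):
    """k-NN step: majority vote among the k most similar training docs."""
    q = project(to_vector(text, groups, vocab), basis)
    ranked = sorted(range(len(reduced)),
                    key=lambda i: cosine(q, reduced[i]), reverse=True)
    return Counter(labels[i] for i in ranked[:k]).most_common(1)[0][0]

MODEL = train(DOCS, GROUPS, rank=2)
print(classify("automobile engine", GROUPS, *MODEL, LABELS))
print(classify("orange sweet", GROUPS, *MODEL, LABELS))
```

In this toy setup, grouping shrinks the vocabulary before LSA is applied, so documents that use different but similar words ("car", "automobile") already share a dimension before the rank reduction. A real experiment would need a principled way to build the groups and a properly tuned rank and neighbor count.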
