
In order to reduce dimension number of feature space and improve clustering precision, a novel SOM clustering algorithm based on feature selection-FSSOM is provided in this paper. This algorithm first evaluates importance and distinguishing ability of each feature, and only selects features which can efficiently improve clustering precision to construct feature space. Then, it computes kullback-leibler divergence of different co-occurring feature vector, which is gotten from large scale training corpus, to reflect the similarity of different feature. This algorithm considers the influences of similar features and uses it in self-organizing-mapping algorithm. It can make latently similar documents into same cluster. The experiment results demonstrate that because of adjusting the similar featurespsila weights, enlarging feature adjusting range, it can efficiently improve clustering precision and reduce training time.
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
