
handle: 10214/20545
When provided with enough labelled training examples, a supervised learning algorithm can learn reasonably accurately. However, creating sufficient labelled data to train accurate classifiers is time consuming and expensive. On the other hand, unlabelled data is usually easy to obtain. This research introduces a novel approach, Guelph Cluster Class (GCC), which improves the task of classification with the use of unlabelled data. The novelty of this approach lies in the use of an unsupervised network, 'Self-Organizing Map', to select natural clusters in labelled and unlabelled data. Sub-classes (made by labelled data) are used to assign labels to unlabelled patterns to produce ' self-labelled' data. The performance of several variants of the GCC system have been obtained by running a 'Back-Propagation' network on labelled and self-labelled data. Results of experiments on several benchmark datasets demonstrate an increasing power for the classification procedure even when the number of labelled data is very small.
supervised learning algorithm, classification, Guelph Cluster Class, unsupervised network, train, Self-Organizing Map, unlabelled data, labelled data
supervised learning algorithm, classification, Guelph Cluster Class, unsupervised network, train, Self-Organizing Map, unlabelled data, labelled data
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
