
doi: 10.1002/widm.16
handle: 20.500.14802/11933
AbstractTraditional clustering partitions a group of objects into a number of nonoverlapping sets based on a similarity measure. In real world, the boundaries of these sets or clusters may not be clearly defined. Some of the objects may be almost equidistant from the center of multiple clusters. Traditional set theory mandates that these objects be assigned to a single cluster. Rough set theory can be used to represent the overlapping clusters. Rough sets provide more flexible representation than conventional sets, at the same time they are less descriptive than the fuzzy sets. This paper describes the basic concept of rough clustering based onk‐means, genetic algorithms, Kohonen self‐organizing maps, and support vector clustering. The discussion also includes a review of rough cluster validity measures, and applications of rough clustering to such diverse areas as forestry, medicine, medical imaging, web mining, super markets, and traffic engineering. © 2011 John Wiley & Sons, Inc.WIREs Data Mining Knowl Discov2011 1 64‐72 DOI: 10.1002/widm.16This article is categorized under:Technologies > Computational IntelligenceTechnologies > Machine LearningTechnologies > Structure Discovery and Clustering
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 51 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
