
We propose a similarity measure based on the multivariate hypergeometric distribution for the pairwise comparison of images and data vectors. The formulation and performance of the proposed measure are compared with other similarity measures using synthetic data. A method of piecewise approximation is also implemented to facilitate application of the proposed measure to large samples. Example applications of the proposed similarity measure are presented using mass spectrometry imaging data and gene expression microarray data. Results from synthetic and biological data indicate that the proposed measure is capable of providing meaningful discrimination between samples, and that it can be a useful tool for identifying potentially related samples in large-scale biological data sets.
Models, Statistical, Gene Expression Profiling, Computational Biology, Breast Neoplasms, Mass Spectrometry, Pattern Recognition, Automated, Gene Expression Regulation, Neoplastic, Receptors, Estrogen, Multivariate Analysis, Humans, Female, Algorithms, Oligonucleotide Array Sequence Analysis, Probability
Models, Statistical, Gene Expression Profiling, Computational Biology, Breast Neoplasms, Mass Spectrometry, Pattern Recognition, Automated, Gene Expression Regulation, Neoplastic, Receptors, Estrogen, Multivariate Analysis, Humans, Female, Algorithms, Oligonucleotide Array Sequence Analysis, Probability
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 9 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
