
doi: 10.1007/bf00377169
pmid: 28309352
Asymmetry in binary data arises when one of the two states (e.g. state "1") is interpreted as more informative than the other state. A common example in ecology occurs when one state represents presence of some unit and the other state represents absence. The problem of the classification of individuals based upon a set of such characters is related to the goal of group homogeneity. The homogeneity of a group of individuals is defined as the count over all possible pairs of individuals and all characters, of the number of shared 1 states, minus the number of mismatches or 0-1, 1-0 combinations. The shared 0 states are effectively neutral, then, in terms of 1-state homogeneity.The behaviour of some common binary similarity measures is examined in relation to 1-state homogeneity. Although the Jaccard coefficient comes close to having the desired behaviour it exhibits undesirable behaviour for some data values and a proportionality relationship between matches and mismatches that may not always be desirable. A new coefficient, "C", is introduced which overcomes these problems and leads to homogeneous classifications in the sense described above. Further general recommendations are made for the use of these coefficients in various contexts.
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 38 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
