
arXiv: 2009.05098
AbstractBiclustering is used for simultaneous clustering of the observations and variables when there is no group structure known a priori. It is being increasingly used in bioinformatics, text analytics, and so on. Previously, biclustering has been introduced in a model‐based clustering framework by utilizing a structure similar to a mixture of factor analyzers. In such models, observed variablesare modeled using a latent variablethat is assumed to be from. Clustering of variables are introduced by imposing constraints on the entries of the factor loading matrix to be 0 and 1 that results in block diagonal covariance matrices. However, this approach is overly restrictive as off‐diagonal elements in the blocks of the covariance matrices can only be 1 which can lead to unsatisfactory model fit on complex data. Here, the latent variableis assumed to be from awhereis a diagonal matrix. This ensures that the off‐diagonal terms in the block matrices within the covariance matrices are non‐zero and not restricted to be 1. This leads to a superior model fit on complex data. A family of models is developed by imposing constraints on the components of the covariance matrix. For parameter estimation, an alternating expectation conditional maximization (AECM) algorithm is used. Finally, the proposed method is illustrated using simulated and real datasets.
Methodology (stat.ME), FOS: Computer and information sciences, Statistics - Computation, Statistics - Methodology, Computation (stat.CO), 62H30
Methodology (stat.ME), FOS: Computer and information sciences, Statistics - Computation, Statistics - Methodology, Computation (stat.CO), 62H30
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 1 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
