
pmid: 37330468
pmc: PMC10276920
AbstractCapturing the conditional covariances or correlations among the elements of a multivariate response vector based on covariates is important to various fields including neuroscience, epidemiology and biomedicine. We propose a new method called Covariance Regression with Random Forests (CovRegRF) to estimate the covariance matrix of a multivariate response given a set of covariates, using a random forest framework. Random forest trees are built with a splitting rule specially designed to maximize the difference between the sample covariance matrix estimates of the child nodes. We also propose a significance test for the partial effect of a subset of covariates. We evaluate the performance of the proposed method and significance test through a simulation study which shows that the proposed method provides accurate covariance matrix estimates and that the Type-1 error is well controlled. An application of the proposed method to thyroid disease data is also presented. is implemented in a freely available R package on CRAN.
FOS: Computer and information sciences, Models, Statistical, Random Forest, Variable importance, QH301-705.5, Research, Computer applications to medicine. Medical informatics, R858-859.7, Machine Learning (stat.ML), Random forests, Statistics - Applications, Methodology (stat.ME), Statistics - Machine Learning, Multivariate response, Humans, Covariance regression, Computer Simulation, Applications (stat.AP), Biology (General), Child, Statistics - Methodology
FOS: Computer and information sciences, Models, Statistical, Random Forest, Variable importance, QH301-705.5, Research, Computer applications to medicine. Medical informatics, R858-859.7, Machine Learning (stat.ML), Random forests, Statistics - Applications, Methodology (stat.ME), Statistics - Machine Learning, Multivariate response, Humans, Covariance regression, Computer Simulation, Applications (stat.AP), Biology (General), Child, Statistics - Methodology
| citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 8 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
