
pmid: 30289163
arXiv: 1611.06350
handle: 11577/3218859 , 11390/1147470 , 11573/1741945 , 11573/1741930
pmid: 30289163
arXiv: 1611.06350
handle: 11577/3218859 , 11390/1147470 , 11573/1741945 , 11573/1741930
AbstractWe introduce a novel class of factor analysis methodologies for the joint analysis of multiple studies. The goal is to separately identify and estimate (1) common factors shared across multiple studies, and (2) study-specific factors. We develop an Expectation Conditional-Maximization algorithm for parameter estimates and we provide a procedure for choosing the numbers of common and specific factors. We present simulations for evaluating the performance of the method and we illustrate it by applying it to gene expression data in ovarian cancer. In both, we clarify the benefits of a joint analysis compared to the standard factor analysis. We have provided a tool to accelerate the pace at which we can combine unsupervised analysis across multiple studies, and understand the cross-study reproducibility of signal in multivariate data. An R package (MSFA), is implemented and is available on GitHub.
FOS: Computer and information sciences, dimension reduction, Gene Expression, Statistics - Applications, Paired and multiple comparisons; multiple testing, cross-study analysis; Dimension reduction; ECM algorithm; gene expression; meta-analysis; reproducibility; Statistics and Probability; Biochemistry, Genetics and Molecular Biology (all); Immunology and Microbiology (all); Agricultural and Biological Sciences (all); Applied Mathematics, Applications of statistics to biology and medical sciences; meta analysis, Humans, Computer Simulation, Applications (stat.AP), reproducibility, Ovarian Neoplasms, Measures of association (correlation, canonical correlation, etc.), ECM algorithm, Reproducibility of Results, Factor analysis and principal components; correspondence analysis, meta-analysis, Immune System, gene expression, Female, cross-study analysis, Factor Analysis, Statistical, Algorithms
FOS: Computer and information sciences, dimension reduction, Gene Expression, Statistics - Applications, Paired and multiple comparisons; multiple testing, cross-study analysis; Dimension reduction; ECM algorithm; gene expression; meta-analysis; reproducibility; Statistics and Probability; Biochemistry, Genetics and Molecular Biology (all); Immunology and Microbiology (all); Agricultural and Biological Sciences (all); Applied Mathematics, Applications of statistics to biology and medical sciences; meta analysis, Humans, Computer Simulation, Applications (stat.AP), reproducibility, Ovarian Neoplasms, Measures of association (correlation, canonical correlation, etc.), ECM algorithm, Reproducibility of Results, Factor analysis and principal components; correspondence analysis, meta-analysis, Immune System, gene expression, Female, cross-study analysis, Factor Analysis, Statistical, Algorithms
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 41 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
