
pmid: 38965376
pmc: PMC11343713
AbstractData within biobanks capture broad yet detailed indices of human variation, but biobank-wide insights can be difficult to extract due to complexity and scale. Here, using large-scale factor analysis, we distill hundreds of variables (diagnoses, assessments and survey items) into 35 latent constructs, using data from unrelated individuals with predominantly estimated European genetic ancestry in UK Biobank. These factors recapitulate known disease classifications, disentangle elements of socioeconomic status, highlight the relevance of psychiatric constructs to health and improve measurement of pro-health behaviours. We go on to demonstrate the power of this approach to clarify genetic signal, enhance discovery and identify associations between underlying phenotypic structure and health outcomes. In building a deeper understanding of ways in which constructs such as socioeconomic status, trauma, or physical activity are structured in the dataset, we emphasize the importance of considering the interwoven nature of the human phenome when evaluating public health patterns.
Male, UK Biobank, Statistical methods, Epidemiology, 610, 600, Middle Aged, Article, United Kingdom, /dk/atira/pure/core/keywords/population_health_SRI, Phenotype, Social Class, Humans, Data integration, Female, name=Bristol Population Health Science Institute, Biological Specimen Banks
Male, UK Biobank, Statistical methods, Epidemiology, 610, 600, Middle Aged, Article, United Kingdom, /dk/atira/pure/core/keywords/population_health_SRI, Phenotype, Social Class, Humans, Data integration, Female, name=Bristol Population Health Science Institute, Biological Specimen Banks
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 10 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
