
Awareness that disease susceptibility depends not only on genetic makeup but can also be affected by lifestyle decisions has brought more attention to the role of diet. However, food is often treated as a black box, or the focus is limited to a few well-studied compounds, such as polyphenols, lipids and nutrients. In this work, we applied text mining and Naïve Bayes classification to assemble the knowledge space of food-phytochemical and food-disease associations, where we distinguish between disease prevention/amelioration and disease progression. We subsequently searched for frequently occurring phytochemical-disease pairs and identified 20,654 phytochemicals from 16,102 plants associated with 1,592 human disease phenotypes. We selected colon cancer as a case study and analyzed our results in three directions: i) a one-stop legacy knowledge shop for the effect of food on disease, ii) discovery of novel bioactive compounds with drug-like properties, and iii) discovery of novel health benefits from foods. This work represents a systematized approach to the association of food with health effects, and provides the phytochemical layer of information for nutritional systems biology research.
QH301-705.5, Nutritional Sciences, Phytochemicals, Diet, Data Mining, Humans, Physiological aspects, SDG 3 - Good Health and Well-being, Biology (General), Life Style, Systems Biology, Bayes Theorem, Plants, Lipids, Phenotype, Food, Colonic Neoplasms, Disease Progression, Disease Susceptibility, Algorithms, Medical Informatics, Software, Research Article
| selected citations | 28 |
| popularity | Average |
| influence | Top 10% |
| impulse | Top 10% |
