HIPred: an integrative approach to predicting haploinsufficient genes

Publication: Article, 30 Jan 2017, United Kingdom, English

Publisher: Oxford University Press (OUP)

Journal: Bioinformatics, volume 33, pages 1751-1757 (issn: 1367-4803, eissn: 1367-4811)

Funded by: UKRI | Data mining and bioinform...

Authors: Hashem A. Shihab; Mark F. Rogers; Colin Campbell; Tom R. Gaunt

doi: 10.1093/bioinformatics/btx028

pmid: 28137713

pmc: PMC5581952

handle: 1983/337b93f5-4184-4ea7-b9b5-b8b65cf13757

Abstract

Motivation: A major cause of autosomal dominant disease is haploinsufficiency, whereby a single copy of a gene is not sufficient to maintain the normal function of the gene. A large proportion of existing methods for predicting haploinsufficiency incorporate biological networks, e.g. protein-protein interaction networks, that have recently been shown to introduce study bias. As a result, these methods tend to perform best on well-studied genes, but underperform on less studied genes. The advent of large genome sequencing consortia, such as the 1000 Genomes Project, the NHLBI Exome Sequencing Project and the Exome Aggregation Consortium, creates an urgent need for unbiased haploinsufficiency prediction methods.

Results: Here, we describe a machine learning approach, called HIPred, that integrates genomic and evolutionary information from ENSEMBL with functional annotations from the Encyclopaedia of DNA Elements consortium and the NIH Roadmap Epigenomics Project to predict haploinsufficiency, without the study bias described earlier. We benchmark HIPred using several datasets and show that our unbiased method performs as well as, and in most cases outperforms, existing biased algorithms.

Availability and Implementation: HIPred scores for all gene identifiers are available at: https://github.com/HAShihab/HIPred.

Supplementary information: Supplementary data are available at Bioinformatics online.
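Because the scores are distributed per gene identifier, a typical use is to look up a list of genes of interest and order them by score. The following is a minimal sketch only: the file layout (tab-separated, with a header row and columns named `gene` and `score`), the gene names, and the helper functions `load_scores` and `rank_genes` are all assumptions made for illustration and are not taken from the HIPred repository, whose actual format may differ.

```python
import csv
import io

# Hypothetical excerpt. The real file layout in the HIPred repository may differ.
# Assumed layout: tab-separated, header row, columns "gene" and "score".
EXAMPLE_TSV = "gene\tscore\nGENE_A\t0.91\nGENE_B\t0.12\nGENE_C\t0.55\n"


def load_scores(handle, gene_col="gene", score_col="score"):
    """Read a tab-separated score table into a {gene: score} dictionary."""
    reader = csv.DictReader(handle, delimiter="\t")
    return {row[gene_col]: float(row[score_col]) for row in reader}


def rank_genes(scores, genes):
    """Return the requested genes that have a score, highest score first."""
    found = [(g, scores[g]) for g in genes if g in scores]
    return sorted(found, key=lambda pair: pair[1], reverse=True)


# In practice the handle would be an opened score file instead of StringIO.
scores = load_scores(io.StringIO(EXAMPLE_TSV))
ranked = rank_genes(scores, ["GENE_B", "GENE_A", "GENE_X"])
print(ranked)
```

Genes without an entry (here the made-up `GENE_X`) are skipped rather than given a default score, so a missing prediction is not mistaken for a low one. How a score should be interpreted, including any threshold, should be checked against the repository documentation.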

Country

United Kingdom

Related Organizations

University of Bristol, United Kingdom
MRC Integrative Epidemiology Unit, United Kingdom
University of Bristol (UoB), United Kingdom

Keywords

570; Genome, Human; Sequence Analysis, RNA; 610; Genomics; Haploinsufficiency; Sequence Analysis, DNA; Original Papers; Chromatin; Epigenesis, Genetic; Histones; Machine Learning; Humans; Protein Interaction Maps

Impact by BIP!

- selected citations: 40. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- popularity: Top 10%. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
- influence: Top 10%. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- impulse: Top 10%. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.