
Abstract Background Gene ontology (GO) enrichment is commonly used for inferring biological meaning from systems biology experiments. However, determining differential GO and pathway enrichment between DNA-binding experiments or using the GO structure to classify experiments has received little attention. Results Herein, we present a bioinformatics tool, CompGO, for identifying Differentially Enriched Gene Ontologies, called DiEGOs, and pathways, through the use of a z-score derivation of log odds ratios, and visualizing these differences at GO and pathway level. Through public experimental data focused on the cardiac transcription factor NKX2-5, we illustrate the problems associated with comparing GO enrichments between experiments using a simple overlap approach. Conclusions We have developed an R/Bioconductor package, CompGO, which implements a new statistic normally used in epidemiological studies for performing comparative GO analyses and visualizing comparisons from .BED data containing genomic coordinates as well as gene lists as inputs. We justify the statistic through inclusion of experimental data and compare to the commonly used overlap method. CompGO is freely available as a R/Bioconductor package enabling easy integration into existing pipelines and is available at: http://www.bioconductor.org/packages/release/bioc/html/CompGO.html packages/release/bioc/html/CompGO.html
570, Serum Response Factor, QH301-705.5, Computer applications to medicine. Medical informatics, R858-859.7, 610, Jaccard Coefficient, anzsrc-for: 49 Mathematical sciences, Biochemistry, 3102 Bioinformatics and Computational Biology, Genetics, Hard Thresholding, Homeobox, Humans, anzsrc-for: 31 Biological Sciences, Biology (General), Molecular Biology, anzsrc-for: 46 Information and computing sciences, Human Genome, anzsrc-for: 01 Mathematical Sciences, Genes, Homeobox, Computational Biology, DNA, Genomics, Computer Science Applications, Gene Ontology, Networking and Information Technology R&D (NITRD), Genes, Enrich Gene Ontology, anzsrc-for: 06 Biological Sciences, anzsrc-for: 3102 Bioinformatics and Computational Biology, anzsrc-for: 08 Information and Computing Sciences, Software, 31 Biological Sciences, Biotechnology
570, Serum Response Factor, QH301-705.5, Computer applications to medicine. Medical informatics, R858-859.7, 610, Jaccard Coefficient, anzsrc-for: 49 Mathematical sciences, Biochemistry, 3102 Bioinformatics and Computational Biology, Genetics, Hard Thresholding, Homeobox, Humans, anzsrc-for: 31 Biological Sciences, Biology (General), Molecular Biology, anzsrc-for: 46 Information and computing sciences, Human Genome, anzsrc-for: 01 Mathematical Sciences, Genes, Homeobox, Computational Biology, DNA, Genomics, Computer Science Applications, Gene Ontology, Networking and Information Technology R&D (NITRD), Genes, Enrich Gene Ontology, anzsrc-for: 06 Biological Sciences, anzsrc-for: 3102 Bioinformatics and Computational Biology, anzsrc-for: 08 Information and Computing Sciences, Software, 31 Biological Sciences, Biotechnology
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 39 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
