The Gene Ontology Categorizer

descriptionPublicationkeyboard_double_arrow_right Article , Conference object 04 Aug 2004 English Publisher:Oxford University Press (OUP)Journal:Bioinformatics, volume 20, pages i169-i177 (issn: 1367-4803, eissn: 1367-4811,

Copyright policy )

Authors: Cliff A. Joslyn; Susan M. Mniszewski; Andy W. Fulmer; Gary Heaton;

doi: 10.1093/bioinformatics/bth921

pmid: 15262796

The Gene Ontology Categorizer

- Summary
- Subjects
- Metrics

Abstract

Abstract Summary: The Gene Ontology Categorizer, developed jointly by the Los Alamos National Laboratory and Procter & Gamble Corp., provides a capability for the categorization task in the Gene Ontology (GO): given a list of genes of interest, what are the best nodes of the GO to summarize or categorize that list? The motivating question is from a drug discovery process, where after some gene expression analysis experiment, we wish to understand the overall effect of some cell treatment or condition by identifying ‘where’ in the GO the differentially expressed genes fall: ‘clustered’ together in one place? in two places? uniformly spread throughout the GO? ‘high’, or ‘low’? In order to address this need, we view bio-ontologies more as combinatorially structured databases than facilities for logical inference, and draw on the discrete mathematics of finite partially ordered sets (posets) to develop data representation and algorithms appropriate for the GO. In doing so, we have laid the foundations for a general set of methods to address not just the categorization task, but also other tasks (e.g. distances in ontologies and ontology merger and exchange) in both the GO and other bio-ontologies (such as the Enzyme Commission database or the MEdical Subject Headings) cast as hierarchically structured taxonomic knowledge systems.

Related Organizations

Los Alamos National Laboratory
United States
Procter & Gamble (United States)
United States

Keywords

Medical Subject Headings, Genes, Terminology as Topic, Database Management Systems, Information Storage and Retrieval, Proteins, Databases, Protein

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	53
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 1%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%

Found an issue? Give us feedback

53

Top 10%

Top 1%

Top 10%

gold

Fields of Science (4) View all

engineering and technology

medical engineering

Fields of Science

engineering and technology

medical engineering

View all