
pmid: 17142812
AbstractMotivation: The discovery of regulatory pathways, signal cascades, metabolic processes or disease models requires knowledge on individual relations like e.g. physical or regulatory interactions between genes and proteins. Most interactions mentioned in the free text of biomedical publications are not yet contained in structured databases.Results: We developed RelEx, an approach for relation extraction from free text. It is based on natural language preprocessing producing dependency parse trees and applying a small number of simple rules to these trees. We applied RelEx on a comprehensive set of one million MEDLINE abstracts dealing with gene and protein relations and extracted ~150 000 relations with an estimated perfomance of both 80% precision and 80% recall.Availability: The used natural language preprocessing tools are free for use for academic research. Test sets and relation term lists are available from our website ().Contact: katrin.fundel@bio.ifi.lmu.de
MEDLINE, Protein Interaction Mapping, Database Management Systems, Gene Expression, Information Storage and Retrieval, Proteins, Algorithms, Software, Natural Language Processing
MEDLINE, Protein Interaction Mapping, Database Management Systems, Gene Expression, Information Storage and Retrieval, Proteins, Algorithms, Software, Natural Language Processing
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 371 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 1% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 1% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
