Faster computation of exact RNA shape probabilities

Publication: Article, 14 Jan 2010, Germany, English

Publisher: Oxford University Press (OUP)

Journal: Bioinformatics, volume 26, pages 632-639 (issn: 1367-4803, eissn: 1367-4811)

Authors: Janssen, Stefan; Giegerich, Robert;

doi: 10.1093/bioinformatics/btq014

pmid: 20080511

pmc: PMC2828121


Abstract

Motivation: Abstract shape analysis allows efficient computation of a representative sample of low-energy foldings of an RNA molecule. More comprehensive information is obtained by computing shape probabilities, accumulating the Boltzmann probabilities of all structures within each abstract shape. Such information is superior to free energies because it is independent of sequence length and base composition. However, up to this point, computation of shape probabilities evaluates all shapes simultaneously and comes with a computational cost that is exponential in the length of the sequence.

Results: We devise an approach called RapidShapes that computes the shapes above a specified probability threshold T by generating a list of promising shapes and constructing specialized folding programs for each shape to compute its share of Boltzmann probability. This aims at a heuristic improvement of runtime, while still computing exact probability values.

Conclusion: Evaluating this approach and several substrategies, we find that only a small proportion of shapes actually have to be computed. For an RNA sequence of length 400, this leads, depending on the threshold, to a 10- to 138-fold speed-up compared with the previous complete method. Thus, probabilistic shape analysis has become feasible in medium-scale applications, such as the screening of RNA transcripts in a bacterial genome.

Availability: RapidShapes is available via http://bibiserv.cebitec.uni-bielefeld.de/rnashapes

Contact: robert@techfak.uni-bielefeld.de

Supplementary information: Supplementary data are available at Bioinformatics online.
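The notion of a shape probability used in the abstract can be illustrated with a small sketch. It is not the RapidShapes implementation: the real method sums over the entire folding space with shape-specific dynamic programming, whereas this toy example enumerates a handful of hand-written structures. The structures and free energies in TOY are invented for illustration, the function names are hypothetical, and the shape abstraction shown is assumed to correspond to the most abstract level (helix nesting only, unpaired bases ignored, stacks, bulges, and internal loops collapsed), with "_" standing for the open chain.

```python
import math
from collections import defaultdict

RT = 0.61632  # kcal/mol at 37 degrees C (gas constant times 310.15 K)


def parse(dot_bracket):
    """Parse a dot-bracket string into nested lists, one list per base pair.

    Unpaired bases ('.') are ignored. Assumes balanced brackets.
    """
    root = []
    stack = [root]
    for ch in dot_bracket:
        if ch == "(":
            node = []
            stack[-1].append(node)
            stack.append(node)
        elif ch == ")":
            stack.pop()
    return root


def shape(children):
    """Render the helix nesting pattern of a list of base-pair nodes."""
    out = []
    for node in children:
        # A pair enclosing exactly one other pair (stack, bulge, internal
        # loop) is collapsed into that inner pair.
        while len(node) == 1:
            node = node[0]
        out.append("[" + shape(node) + "]")
    return "".join(out)


def abstract_shape(dot_bracket):
    """Map a structure to its shape string; '_' denotes the open chain."""
    s = shape(parse(dot_bracket))
    return s if s else "_"


def shape_probabilities(structures):
    """Accumulate Boltzmann probabilities of structures per shape.

    structures: iterable of (dot_bracket, free_energy_in_kcal_per_mol).
    """
    weighted = [(abstract_shape(db), math.exp(-e / RT)) for db, e in structures]
    z = sum(w for _, w in weighted)  # partition function of the toy ensemble
    probs = defaultdict(float)
    for s, w in weighted:
        probs[s] += w / z
    return dict(probs)


def shapes_above(probs, threshold):
    """Keep only shapes whose probability reaches the threshold T."""
    return {s: p for s, p in probs.items() if p >= threshold}


# Invented toy ensemble: (structure, free energy in kcal/mol).
TOY = [
    ("((((....))))", -3.0),
    ("(((......)))", -2.5),
    ("((..))((..))", -2.0),
    ("............", 0.0),
]

if __name__ == "__main__":
    probs = shape_probabilities(TOY)
    for s, p in sorted(probs.items(), key=lambda kv: -kv[1]):
        print(f"{s:6s} {p:.4f}")
```

In this toy ensemble the two hairpin structures fall into the same shape, so their Boltzmann weights add up. Calling shapes_above(probs, T) then mirrors the idea of reporting only shapes above a probability threshold T, although here nothing is saved by doing so because every structure has already been enumerated.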

Country

Germany

Related Organizations

Bielefeld University
Germany

Keywords

Base Sequence; Sequence Analysis, RNA; Databases, Genetic; Molecular Sequence Data; Computational Biology; Nucleic Acid Conformation; RNA; Original Papers; 004

Impact by BIP!

	selected citations: 17 (derived from selected sources; an alternative to the "Influence" indicator, which also reflects the overall impact of an article in the research community at large, based on the underlying citation network)
	popularity: Average (the "current" impact or attention of an article in the research community at large, based on the underlying citation network)
	influence: Average (the overall impact of an article in the research community at large, based on the underlying citation network, diachronically)
	impulse: Top 10% (the initial momentum of an article directly after its publication, based on the underlying citation network)



Fields of Science

engineering and technology

medical engineering
