Simultaneous estimation of transcript abundances and transcript specific fragment distributions of RNA-Seq data with the Mix2 model

Preprint OPEN
Tuerk, Andreas; Wiktorin, Gregor;
  • Related identifiers: doi: 10.1101/005918
  • Subject: bepress|Life Sciences|Biology | bepress|Life Sciences|Bioinformatics

Quantification of RNA transcripts with RNA-Seq is inaccurate due to positional fragmentation bias, which is not represented appropriately by current statistical models of RNA-Seq data. Another, less investigated, source of error is the inaccuracy of transcript start and... View more
  • References (17)
    17 references, page 1 of 2

    [1] A. P. Dempster, N. M. Laird, and D. B. Rubin. Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society, Series B, 39(1):1-38, 1977.

    [2] Thasso Griebel, Benedikt Zacher, Paolo Ribeca, Emanuele Raineri, Vincent Lacroix, Roderic Guig, and Michael Sammeth. Modelling and simulating generic RNA-Seq experiments with the flux simulator. Nucleic Acids Research, 40(20):10073-10083, 2012.

    [3] Kasper D Hansen, Steven E Brenner, and Sandrine Dudoit. Biases in Illumina transcriptome sequencing caused by random hexamer priming. Nucleic Acids Res, 38(12):e131, Jul 2010.

    [4] L. L. Hsiao, R. V. Jensen, T. Yoshida, K. E. Clark, J. E. Blumenstock, and S. R. Gullans. Correcting for signal saturation errors in the analysis of microarray data. BioTechniques, 32(2), February 2002.

    [5] Yu Hu, Yichuan Liu, Xianyun Mao, Cheng Jia, Jane F. Ferguson, Chenyi Xue, Muredach P. Reilly, Hongzhe Li, and Mingyao Li. PennSeq: accurate isoform-specific gene expression quantification in RNA-Seq by modeling non-uniform read distribution. Nucleic Acids Research, 42(3):e20, 2014.

    [6] Bo Li and Colin Dewey. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinformatics, 12(1):323, 2011.

    [7] Bo Li, Victor Ruotti, Ron M Stewart, James A Thomson, and Colin N Dewey. RNA-Seq gene expression estimation with read mapping uncertainty. Bioinformatics, 26(4):493-500, Feb 2010.

    [8] Heng Li, Bob Handsaker, Alec Wysoker, Tim Fennell, Jue Ruan, Nils Homer, Gabor Marth, Goncalo Abecasis, Richard Durbin, and 1000 Genome Project Data Processing Subgroup. The Sequence Alignment/Map format and SAMtools. Bioinformatics, 25(16):2078-2079, August 2009.

    [9] Jun Li, Hui Jiang, and Wing Wong. Modeling non-uniformity in short-read rates in rna-seq data. Genome Biology, 11(5):R50, 2010.

    [10] Wei Li and Tao Jiang. Transcriptome assembly and isoform expression level estimation from biased RNASeq reads. Bioinformatics, 28(22):2914-2921, 2012.

  • Related Research Results (1)
    Inferred by OpenAIRE
    pennseq software on SourceForge
  • Metrics
Share - Bookmark

  • Download from
    bioRxiv via bioRxiv (Preprint, 2014)
  • Cite this publication