
RNA-seq is a next-generation sequencing method with a wide range of applications, including single nucleotide polymorphism (SNP) detection, splice junction identification, and gene expression level measurement. However, RNA-seq sequence data can be biased during library construction, resulting in incorrect data for SNP, splice junction, and gene expression studies. Here, we developed new library preparation methods to limit such biases.

A whole transcriptome library prepared for the SOLiD system displayed numerous read duplications (pile-ups) and gaps in known exons. These pile-ups and gaps caused a loss of SNP and splice junction information and reduced the quality of gene expression results. Further, we found clear sequence biases for both 5' and 3' end reads in the whole transcriptome library. To remove this bias, RNaseIII fragmentation was replaced with heat fragmentation. For adaptor ligation, T4 Polynucleotide Kinase (T4PNK) was used following heat fragmentation. However, its kinase and phosphatase activities introduced additional sequence biases. To minimize them, we used OptiKinase before T4PNK. Our study further revealed the specific target sequences of RNaseIII and T4PNK.

Our results suggest that heat fragmentation removed the RNaseIII sequence bias and significantly reduced the pile-ups and gaps. OptiKinase minimized the T4PNK sequence biases and removed most of the remaining pile-ups and gaps, thus maximizing the quality of RNA-seq data.
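As an illustration of the two quantities discussed above (sequence bias at read ends and read pile-ups), the following is a minimal Python sketch, not the analysis used in the study. It assumes reads are available as plain base-space strings (SOLiD data are natively in color space and would need conversion first) and that mapped reads are summarized as hypothetical `(chrom, start, strand)` tuples. The function names `end_base_frequencies` and `duplication_rate` are made up for this example.

```python
from collections import Counter


def end_base_frequencies(reads, n_positions=5, from_end="5p"):
    """Per-position base frequencies over the first (5') or last (3') bases.

    A strongly skewed frequency at a given position hints at a sequence
    preference of the fragmentation or end-repair enzyme.
    """
    counts = [Counter() for _ in range(n_positions)]
    for read in reads:
        read = read.upper()
        if len(read) < n_positions:
            continue  # skip reads shorter than the inspected window
        window = read[:n_positions] if from_end == "5p" else read[-n_positions:]
        for i, base in enumerate(window):
            counts[i][base] += 1
    freqs = []
    for c in counts:
        total = sum(c.values())
        freqs.append({b: (c[b] / total if total else 0.0) for b in "ACGT"})
    return freqs


def duplication_rate(alignments):
    """Fraction of reads sharing (chrom, start, strand) with another read.

    A high value is a crude indicator of pile-ups.
    """
    total = len(alignments)
    if total == 0:
        return 0.0
    return 1 - len(set(alignments)) / total


if __name__ == "__main__":
    # Toy data, assumed for illustration only.
    reads = ["ACGT", "AGGT", "ACTT", "TCGA"]
    print(end_base_frequencies(reads, n_positions=2))
    hits = [("chr1", 100, "+"), ("chr1", 100, "+"), ("chr1", 200, "+"), ("chr2", 5, "-")]
    print(duplication_rate(hits))
```

Comparing these summaries between libraries prepared with different fragmentation or kinase steps would give a rough, first-pass view of whether end bias and duplication changed.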
Ribonuclease III, OptiKinase, T4PNK, Hot Temperature, Polynucleotide 5'-Hydroxyl-Kinase, Agricultural and Biological Sciences(all), biology, Biochemistry, Genetics and Molecular Biology(all), Sequence Analysis, RNA, Research, heat fragmentation, sequence bias, 612, t4pnk, optikinase, rna-seq, rnaseiii, RNaseIII, RNA-seq, Transcriptome, Gene Library
