
Btrim is a fast and lightweight software to trim adapters and low quality regions in reads from ultra high-throughput next-generation sequencing machines. It also can reliably identify barcodes and assign the reads to the original samples. Based on a modified Myers's bit-vector dynamic programming algorithm, Btrim can handle indels in adapters and barcodes. It removes low quality regions and trims off adapters at both or either end of the reads. A typical trimming of 30M reads with two sets of adapter pairs can be done in about a minute with a small memory footprint. Btrim is a versatile stand-alone tool that can be used as the first step in virtually all next-generation sequence analysis pipelines. The program is available at \url{http://graphics.med.yale.edu/trim/}.
8 pages, 1 figure
Genomics (q-bio.GN), FOS: Computer and information sciences, Barcode assignment, Approximate string matching, Bit-vector algorithm, High-Throughput Nucleotide Sequencing, Sequence Analysis, DNA, Computational Engineering, Finance, and Science (cs.CE), Adapters trimming, FOS: Biological sciences, Computer Science - Data Structures and Algorithms, Next-generation sequencing, Genetics, Quantitative Biology - Genomics, Data Structures and Algorithms (cs.DS), Computer Science - Computational Engineering, Finance, and Science, Algorithms, Software
Genomics (q-bio.GN), FOS: Computer and information sciences, Barcode assignment, Approximate string matching, Bit-vector algorithm, High-Throughput Nucleotide Sequencing, Sequence Analysis, DNA, Computational Engineering, Finance, and Science (cs.CE), Adapters trimming, FOS: Biological sciences, Computer Science - Data Structures and Algorithms, Next-generation sequencing, Genetics, Quantitative Biology - Genomics, Data Structures and Algorithms (cs.DS), Computer Science - Computational Engineering, Finance, and Science, Algorithms, Software
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 508 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 0.1% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 1% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 1% |
