
Abstract Next-generation DNA sequencing has revolutionized the study of biology. However, the short read lengths of the dominant instruments complicate assembly of complex genomes and haplotype phasing of mixtures of similar sequences. Here we demonstrate a method to reconstruct the sequences of individual nucleic acid molecules up to 11.6 kilobases in length from short (150-bp) reads. We show that our method can construct 99.97%-accurate synthetic reads from bacterial, plant, and animal genomic samples, full-length mRNA sequences from human cancer cell lines, and individual HIV env gene variants from a mixture. The preparation of multiple samples can be multiplexed into a single tube, further reducing effort and cost relative to competing approaches. Our approach generates sequencing libraries in three days from less than one microgram of DNA in a single-tube format without custom equipment or specialized expertise.
DNA, Bacterial, 570, DNA, Plant, General Science & Technology, Science, Bioinformatics and Computational Biology, 610, Genetics, Animals, Humans, Gene Library, Genome, Human Genome, Q, Bacterial, R, High-Throughput Nucleotide Sequencing, DNA, Plant, DNA, Neoplasm, Sequence Analysis, DNA, Biological Sciences, Haplotypes, HIV/AIDS, Neoplasm, Medicine, Infection, Sequence Analysis, Algorithms, Biotechnology, Research Article
DNA, Bacterial, 570, DNA, Plant, General Science & Technology, Science, Bioinformatics and Computational Biology, 610, Genetics, Animals, Humans, Gene Library, Genome, Human Genome, Q, Bacterial, R, High-Throughput Nucleotide Sequencing, DNA, Plant, DNA, Neoplasm, Sequence Analysis, DNA, Biological Sciences, Haplotypes, HIV/AIDS, Neoplasm, Medicine, Infection, Sequence Analysis, Algorithms, Biotechnology, Research Article
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 33 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
