
doi: 10.7717/peerj.10029
pmid: 33150059
pmc: PMC7585375
handle: 10852/81195 , 11250/2759662 , 11250/2688279
doi: 10.7717/peerj.10029
pmid: 33150059
pmc: PMC7585375
handle: 10852/81195 , 11250/2759662 , 11250/2688279
Nanopore sequencing is rapidly becoming more popular for use in various microbiota-based applications. Major limitations of current approaches are that they do not enable de novo species identification and that they cannot be used to verify species assignments. This severely limits applicability of the nanopore sequencing technology in taxonomic applications. Here, we demonstrate the possibility of de novo species identification and verification using hexamer frequencies in combination with k-means clustering for nanopore sequencing data. The approach was tested on the human infant gut microbiota of 3-month-old infants. Using the hexamer k-means approach we identified two new low abundant species associated with vaginal delivery. In addition, we confirmed both the vaginal delivery association for two previously identified species and the overall high levels of bifidobacteria. Taxonomic assignments were further verified by mock community analyses. Therefore, we believe our de novo species identification approach will have widespread application in analyzing microbial communities in the future.
Nanopore, 570, QH301-705.5, Bioinformatics, Infant gut, Microbiota, R, Medicine, 16S rrNA, Biology (General)
Nanopore, 570, QH301-705.5, Bioinformatics, Infant gut, Microbiota, R, Medicine, 16S rrNA, Biology (General)
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 3 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
