
Abstract Utilizing base-conversion (BC) techniques, single-base resolution profiling of RNA and DNA modifications has significantly advanced. BC strategies range from one-way conversions (e.g. cytosine-to-thymine for 5-methylcytosine, adenine-to-guanine for N6-methyladenosine), to multi-way conversions (e.g. adenine to cytosine/guanine/thymine for N1-methyladenosine) and deletion-induced conversions (e.g. pseudouridine-to-deletion). Existing sequence aligners struggle with these diverse conversions, often leading to misaligning or inefficiency. We introduce BASAL (BAse-conversion Sequencing ALigner), which leverages bit-masking technology to accurately calculate mismatch penalties and supports all BC strategies. BASAL outperforms state-of-the-art tools in both mapping accuracy and efficiency. Through simulated and real data testing, along with experimental validation, we demonstrate that BASAL excels at identifying reliable modification sites. Moreover, BASAL enhances single-cell m6A analysis, revealing cell subpopulations and a cell evolutionary direction that align with biological functions, which other aligners fall short. BASAL’s versatility establishes it as a universal aligner for RNA and DNA modification sequencing, facilitating groundbreaking discoveries in epigenomics and epitranscriptomics.
Sequence Analysis, RNA, Nucleotides, Computational Biology, High-Throughput Nucleotide Sequencing, Sequence Analysis, DNA, Humans, RNA, Single-Cell Analysis, Algorithms, Software
Sequence Analysis, RNA, Nucleotides, Computational Biology, High-Throughput Nucleotide Sequencing, Sequence Analysis, DNA, Humans, RNA, Single-Cell Analysis, Algorithms, Software
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 2 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
