High precision in microRNA prediction: a novel genome-wide approach based on convolutional deep residual networks

Cristian Yones; Jonathan Raad; Leandro A. Bugnon; Diego H. Milone; Georgina Stegmayer

Found an issue? Give us feedback

https://doi.org/10.1...arrow_drop_down

https://doi.org/10.1101/2020.1...

Article . 2020 . Peer-reviewed

License: CC BY

Data sources: Crossref

https://www.biorxiv.org/conten...

Article

License: CC BY

Data sources: UnpayWall

https://dx.doi.org/10.60692/qf...

Other literature type . 2020

Data sources: Datacite

https://dx.doi.org/10.60692/1f...

Other literature type . 2020

Data sources: Datacite

https://dx.doi.org/10.1101/202...

Other literature type

Data sources: Microsoft Academic Graph

High precision in microRNA prediction: a novel genome-wide approach based on convolutional deep residual networks

دقة عالية في التنبؤ بالحمض النووي الريبي المجهري: نهج جديد على نطاق الجينوم يعتمد على الشبكات التلافيفية العميقة المتبقية

descriptionPublicationkeyboard_double_arrow_right Article , Other literature type 25 Oct 2020Publisher:Cold Spring Harbor Laboratory

Authors: Cristian Yones; Jonathan Raad; Leandro A. Bugnon; Diego H. Milone; Georgina Stegmayer;

doi: 10.1101/2020.10.23.352179 , 10.60692/qfxr3-vwx81 , 10.60692/1ft2x-abv97

High precision in microRNA prediction: a novel genome-wide approach based on convolutional deep residual networks

- Summary
- Subjects
- Metrics

Abstract

AbstractMotivationMicroRNAs (miRNAs) are small non-coding RNAs that have a key role in the regulation of gene expression. The importance of miRNAs is widely acknowledged by the community nowadays, and the precise prediction of novel candidates with computational methods is still very needed. This could be done by searching homologous with sequence alignment tools, but this will be restricted only to sequences very similar to the known miRNA precursors (pre-miRNAs). Further-more, other important properties of pre-miRNAs, such as the secondary structure, are not taken into account by these methods. Many machine learning approaches were proposed in the last years to fill this gap, but these methods were tested in very controlled conditions, which are not fulfilled, for example, when predicting in newly sequenced genomes, where no miRNAs are known. If these methods are used under real conditions, the precision achieved is far from the one published.ResultsThis work provides a novel approach for dealing with the computational prediction of pre-miRNAs: a convolutional deep residual neural network. The proposed model has been tested on several complete genomes of animals and plants, achieving a precision up to 5 times higher than other approaches at the same recall rates. Also, a novel validation methodology is used to ensure that the performance reported can be achieved when using the method on new unknown species.AvailabilityTo provide fast an easy access to mirDNN, a web demo is availablehere. It can process fasta files with multiple sequences to calculate the prediction scores, and can generate the nucleotide importance plots. The full source code of this project is availablehereandhere.Contactcyones@sinc.unl.edu.ar

Related Organizations

National Scientific and Technical Research Council
Argentina

Keywords

Cancer Research, Artificial intelligence, Non-coding RNA Networks, MicroRNA Regulation in Cancer and Development, genome annotation, Role of Long Noncoding RNAs in Cancer and Development, Convolutional neural network, Gene prediction, Gene, Computational biology, Computer security, Biochemistry, Genetics and Molecular Biology, Machine learning, Genetics, FOS: Mathematics, RNA Sequencing Data Analysis, Key (lock), Molecular Biology, Data mining, Biology, Coding (social sciences), Precision and recall, Genome, microRNA, RNA Regulation, Statistics, Life Sciences, Deep learning, Computer science, Algorithm, MicroRNAs, Residual, FOS: Biological sciences, sequence alignment, Mathematics

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average