Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ The Journal of Engin...arrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
The Journal of Engineering
Article . 2022 . Peer-reviewed
License: CC BY
Data sources: Crossref
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
The Journal of Engineering
Article . 2022
Data sources: DOAJ
https://dx.doi.org/10.60692/cp...
Other literature type . 2022
Data sources: Datacite
https://dx.doi.org/10.60692/2t...
Other literature type . 2022
Data sources: Datacite
versions View all 4 versions
addClaim

This Research product is the result of merged Research products in OpenAIRE.

You have already added 0 works in your ORCID record related to the merged Research product.

An intelligent ubiquitous compression technique for DNA sequencing using Hadoop

تقنية ضغط ذكية في كل مكان لتسلسل الحمض النووي باستخدام Hadoop
Authors: Wenlin Sun; Ashutosh Sharma; Evans Asenso;

An intelligent ubiquitous compression technique for DNA sequencing using Hadoop

Abstract

Abstract To solve the problem of reducing the amount of data storage in the practical application of massive biomedical data and efficiently using existing storage devices and bandwidth resources to store shared data. The proposed model includes both compression modes: the first is a single sequence compression mode designed for the characteristics of a large number of repeated substrings in DNA sequences; the second is a reference‐based multi‐sequence compression mode designed for the very similar characteristics of DNA sequences of different individuals of the same species. Both types of compression use the Lempel–Ziv–Welch (LZ) compression method, which is quite comparable to one another, to examine the sequence, as well as to study and classify the repetitive data that exists between a single sequence and numerous sequences in the sequence set. The proposed method aims to solve the problem of high pressure caused by single‐point processing of large sequence files, effectively reduces redundant information by using the local correlation of data, and effectively uses the computing resources of a cloud platform that is used for biological information processing to support the efficient storage, transmission, and sharing of data.

Related Organizations
Keywords

FOS: Computer and information sciences, Composite material, Computer Networks and Communications, Set (abstract data type), Substring, Real-time computing, Artificial Intelligence, Data Mining Techniques and Applications, Hashing, Genetics, Cloud computing, Data mining, Biology, Text Compression and Indexing Algorithms, Compression, Engineering (General). Civil engineering (General), Computer science, Materials science, Programming language, Algorithm, Operating system, Distributed Storage Systems and Network Coding, Data compression, FOS: Biological sciences, Computer Science, Physical Sciences, Compression (physics), TA1-2040, Information Systems, Sequence (biology)

  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
Powered by OpenAIRE graph
Found an issue? Give us feedback
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
0
Average
Average
Average
gold