Downloads provided by UsageCounts
arXiv: 1712.03254
handle: 2117/119211 , 2117/127558
The emergence of Next Generation Sequencing (NGS) platforms has increased the throughput of genomic sequencing and in turn the amount of data that needs to be processed, requiring highly efficient computation for its analysis. In this context, modern architectures including accelerators and non-volatile memory are essential to enable the mass exploitation of these bioinformatics workloads. This paper presents a redesign of the main component of a state-of-the-art reference-free method for variant calling, SMUFIN, which has been adapted to make the most of GPUs and NVM devices. SMUFIN relies on counting the frequency of \textit{k-mers} (substrings of length $k$) in DNA sequences, which also constitutes a well-known problem for many bioinformatics workloads, such as genome assembly. We propose techniques to improve the efficiency of k-mer counting and to scale-up workloads like \sm that used to require 16 nodes of \mn to a single machine with a GPU and NVM drives. Results show that although the single machine is not able to improve the time to solution of 16 nodes, its CPU time is 7.5x shorter than the aggregate CPU time of the 16 nodes, with a reduction in energy consumption of 5.5x.
Submitted to the 19th IEEE International Conference on high Performance Computing and Communication (HPC 2017). Partially funded by European Research Council (ERC) under the European Union's Horizon 2020 research and innovation programme (grant agreement No 639595) - HiEST Project
Energy utilization, Nonvolatile storage, FOS: Computer and information sciences, Bioinformatics, Scale-up, Acceleration, GPU, Modern architectures, K-mer, DNA sequences, Efficient computation, Scheduling algorithms, Reduction in energy consumption, Instruction sets, Supercomputadors, Genomic sequencing, Bioinformàtica, :Informàtica [Àrees temàtiques de la UPC], Data communication systems, Scale-up Acceleration, Genomes, Smart city, Genome, Nonvolatile memory, DNA, Genomics, Program processors, Graphics processing unit, Genes, Non-volatile memory, Computer Science - Distributed, Parallel, and Cluster Computing, Bioinformatics--Data processing, Next-generation sequencing, NVM, Distributed, Parallel, and Cluster Computing (cs.DC), Àrees temàtiques de la UPC::Informàtica, Non-Volatile Memory, Graphics processing units
Energy utilization, Nonvolatile storage, FOS: Computer and information sciences, Bioinformatics, Scale-up, Acceleration, GPU, Modern architectures, K-mer, DNA sequences, Efficient computation, Scheduling algorithms, Reduction in energy consumption, Instruction sets, Supercomputadors, Genomic sequencing, Bioinformàtica, :Informàtica [Àrees temàtiques de la UPC], Data communication systems, Scale-up Acceleration, Genomes, Smart city, Genome, Nonvolatile memory, DNA, Genomics, Program processors, Graphics processing unit, Genes, Non-volatile memory, Computer Science - Distributed, Parallel, and Cluster Computing, Bioinformatics--Data processing, Next-generation sequencing, NVM, Distributed, Parallel, and Cluster Computing (cs.DC), Àrees temàtiques de la UPC::Informàtica, Non-Volatile Memory, Graphics processing units
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 7 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
| views | 80 | |
| downloads | 80 |

Views provided by UsageCounts
Downloads provided by UsageCounts