Analyzing billion-objects catalog interactively: Apache Spark for physicists

Article, Preprint English OPEN
Plaszczynski, S.; Peloton, J.; Arnault, C.; Campagne, J.E.
(2018)
  • Publisher: HAL CCSD
  • Related identifiers: doi: 10.1016/j.ascom.2019.100305
  • Subject: [PHYS.ASTR]Physics [physics]/Astrophysics [astro-ph] | Astrophysics - Instrumentation and Methods for Astrophysics | [PHYS.PHYS.PHYS-INS-DET]Physics [physics]/Physics [physics]/Instrumentation and Detectors [physics.ins-det] | Astrophysics - Cosmology and Nongalactic Astrophysics

International audience; Apache Spark is a Big Data framework for working on large distributed datasets. Although it is widely used in industry, its use remains rather limited in the academic community, where it is often restricted to software engineers. The goal of this paper is to show...