publication . Report . 2019

Big Data Analysis and Machine Learning at Scale with Oracle Cloud Infrastructure

Michał Bień;
Open Access
  • Published: 22 Nov 2019
Abstract
This work has successfully deployed two different use cases of interest for High Energy Physics using cloud resources:  CMS Big data reduction: This use case consists in running a data reduction workloads for physics data. The code and implementation has originally been developed by CERN openlab in collaboration with CMS and Intel in 2017-2018. It aims at demonstrating the scalability of a data reduction workflow, by processing ROOT files using Apache Spark  Spark DL Trigger: This use case consists in the deployment of a full data preparation and machine learning pipeline, starting from data ingestion (4.5 TB of ROOT data), to the training of classifie...
Subjects
free text keywords: CERN openlab, summer student programme, CERN openlab, summer student programme
Any information missing or wrong?Report an Issue