Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ ZENODOarrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Other literature type . 2023
License: CC BY
Data sources: ZENODO
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Other literature type . 2023
License: CC BY
Data sources: ZENODO
ZENODO
Presentation . 2023
License: CC BY
Data sources: Datacite
ZENODO
Presentation . 2023
License: CC BY
Data sources: Datacite
versions View all 2 versions
addClaim

This Research product is the result of merged Research products in OpenAIRE.

You have already added 0 works in your ORCID record related to the merged Research product.

Next generation research data repositories: bringing computation to data for exploratory data analysis and visualisation

Authors: Girgin, Serkan; Castellanos Nash, Pablo; De la Paz Ruiz, Nestor; Matcov, Alexandru;

Next generation research data repositories: bringing computation to data for exploratory data analysis and visualisation

Abstract

Data repositories allow open access to research data, but most of the repository platforms have limitations reducing FAIRness. Lack of support for folders results in datasets to be published as large and compressed archive files that reduces accessibility and interoperability. Similarly, limited support for random data access prevents cloud-native formats to be utilized effectively and efficiently through the repositories. Because data preview capabilities of the repositories are also limited, in practice, the researchers need to download large datasets and find ways to explore them to understand their content and quality. Open Data Explorer aims to lessen these needs and facilitate rapid exploratory data analysis and visualisation of research data by providing a ready-to-use interactive computing platform where research data is directly available for computing. Each user is provided with a dedicated computing environment based on JupyterLab that supports interactive notebooks and a rich set of data access, analysis, and visualization packages in multiple languages (e.g., Python, R). To enable zero-waiting time access to a large number of datasets, the platform caches research data in a useful state (i.e., uncompressed) on-demand and also proactively by monitoring popular and new datasets available on selected data repositories. Example exploratory data analysis notebooks are automatically generated for each dataset based on its file types (e.g., CVS, NetCDF, GeoTIFF) and they are further tailored according to the dataset content so that they can be directly used for analysis with minimum user input. With the provided features, the open-source Open Data Explorer platform prevents unnecessary and ineffective downloading of datasets and reduces the time to explore research data. It also resolves the need for a separate computing environment to explore research data, and moreover facilitates the exploration task through tailor-made template notebooks. This talk will provide in-depth information about the architecture of the platform and its capabilities. The platform will also be demonstrated by using different research data repositories and research datasets.

Related Organizations
  • BIP!
    Impact byBIP!
    citations
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
Powered by OpenAIRE graph
Found an issue? Give us feedback
citations
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
0
Average
Average
Average
Green