Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ ZENODOarrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Dataset . 2025
License: CC BY
Data sources: ZENODO
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Dataset . 2025
License: CC BY
Data sources: ZENODO
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Dataset . 2024
License: CC BY
Data sources: ZENODO
ZENODO
Dataset . 2025
License: CC BY
Data sources: Datacite
ZENODO
Dataset . 2025
License: CC BY
Data sources: Datacite
ZENODO
Dataset . 2025
License: CC BY
Data sources: Datacite
ZENODO
Dataset . 2024
License: CC BY
Data sources: Datacite
versions View all 4 versions
addClaim

Met Office UKCP Local CPM precipitation ML emulator dataset

Authors: Addison, Henry; Kendon, Elizabeth; Ravuri, Suman; Aitchison, Laurence; Watson, Peter AG;

Met Office UKCP Local CPM precipitation ML emulator dataset

Abstract

Met Office UKCP Local CPM precipitation ML emulator dataset This is a collection of three pairs of datasets. For each pair one sourced from CPM data and one from GCM data. Each dataset is made up of climate model variables extracted from the Met Office's storage system, combining many variables over many years. It consists of 3 NetCDF files (train.nc, test.nc and val.nc), a YML ds-config.yml file and a README (similar to this one but tailored to the source of the data). Code used to create the dataset can be found here: https://github.com/henryaddison/mlde-data (specifically the james-submission tag). The pairs are: bham64_ccpm-4x_12em_psl-sphum4th-temp4th-vort4th_pr.tar.gz and bham64_gcm-4x_12em_psl-sphum4th-temp4th-vort4th_pr.tar.gz: the main datasets used by the emulators, pairing coarse atmospheric conditions with precipitation as the target. bham64_ccpm-4x_12em_linpr_pr.tar.gz and bham64_gcm-4x_12em_linpr_pr.tar.gz: for cases when using coarse precipitaiton as predictor (same precipitation as target). bham64_ccpm-60km_12em_pr_pr.tar.gz and bham64_gcm-60km_12em_rawpr_pr.tar.gz: CPM and GCM precipitation on the 60km grid (predictor and target are the same variable). The YML file contains the configuration for the creation of the dataset, including the variables, scenario, ensemble members, spatial domain and resolution, and the scheme for splitting the data across the three subsets. Each NetCDF contains the same variables but split into different subsets (train, val and test) of the based on time dimension. Otherwise the NetCDF files have the sames dimensions and coordinates for ensemble_member, grid_longitude and grid_latitude. Spatial resolution: This has two parts - the resolution of the data and the grid resolution stored at in the file. For predictand variables this is 2.2km variables coarsened 4 times to 8.8km (this is the target grid). For predictor variables this is 2.2km variables conservatively regriddded to GCM 60km grid or variables from GCM (so already on 60km grid) then regrid (nearest neighbour) to the target grid of predictands. In the naming convention of resolution used in config files, 60km resolution is synonamous with the GCM grid and 2.2km resolution is synonamous with the CPM grid. The exceptions are bham64_ccpm-60km_12em_pr_pr.tar.gz and bham64_gcm-60km_12em_rawpr_pr.tar.gz where resolution is 60km GCM grid for predictors and predictands. Spatial domain: A 64x64 section of the 8.8km target grid covering England and Wales Time resolution: daily Time domain: 1st Dec 1980 to 30th Nov 2000; 1st Dec 2020 to 30th Nov 2040; 1st Dec 2060 to 30th Nov 2080. Uses a 360-day calendar. Scenario: RCP8.5 Ensemble Members: 01, 04-13 & 15 (these correspond to the 12 ensemble member runs from the CPM but don't carry intrinsic meaning). Split scheme: 70% training, 15% validation, 15% testing, split by choosing complete seasons at random, with an equal number of each season from each of the 3 time periods. Predictor variables bham64_ccpm-4x_12em_psl-sphum4th-temp4th-vort4th_pr.tar.gz and bham64_gcm-4x_12em_psl-sphum4th-temp4th-vort4th_pr.tar.gz: psl (hPa) - mean sea level pressure temp850, temp700, temp500, temp250 - air temperature (K) at 850, 700, 500 and 250 hPa vorticity850, vorticity700, vorticity500, vorticity250 - relative vorticity (s^-1) at 850, 700, 500 and 250 hPa spechum850, spechum700, spechum500, spechum250 - specific humidity at 850, 700, 500 and 250 hPa bham64_ccpm-4x_12em_linpr_pr.tar.gz and bham64_gcm-4x_12em_linpr_pr.tar.gz: linpr (kg m^-2 s^-1) bham64_ccpm-60km_12em_pr_pr.tar.gz and bham64_gcm-60km_12em_rawpr_pr.tar.gz: pr (kg m^-2 s^-1) Predictand variable bham64_ccpm-4x_12em_psl-sphum4th-temp4th-vort4th_pr.tar.gz and bham64_gcm-4x_12em_psl-sphum4th-temp4th-vort4th_pr.tar.gz: target_pr - precipitation rate (kg m^-2 s^-1) bham64_ccpm-4x_12em_linpr_pr.tar.gz and bham64_gcm-4x_12em_linpr_pr.tar.gz: target_pr - precipitation rate (kg m^-2 s^-1) bham64_ccpm-60km_12em_pr_pr.tar.gz and bham64_gcm-60km_12em_rawpr_pr.tar.gz: target_pr - precipitation rate (kg m^-2 s^-1) UPDATE 2025-03-27: Dataset tars are renamed to make it clearer their source (ccpm for coarsened CPM and gcm for GCM). UPDATE 2025-10: Add datasets with coarse precip as predictor and datasets on the 60km grid.

Related Organizations
  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
Powered by OpenAIRE graph
Found an issue? Give us feedback
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
0
Average
Average
Average