Powered by OpenAIRE graph
ZENODO
Dataset . 2025
License: CC BY
Data sources: ZENODO
Data sources: Datacite

Dynamic Sparsity: Challenging Common Sparsity Assumptions for Learning World Models in Robotic Reinforcement Learning Benchmarks

Abstract

The dataset contains data collected for 1 million time steps across different RL environments in MuJoCo Playground, using trained agents with actions sampled from the learned Gaussian distribution. Only the data for the following environments are released here, as an example: CartpoleBalance, CheetahRun, and FingerSpin.

Each environment-specific folder contains a .npz file with the keys described in the table below (here num_timesteps = 1e6).

| Key | Definition |
| --- | --- |
| states | True state values, including initial and terminal states, with shape (num_timesteps x state_dimension) |
| actions | Action values with shape (num_timesteps x action_dimension) |
| state_jacobians | Jacobian matrices representing the derivative of the next true state with respect to the current state, with shape (num_timesteps x state_dimension x state_dimension) |
| action_jacobians | Jacobian matrices representing the derivative of the next true state with respect to the action, with shape (num_timesteps x state_dimension x action_dimension) |
| obs | Observation values with shape (num_timesteps x obs_dim) |
| rewards | Reward values with shape (num_timesteps x 1) |
| dones | Done indicator vector: a binary vector of shape (num_timesteps x 1) indicating episode terminations. A value of 1 marks a timestep where an episode ends. The index immediately after it in the states data holds the terminal state. At this terminal index, all keys except for "observation" are set to -1. |
| total_steps_collected | The total number of timesteps collected |
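As a rough illustration of how the keys above might be used, the following Python sketch loads one .npz file, masks out the terminal-state rows, and measures how many Jacobian entries are numerically zero. It is not the authors' code. The function names, the tolerance `tol`, and the example file path are hypothetical, and the masking rule encodes one reading of the description: if dones[t] is 1, row t + 1 is the terminal row filled with -1.

```python
import numpy as np


def load_npz(path):
    """Read every array of a .npz archive into a plain dict."""
    with np.load(path) as archive:
        return {key: archive[key] for key in archive.files}


def transition_mask(dones):
    """Boolean mask over timesteps that keeps rows holding real transitions.

    Assumption: when dones[t] == 1, row t + 1 is the terminal-state row
    whose other keys are filled with -1, so that row is dropped.
    """
    done_flags = np.asarray(dones).reshape(-1) == 1
    mask = np.ones(done_flags.shape[0], dtype=bool)
    terminal_rows = np.flatnonzero(done_flags) + 1
    # A done flag on the very last row has no following row to drop.
    terminal_rows = terminal_rows[terminal_rows < mask.shape[0]]
    mask[terminal_rows] = False
    return mask


def zero_fraction(jacobians, mask, tol=1e-8):
    """Fraction of Jacobian entries with |value| <= tol over the kept rows.

    tol is an illustrative threshold, not a value given by the dataset.
    """
    kept = np.asarray(jacobians)[mask]
    return float(np.mean(np.abs(kept) <= tol))
```

A possible usage, assuming a hypothetical file name inside the CartpoleBalance folder:

```python
data = load_npz("CartpoleBalance/data.npz")  # hypothetical path
mask = transition_mask(data["dones"])
print(zero_fraction(data["state_jacobians"], mask))
print(zero_fraction(data["action_jacobians"], mask))
```

Dropping the terminal rows first matters because their -1 fill values would otherwise be counted as non-zero Jacobian entries.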

  • BIP!
    Impact by BIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average