Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ ZENODOarrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Dataset . 2025
License: CC BY
Data sources: ZENODO
ZENODO
Dataset . 2025
License: CC BY
Data sources: Datacite
ZENODO
Dataset . 2025
License: CC BY
Data sources: Datacite
versions View all 2 versions
addClaim

Deconstruct To Reconstruct: A Datatset For Parsing Complex CT Assemblies

Authors: Lippmann, Peter;

Deconstruct To Reconstruct: A Datatset For Parsing Complex CT Assemblies

Abstract

We have generated a high-quality dataset of simulated, physically accurate CT scans with ground truth annotations. The dataset comprises seven complex LEGO assemblies – six car models and one articulate vertebrate (T-Rex) – serving as proxies for real-world industrial assemblies. The number of parts per assembly ranges from 450 to 3600, with individual part catalogs containing between 86 and 205 unique parts.The task is challenging for the following reasons: The simulated CT scans are physically accurate, and include realistic noise and imaging artifacts. The CT scans are of high resolution ($\sim \! 2000^3$ voxels) and the 3D context is important for the precise localization of boundaries. The CT scans contain complex assemblies with up to $\sim \! 3600$ parts. Adjacent parts fit tightly. Parts vary substantially in size, ranging from small connectors to elongated components. Parts share identical subparts and exhibit symmetries. Two distinct parts may differ in fine details only. A small exemplary assembly is available as first_assembly in the ct_assembly_dataset_small . Furthermore, we make the checkpoint of our 3D-UNet available trained on boundary detection on the annotated CT scans.The folder unet_model contains the best validation check point of the trained model and a config-file for training and inference.The dataset for each assembly (e.g. first_assembly) is structured as follows: first_assembly_x10y5z5_dataset.h5: an HDF5 file with the raw CT scan and ground truth instance annotation and semantic labels. stl_catalog: a folder which contains all the meshes of the part catalog and a `first_assembly_info.json` file which contains all information on how to assemble the parts in the scene (the folder stls_watertight_replacements contains some manually fixed versions of non-watertight catalog parts). Details on the HDF5 file structure and contents Each file contains a raw scan (raw_input_volume), a corresponding ground-truth segmentation (gt_instance_volume), together with metadata about the scan setup and the part semantics: Datasets raw_input_volume (uint16): The raw volumetric scan data. gt_instance_volume (uint16): Ground-truth instance labels for each voxel. Attributes name (str): Name of the scanned assembly, e.g. "first_assembly". semantic_label_list (np.ndarray[str]): List of semantic class identifiers corresponding to instance IDs in the volume. raw_min / raw_max (np.uint16): Intensity range of the raw input volume. clipping_min_corner (np.ndarray[int]): Origin of the cropped region in the original simulated CT scan volume (which included more air around the assebmly), needed for alignment in the 3D scene. relative_scale_to_artist (float): Only needed if a scaling was applied to part meshes before simulating the CT scans (1.0 for all datasets). shift_to_place_meshes_in_volume (np.ndarray[float]): Offsets for aligning meshes extracted from volumes and meshes from the 3D scene. streak_str_addition (str): String indicating the rotation performed on the assembly to avoid streak artifacts in the CT scan, e.g. "x10y5z5" meaning that a rotation by 10° around the x-axis, by 5° around the y-axis and by 5° around the z-axis. voxelization_scale (np.float64): Resolution of the voxel grid (5.185185 for all datasets).

Keywords

Machine learning, Computed tomography

  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
Powered by OpenAIRE graph
Found an issue? Give us feedback
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
0
Average
Average
Average