
Introduction This dataset is part of the plantAI project and contains images and field inventory data of tree seedlings in Swedish forestry. The dataset includes: Field inventory data for individual seedlings Site and environmental properties Image metadata Images from the right camera lens Annotation of a subset of the right camera images Images from the left camera lens are provided in a separate dataset. See the "Related works" section for links to associated datasets. Dataset structure The dataset is organized as follows: Images are provided as zip archives containing approximately 3000 images each, ordered by photo timestamp. seedling_observations.csv: The main data file containing seedling inventory data, image metadata, and site properties. Each row corresponds to one image. All categorical variables encoded as numeric codes are supplemented by a corresponding "_label" column containing human-readable descriptions. columns.csv: Describes all columns in seedling_observations.csv, including type and definitions. codebook.csv Lists and describes all categorical codes used in the dataset. labelstudio_annotation_photo_angles_2-5.zip Annotations of a subset of the images Only side views are represented, that is photo_angle 2-5. Region of interest as bounding box Seedling as oriented bounding box Seedling pose keypoints Column details Detailed column descriptions are provided in columns.csv. This section highlights selected variables. Column: `planting_spot` The `planting_spot` variable describes the type of soil preperation done at the planting location, based on visual judgement, and the types are defined as: Code Description SWE Description ENG 1 Omvänd torva mineral Inverted turf with mineral soil 2 Omvänd torva utan mineral Inverted turf without mineral soil 3 Grop, högt läge Pit, elevated position 4 Grop, lågt läge Pit, low position 5 Gångjärn Hinge 6 Mineralfläck Mineral spot 7 Pytsning mineraljord Spread of mineral soil 8 Pytsning humus Spread of humus 9 Störd humus Disturbed humus 10 Körspår Track rut 11 Invers Inversion 12 Omarkberett No preparation 13 Odefinierad Undefined Column: `site_preparation_method` The `site_preparation_method` column describes the type of soil preparation done on the site as whole, and are defined as: Code Description SWE Description ENG 0 Ingen None 1 Okänd Unknown 2 Harvning Scarification 3 Grävmaskin Excavator 4 Grävmaskin, utan entreprenör Excavator, non-entrepreneur 5 Grävmaskin med vissa spår, liknande harvning Excavator with some pulled tracks (similar to scarification) 6 Högläggning Mounding 7 Fläck, högläggning Spot mounding 8 Delvis högläggning Partial mounding Soil types The soil types are based on the Soil types 1:25 000–1:100 000 dated 2018-01-30, provided by SGU – Geological Survey of Sweden. The GNSS position of each seedling was used to extract soil type information from two map layers: Parent Material – Base Layer (JG2): Represents the dominant soil type at ~0.5 m depth. This includes areas of exposed bedrock or thin soil layers. All data points have a value in this layer. Surface Layer – Thin or Discontinuous (JY1): Represents surface layers thinner than ~0.5 m, or discontinuous layers averaging 0.5–1 m. Common examples include thin peat or till on bedrock. If present, JY1 overlays JG2. The following soil types appear in the dataset: Code Description SWE Description ENG 5 Kärrtorv Fen peat 19 Postglacial finlera Postglacial clay 40 Glacial lera Glacial clay 48 Glacial silt Glacial silt 55 Isälvssediment, sand Glaciofluvial sediment, sand 75 Torv Peat 95 Sandig morän Sandy till 100 Morän Till 888 Berg Rock 890 Urberg Bedrock (Precambrian rock) Label Studio annotation The annotations are provided as exported from open source software Label Studio. Data quality and uncertainty Several variables in the dataset are based on field observations and manual classification, including species, planting spot type, seedling vitality, and vegetation within the planting spot. These variables are subject to observer interpretation and may include classification uncertainty. Position data is provided together with estimated horizontal and vertical errors in meters (as provided from the GNSS).
