Powered by OpenAIRE graph
Found an issue? Give us feedback
ZENODOarrow_drop_down
ZENODO
Dataset . 2026
License: CC BY
Data sources: Datacite
ZENODO
Dataset . 2026
License: CC BY
Data sources: Datacite
versions View all 2 versions
addClaim

Dataset for R workflow for data analysis and comparison studies of phenotypic traits of Faba bean seeds

Authors: Ortega Polo, Rodrigo; Larkan, Nicholas; Hao Nan Tobey, Wang;

Dataset for R workflow for data analysis and comparison studies of phenotypic traits of Faba bean seeds

Abstract

The data has been used R workflows and PDF summaries for comparing faba bean seed measurements across different methods, including ground truth (GT-MM, GT-DM) and automated pipelines (CP, SVD, min, colorbox). Two sets of analyses are included: one for Length, Width, and Area, and another for Perimeter, Aspect-Ratio, and Circularity. Both workflows generate boxplots, scatterplots, and Altman-Bland plots with regression metrics and confidence intervals to assess measurement accuracy and agreement. Data Cleaning and Filtering: The workflow begins by handling missing and invalid entries: zeros and blank fields are systematically converted to NA across all measurement columns. To prepare the data for accurate comparison, the workflow employs two key strategies: Group Sorting: Measurements within each seed group are sorted independently by column to ensure that data points corresponding to the same physical seed are correctly aligned across all measurement methods. Localized Group Removal (Filtering): It is used to maximize the sample size N. Unlike row-wise deletion, which discards an entire data row if any single measurement is NA, this method is column-by-column. An entire seed group is removed only if data is missing in the two specific columns currently undergoing a comparison (e.g., Length-CP vs. Length-GT-MM).

  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
Powered by OpenAIRE graph
Found an issue? Give us feedback
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
0
Average
Average
Average