Powered by OpenAIRE graph
Found an issue? Give us feedback
ZENODOarrow_drop_down
ZENODO
Dataset . 2026
License: CC BY
Data sources: Datacite
ZENODO
Dataset . 2026
License: CC BY
Data sources: Datacite
versions View all 2 versions
addClaim

Parameter-Efficient Adaptation of Generative-Foundation (Flux, Qwen) vs. Zero-Shot (Gemini, SAM3) Models for Aerial Image Segmentation

Authors: Shata, Dina; Denman, Simon; Omrani, Sara; Drogemuller, Robin; Ali, Hend; Wagdy, Ayman;

Parameter-Efficient Adaptation of Generative-Foundation (Flux, Qwen) vs. Zero-Shot (Gemini, SAM3) Models for Aerial Image Segmentation

Abstract

Dataset Overview: This dataset and the associated LoRA-adapted model weights (FLUX.1 Kontext, FLUX.2, and Qwen) provide the experimental framework and results for the study: 'Parameter-Efficient Adaptation of Generative-Foundation (Flux, Qwen) vs. Zero-Shot (Gemini, SAM3) Models for Aerial Image Segmentation', published in Buildings (MDPI), 2026, 16(7), 1369 https://doi.org/10.3390/buildings16071369 . The records include the Brisbane metropolitan training/testing tiles and the fine-tuned adapters that achieved a peak mean Intersection over Union (IoU) of 89% and 96% pixel accuracy, demonstrating the efficacy of generative foundation models for high-precision spatial analysis. Technical Specifications: · Geographic Focus: Brisbane, Australia (Metropolitan regions). · Task: Binary Semantic Segmentation (Rooftop vs. Non-rooftop). · Data Split: 121 total tiles (80/20 split)- Training: 97 tiles. · Testing: 24 tiles.Format: High-resolution aerial imagery paired with dense binary annotations. Benchmarked Models & Performance: This dataset serves as the benchmark for several state-of-the-art architectures evaluated under a unified protocol: Model Category Name IoU Zero-Shot Gemini 3 Pro 85% Segmentation Baseline SAM3 (Segment Anything Model) 84% LoRA-Adapted Diffusion FLUX.1-Kontext 89% Key Research Insights: · Diffusion as Segmentors: The dataset demonstrates that with only 250–5000 steps of LoRA fine-tuning, generative models like FLUX.1-Kontext can outperform dedicated segmentation models (SAM3), reaching a mean IoU of 89%. · Generalization: While Gemini 3 Pro provides a powerful zero-shot baseline, fine-tuned diffusion models offer superior boundary fidelity in heterogeneous urban morphologies. · Efficiency: The results highlight that parameter-efficient tuning is a viable path for transforming general-purpose AI into scalable spatial analysis tools for urban monitoring and solar potential assessment. Citation Dataset could be cited from: https://explore.openaire.eu/search/result?pid=10.5281%2Fzenodo.18571111 APA style ciataion: "Shata, D., Denman, S., Omrani, S., Drogemuller, R., Ali, H., & Wagdy, A. (2026). Parameter-Efficient Adaptation of Generative-Foundation (Flux, Qwen) vs. Zero-Shot (Gemini, SAM3) Models for Aerial Image Segmentation. https://doi.org/10.5281/zenodo.18571111" APA style Publication ciataion: "Shata, D., Denman, S., Omrani, S., Drogemuller, R., Ali, H., & Wagdy, A. (2026). Parameter-Efficient Adaptation of Generative-Foundation (Flux, Qwen) vs. Zero-Shot (Gemini, SAM3) Models for Aerial Image Segmentation. Buildings, 16(7), 1369. https://doi.org/10.3390/buildings16071369"

  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
Powered by OpenAIRE graph
Found an issue? Give us feedback
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
0
Average
Average
Average
Related to Research communities
OpenAIRE