Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ IEEE Accessarrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
IEEE Access
Article . 2024 . Peer-reviewed
License: CC BY NC ND
Data sources: Crossref
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
IEEE Access
Article . 2024
Data sources: DOAJ
DBLP
Article . 2024
Data sources: DBLP
versions View all 3 versions
addClaim

LTEA-YOLO: An Improved YOLOv5s Model for Small Object Detection

Authors: Bo Li; Shengbao Huang; Guangjin Zhong;

LTEA-YOLO: An Improved YOLOv5s Model for Small Object Detection

Abstract

Small target information has a lower proportion and severe background interference in the image, which significantly restrains the performance of small object detection algorithms. Most detection models today have a large size, making them unsuitable for deployment on mobile terminals. Based on YOLOv5s, we proposed a light-weight model, LTEA-YOLO, with a model size of only 13.2MB, which has a Light-weight Transformer and Efficient Attention mechanism for small object detection. Firstly, a new light-weight Transformer module, called the inverted Residual Mobile Block (iRMB), is employed as a back-bone network to extract features. Secondly, we created a DBMCSP module (Diverse Branch Modules are inserted into Cross-Stage Partial network), which takes the place of all $C3$ modules in the fusion section, to extract a wider range of feature information without compromising the speed of inference. We then employ $WIoU_{v3}$ as the loss function of box regression to accelerate training convergence and improve positioning precision. Finally, we developed a light-weight and efficient Coordinate and Adaptive Pooling Attention (CAPA) module, which performs better than the Coordinate Attention (CA) module, to be embedded into the SPPF module to enhance detection accuracy. Our model gets 97.8% at mAP@0.5 on the NWPU VHR-10 dataset, which is 3.7% better than YOLOv8s and 6% better than the baseline model YOLOv5s-7.0. In experiments with the VisDrone 2019 dataset, its mAP@0.5 reached 35.8%, outperforming other comparison models. Our LTEA-YOLO, with its small model size, demonstrates superior overall performance in detecting challenging small objects.

Keywords

YOLOv5, small object detection, lightweight transformer, Attention mechanism, Electrical engineering. Electronics. Nuclear engineering, TK1-9971

  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    1
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
Powered by OpenAIRE graph
Found an issue? Give us feedback
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
1
Average
Average
Average
gold