
Object detection plays a crucial role in various applications, including surveillance, autonomous driving, and industrial automation, where accurate and timely identification of objects is essential. This research proposes a novel framework that combines the YOLOv8 backbone network with an attention mechanism and a Transformer-based detection head, significantly enhancing object detection performance in real-time images and video. The incorporation of attention mechanisms refines feature extraction from complex scenes, enabling the model to focus on relevant regions within images. Using the integration of Transformer architecture, the model leverages long-range dependencies and global context, leading to more accurate bounding box predictions. The proposed system effectively processes real-time data, demonstrating superior classification performance with precision rates reaching 96.78 % and recall rates of 96.89 %. The mean average precision (mAP) is calculated at 89.67 %, showcasing the framework's robustness across various practical scenarios. The framework is developed to address challenges in object detection, such as detecting multiple objects in crowded environments and varying lighting conditions. The Python architecture supports the implementation of the proposed model. The Python architecture supports the implementation of the proposed model. The results section assesses the Attention Transformer-YOLOv8 model against established algorithms like Faster R-CNN, YOLOv3, YOLOv5n, and SSD, utilizing metrics.
Object detection, Real-time processing, YOLOv8, Attention mechanism, TA1-2040, Engineering (General). Civil engineering (General), Transformer architecture
Object detection, Real-time processing, YOLOv8, Attention mechanism, TA1-2040, Engineering (General). Civil engineering (General), Transformer architecture
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 16 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
