
AbstractConsidering the challenges of low multi‐object detection accuracy and difficulty in identifying small targets caused by challenging environmental conditions including irregular lighting patterns and ambient noise levels in the mining environment with autonomous electric locomotives. A new network model based on SOD−YOLOv5s−4L has been proposed to detect multi‐objects for autonomous electric locomotives in underground coal mines. Improvements have been applied in YOLOv5s to construct the SOD−YOLOv5s−4L model, by introducing the SIoU loss function to address the mismatch between real and predicted bounding box directions, facilitating the model to learn target position information more efficiently. This research introduces a decoupled head to enhance feature fusion and improve the positioning precision of the network model, enabling rapid capture of multi‐scale target features. Furthermore, the detection capability of the model has been increased by introducing the small target detection layer which is developed by increasing the number of detection layers from three to four. The experimental results on multiple object detection dataset show that the proposed model achieves significant improvement in mean average precision (mAP) of almost 98% for various types of targets and an average precision (AP) of nearly 99% for small targets on the other hand it achieves 5.19% (mAP) and 9.79% (AP) compared to the YOLOv5s model. Furthermore, comparative analysis with other models like YOLOv7 and YOLOv8 shows that the proposed model has superior performance in terms of object detection.
Radiology, Nuclear Medicine and Imaging, Artificial intelligence, Object detection, Economics, Feature (linguistics), Noise (video), Pattern recognition (psychology), Motion Detection, Real-time computing, Visual Object Tracking and Person Re-identification, Bounding overwatch, QA76.75-76.765, Health Sciences, Object Detection, Photography, Image (mathematics), Computer software, TR1-1050, Data mining, Position (finance), object detection, Linguistics, Applications of Deep Learning in Medical Imaging, Computer science, image processing, FOS: Philosophy, ethics and religion, Object Tracking, Philosophy, image recognition, Computer Science, Physical Sciences, Multiple Object Tracking, FOS: Languages and literature, identification, Medicine, Deep Learning in Computer Vision and Image Recognition, Foreground Segmentation, Object (grammar), Computer vision, Computer Vision and Pattern Recognition, Minimum bounding box, Finance
Radiology, Nuclear Medicine and Imaging, Artificial intelligence, Object detection, Economics, Feature (linguistics), Noise (video), Pattern recognition (psychology), Motion Detection, Real-time computing, Visual Object Tracking and Person Re-identification, Bounding overwatch, QA76.75-76.765, Health Sciences, Object Detection, Photography, Image (mathematics), Computer software, TR1-1050, Data mining, Position (finance), object detection, Linguistics, Applications of Deep Learning in Medical Imaging, Computer science, image processing, FOS: Philosophy, ethics and religion, Object Tracking, Philosophy, image recognition, Computer Science, Physical Sciences, Multiple Object Tracking, FOS: Languages and literature, identification, Medicine, Deep Learning in Computer Vision and Image Recognition, Foreground Segmentation, Object (grammar), Computer vision, Computer Vision and Pattern Recognition, Minimum bounding box, Finance
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 3 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
