
Neural networks have been widely employed in the field of object detection. Transformers enable effective object detection through global context awareness, modular design, scalability, and adaptability to diverse target scales. However, small object detection requires careful consideration due to its comprehensive computations, data requirements, and real-time performance challenges. To address these issues, we present SwinVision, an innovative framework for small object detection in low-light environments. This research shows a balanced approach between computational efficiency and detection accuracy for advancing object detection in low-light scenarios. Firstly, a Swin Transformer-based computing network is introduced and optimized for object detection in large-scale areas. The framework balances computational power and resource efficiency, surpassing conventional transformers. Secondly, we present the STLE module, which enhances the features of low-light images for beneficial object detection. The last building block is a specialized Swin-based detection block for accurate detection of small, detailed objects in resource-constrained scenarios. Experiments conducted on the VisDrone dataset significantly ameliorated existing methods such as YOLOv8x, with a 6.31% increase in mAP and 12.55% in AP50. SwinVision’s effectiveness in low-light environments, especially with small objects, establishes a foundation for robust detection systems adapting to various environmental challenges.
Low-light image enhancement, improved Swin Transformer, Electrical engineering. Electronics. Nuclear engineering, attention mechanism, multi-scale feature fusion, TK1-9971
Low-light image enhancement, improved Swin Transformer, Electrical engineering. Electronics. Nuclear engineering, attention mechanism, multi-scale feature fusion, TK1-9971
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 1 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
