
Small targets in long-distance aerial photography have the problems of small size and blurry appearance, and traditional object detection algorithms face great challenges in the field of small-object detection. With the collection of massive data in the information age, traditional object detection algorithms have been gradually replaced by deep learning algorithms and have an advantage. In this paper, the YOLOV3-Tiny backbone network is augmented by using the pyramid structure of image features to achieve multi-level feature fusion prediction. In order to eliminate the loss of spatial feature information and hierarchical information caused by pooling operations in convolution processes and multi-scale operations in multi-layer structures, a spatial attention mechanism based on residual structure is proposed. At the same time, the idea of reinforcement learning is introduced to guide bounding box regression on the basis of the rough positioning of the native boundary regression strategy, and the variable IoU calculation method is used as the evaluation index of the reward function, and the boundary regression model based on the reward mechanism is proposed for fine adjustment. The VisDrone2019 data set was selected as the experimental data support. Experimental results show that the mAP value of the improved small-object detection model is 33.15%, which is 11.07% higher than that of the native network model, and the boundary regression accuracy is improved by 23.74%.
small target, spatial attention, residual structure, boundary regression, YOLOv3
small target, spatial attention, residual structure, boundary regression, YOLOv3
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 10 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
