
pmid: 37161118
<abstract> <p>Small object detection (SOD) is significant for many real-world applications, including criminal investigation, autonomous driving and remote sensing images. SOD has been one of the most challenging tasks in computer vision due to its low resolution and noise representation. With the development of deep learning, it has been introduced to boost the performance of SOD. In this paper, focusing on the difficulties of SOD, we analyze the deep learning-based SOD research papers from four perspectives, including boosting the resolution of input features, scale-aware training, incorporating contextual information and data augmentation. We also review the literature on crucial SOD tasks, including small face detection, small pedestrian detection and aerial image object detection. In addition, we conduct a thorough performance evaluation of generic SOD algorithms and methods for crucial SOD tasks on four well-known small object datasets. Our experimental results show that network configuring to boost the resolution of input features can enable significant performance gains on WIDER FACE and Tiny Person. Finally, several potential directions for future research in the area of SOD are provided.</p> </abstract>
Artificial intelligence, neural network, Object detection, Boosting (machine learning), Pattern recognition (psychology), Motion Detection, computer vision, Pedestrian detection, Visual Object Tracking and Person Re-identification, benchmark, Engineering, small object detection, Object Detection, Machine learning, QA1-939, deep learning, Deep learning, Pedestrian, Transport engineering, Computer science, Salient Object Detection, Object Tracking, Computer Science, Physical Sciences, Multiple Object Tracking, Computational Modeling of Visual Saliency Detection, Deep Learning in Computer Vision and Image Recognition, Computer vision, Computer Vision and Pattern Recognition, TP248.13-248.65, Mathematics, Biotechnology
Artificial intelligence, neural network, Object detection, Boosting (machine learning), Pattern recognition (psychology), Motion Detection, computer vision, Pedestrian detection, Visual Object Tracking and Person Re-identification, benchmark, Engineering, small object detection, Object Detection, Machine learning, QA1-939, deep learning, Deep learning, Pedestrian, Transport engineering, Computer science, Salient Object Detection, Object Tracking, Computer Science, Physical Sciences, Multiple Object Tracking, Computational Modeling of Visual Saliency Detection, Deep Learning in Computer Vision and Image Recognition, Computer vision, Computer Vision and Pattern Recognition, TP248.13-248.65, Mathematics, Biotechnology
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 47 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 1% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 1% |
