EXPANSION OF THE FEATURE SPACE IN THE TASK OF SMALL OBJECT DETECTION IN IMAGES

  • V.V. Kovalev Scientific design bureau of computer systems
  • N.E. Sergeev Southern Federal University
Keywords: Small objects detection, convolutional neural networks, moving objects, retraining of neural networks

Abstract

One of the current trends in creating early object detection systems is the development of algorithms
for searching and recognizing small objects in images. In the early detection task, it is necessary
to recognize objects at long distances from the place where they were recorded by the camera.
The image in the image of such objects is represented by a small compact group of pixels, which
undergoes spatial and brightness changes from frame to frame. To successfully solve this problem,
real-world target objects must have large physical dimensions. In addition to the physical dimensions
of the object, the image of the object in the image is influenced by a large number of factors: the resolution
of the camera matrix, the focal length of the lens, the photosensitivity of the matrix, etc. The
vector for solving this problem is directed towards convolutional neural networks. However, even
advanced convolutional neural network architectures face challenges in finding and recognizing
small objects in images. This problem is directly related to the effect of overtraining the neural network
model. Retraining of a neural network model can be assessed based on learning curve analysis.
To reduce the likelihood of overfitting, special methods are used, which are united by the term regularization.
However, in recognizing small-sized objects, existing regularization methods are not
enough. The work examines the developed algorithm for preprocessing a sequence of video frames,
which increases the original feature space with a new independent feature of movement in the frame.
The preprocessing algorithm is based on spatiotemporal filtering of a sequence of video frames, the
application of which extends to a wide range of convolutional neural network architectures. To study
the characteristics of accuracy and recognition of convolutional neural networks, datasets of
grayscale images and images with a sign of motion were generated based on the 3D graphics development
environment Unreal Engine 5. The work presents a criterion for the small size of objects in
images. The accuracy characteristics of the test model of the convolutional neural network were
trained and assessed, and the dynamics of the learning curves of the test model were analyzed. The
positive influence of the proposed algorithm for preprocessing a sequence of video frames on the
integral accuracy of detection of small-sized objects is shown.

References

1. Chen C., Liu M.-Y., Tuzel O., Xiao J. R-CNN for small object detection // Asian Conference on
Computer Vision, Springer, Taipei, Taiwan, November, 2016. – P. 214-230.
2. Bochkovskiy A., Wang C.-Y., Mark Liao H.-Y. YOLOv4: Optimal Speed and Accuracy of Object
Detection // arXiv. – 2020. – Available at: https://arxiv.org/abs/2004.10934.
3. Hu G., Yang Z., Hu L., Huang L., Han J. Small Object Detection with Multiscale Features //
International Journal of Digital Multimedia Broadcasting. – 2018.
4. Nguyen N.-D., Do T., Ngo T. D., Le D.-D. An evaluation of deep learning methods for small
object detection // Journal of Electrical and Computer Engineering. – 2020.
5. Feng J. et al. Small Object Detection in Optical Remote Sensing Video with Motion Guided
R-CNN // IEEE International Geoscience and Remote Sensing Symposium, Waikoloa, HI,
USA, 2020. – P. 272-275.
6. Cao G., Xie X., Yang W., Liao Q., Shi G., Wu J. Feature-fused SSD: fast detection for small objects
// Ninth International Conference on Graphic and Image Processing (ICGIP 2017). – 2018.
7. Zhang X., Izquierdo E., Chandramouli K. Dense and Small Object Detection in UAV Vision
Based on Cascade Network // 2019 IEEE/CVF International Conference on Computer Vision
Workshop (ICCVW), Seoul, Korea (South), 2019. – P. 118-126.
8. Kisantal M., Wojna Z., Murawski J., Naruniec J., Cho K. Augmentation for small object detection
// arXiv. – 2019. – Available at:1902.07296.
9. Redmon J., Farhadi A. YOLOv3: An incremental improvement // arXiv. – 2018. – Available
at: https: arxiv.org/abs/1804.02767v1.
10. Liu W., Anguelov D., Erhan D. et al. SSD: single shot multibox detector // European Conference
on Computer Vision, vol. 9905 of Lecture Notes in Computer Science. – Springer, Cham,
Switzerland, 2016. – P. 21-37.
11. Zhou X., Wang D. Objects as Points // arXiv. – 2019. – Available at: ArXiv, abs/1904.07850.
12. Akyon F.C., Onur Altinuc S., Temizel A. Slicing Aided Hyper Inference and Fine-Tuning for
Small Object Detection // IEEE International Conference on Image Processing (ICIP), Bordeaux,
France, 2022. – P. 966-970.
13. Zhi T., Chunhua S., Hao C., Tong H. FCOS: Fully convolutional one-stage object detection //
Proceedings of the IEEE International Conference on Computer Vision (ICCV). – 2019.
– P. 9627-9636.
14. Smeelen M., Schwering P., Toet A., Loog M. Semi-hidden target recognition in gated viewer
images fused with thermal IR images // Inf. Fusion. – 2014. – P. 131-147.
15. Niu Y., Xu S., Wu L., et al. Airborne infrared and visible image fusion for target perception
based on target region segmentation and discrete wavelet transform[J] // Mathematical Problems
in Engineering. – 2012.
16. Liu S., Liu Z. Multi-Channel CNN-based Object Detection for Enhanced Situation Awareness
// arXiv. – 2020. – Available at: https://arxiv.org/ abs/1712.00075.
17. Lin T.-Y., Dollár P., Girshick R., He K., Hariharan B., Belongie S. Feature pyramid networks
for object detection // Proceedings of the IEEE Conference on Computer Vision and Pattern
Recognition (CVPR). – 2017. – P. 2117-2125.
18. Girshick R., Donahue J., Darrell T., Malik J. Rich feature hierarchies for accurate object detection
and semantic segmentation // Proceedings of the IEEE Conference on Computer Vision
and Pattern Recognition (CVPR). – 2014. – P. 580-587.
19. Верхаген К., Дейн Р., Грун Ф. и др. Распознавание образов. Состояние и перспективы.
– М.: Радио и связь, 1985. – 104 с.
20. Ковалев В.В. Сергеев Н.Е. Методы решения проблемы переобучения нейронных сетей в
задаче обнаружения малоразмерных объектов на изображениях // Международный кон-
гресс по интеллектуальным системам и информационным технологиям «IS&IT'23».
– 2023. – № 1. – C. 38-45.
21. Padilla R., Netto S.L., Silva E.A.B. A Survey on Performance Metrics for Object-Detection
Algorithms // 2020 International Conference on Systems, Signals and Image Processing
(IWSSIP), Niteroi, Brazil, 2020. – P. 237-242.
Published
2024-04-16
Section
SECTION IV. TECHNICAL VISION