Comprehensive review of recent developments in visual object detection based on deep learning
Published in Electrical & Electronic Engineering, Physics, and Computational Sciences

In the fastevolving world of artificial intelligence, visual object detection has emerged as a foundational technology in computer vision, enabling machines to not only recognize objects in digital imagery but to also locate them with remarkable precision. This capability is pivotal in a broad range of intelligent systems from autonomous vehicles and smart surveillance cameras to medical imaging tools and industrial robotics. At the heart of this transformation lies deep learning (DL), a subfield of AI that has profoundly redefined the possibilities of object detection.
This paper presents a comprehensive review of the most recent and impactful developments in visual object detection, with a particular focus on deep learning-based methods. While traditional approaches relied heavily on handcrafted features and rule-based algorithms, modern detection systems leverage data-driven learning, end-to-end training pipelines, and highly optimized neural architectures to achieve extraordinary performance. This review captures that transition in detail, highlighting how convolutional neural networks (CNNs) and, more recently, transformer-based models have become central to state-of-the-art detection systems.
What sets this work apart is its dual focus on both depth and accessibility. It explores a wide array of detection frameworks, classifying them into one-stage and two-stage models. One-stage detectors like YOLO (You Only Look Once) and SSD (Single Shot Detector) are celebrated for their real-time speed, making them suitable for applications requiring immediate responsiveness. On the other hand, two-stage detectors such as Faster R-CNN offer superior accuracy, making them ideal for use cases where precision is critical. The paper examines how architectural refinements, backbone networks, and detection heads contribute to the trade-offs between speed and performance.
One of the key contributions of this review is a detailed comparative analysis of various detection algorithms, including both traditional and modern methods. Models are evaluated using standardized metrics such as mean Average Precision (mAP) and Frames Per Second (FPS), allowing readers to understand the strengths and limitations of each approach. Special attention is given to emerging architectures that incorporate transformers, which have recently gained popularity for their superior contextual awareness and ability to model long-range dependencies in images.
Beyond technical performance, the paper emphasizes the role of large-scale annotated datasets such as COCO, PASCAL VOC, and ImageNet in shaping the evolution of object detection models. These datasets serve not only as training ground but also as benchmarks that drive comparative research and competition in the field. By mapping models against these datasets, the review provides a grounded perspective on real-world applicability.
Another highlight is the exploration of practical applications. The review delves into how object detection is being deployed across industries whether it’s enhancing road safety through pedestrian detection in driver assistance systems, improving security via automated surveillance, or enabling more accurate diagnostics in medical imaging. Such examples underscore the far-reaching impact of DL-based detection systems in shaping the future of intelligent automation.
Moreover, the review identifies ongoing research trends and challenges. Topics like few-shot learning, edge deployment, interpretability, and ethical concerns especially in surveillance and bias are thoughtfully discussed, highlighting both the potential and the responsibility that come with technological advancement.
This review is uniquely positioned to benefit both new researchers seeking an entry point into object detection and seasoned practitioners looking to benchmark or innovate. It offers a well-organized synthesis of past achievements, current capabilities, and future directions in this rapidly progressing field.
For full access to this insightful paper, visit the article at Springer via https://rdcu.be/eqMG6 or explore the official publication at https://doi.org/10.1007/s10462-025-11284-w.
Follow the Topic
-
Artificial Intelligence Review
Artificial Intelligence Review is a fully open-access journal publishing cutting-edge AI and cognitive science research. It features evaluation of applications and algorithms, offers a platform for researchers and developers, and presents surveys, tutorials and commentary on key developments.
Please sign in or register for FREE
If you are a registered user on Research Communities by Springer Nature, please sign in