ejedsml Open Access Journal

European Journal of Emerging Data Science and Machine Learning

eISSN: Applied
Publication Frequency : 2 Issues per year.

  • Peer Reviewed & International Journal
Table of Content
Issues (Year-wise)
Loading…

Open Access iconOpen Access

ARTICLE

OPTIMIZING YOLOV8N FOR ENHANCED PRECISION IN SMALL OBJECT DETECTION ON CUSTOM DATASETS

1 School of Artificial Intelligence, Tsinghua University, China
2 Department of Information Science, University of Tokyo, Japan

Citations: Loading…
ABSTRACT VIEWS: 43   |   FILE VIEWS: 90   |   PDF: 90   HTML: 0   OTHER: 0   |   TOTAL: 133
Views + Downloads (Last 90 days)
Cumulative % included

Abstract

Object detection, a fundamental task in computer vision, has witnessed significant advancements with the advent of deep learning. While state-of-the-art models like the YOLO series exhibit impressive performance across various applications, the accurate detection of small objects remains a persistent challenge. This article presents a comprehensive study on enhancing the YOLOv8n architecture, the smallest variant of the YOLOv8 family, specifically for improved small object recognition within custom datasets. We explore architectural modifications, advanced loss functions, and refined training strategies to bolster its capabilities. Experimental results on a simulated custom dataset, representative of scenarios with prevalent small targets, demonstrate that our refined YOLOv8n achieves superior performance metrics compared to its baseline counterpart, particularly in mean Average Precision (mAP) for small objects. These findings underscore the potential of targeted enhancements to off-the-shelf models for specialized object detection tasks.


Keywords

Object Detection, Deep Learning, YOLOv8n, Small Object Detection

References

[1] Girshick, R. (2015). Fast R-CNN (Version 2). arXiv. https://doi.org/10.48550/ARXIV.1504.08083

[2] Girshick, R., Donahue, J., Darrell, T., & Malik, J. (2013). Rich feature hierarchies for accurate object detection and semantic segmentation (Version 5). arXiv. https://doi.org/10.48550/ARXIV.1311.2524

[3] He, K., Gkioxari, G., Dollár, P., & Girshick, R. (2017). Mask R-CNN (Version 3). arXiv. https://doi.org/10.48550/ARXIV.1703.06870

[4] He, K., Zhang, X., Ren, S., & Sun, J. (2014). Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition. In D. Fleet, T. Pajdla, B. Schiele, & T. Tuytelaars (Eds.), Computer Vision – ECCV 2014 (Vol. 8691, pp. 346–361). Springer International Publishing. https://doi.org/10.1007/978-3-319-10578-9_23

[5] Howard, A. G., Zhu, M., Chen, B., Kalenichenko, D., Wang, W., Weyand, T., Andreetto, M., & Adam, H. (2017). MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications (Version 1). arXiv. https://doi.org/10.48550/ARXIV.1704.04861


How to Cite

OPTIMIZING YOLOV8N FOR ENHANCED PRECISION IN SMALL OBJECT DETECTION ON CUSTOM DATASETS. (2024). European Journal of Emerging Data Science and Machine Learning, 1(01), 63-80. https://parthenonfrontiers.com/index.php/ejedsml/article/view/128

Share Link