[1] Viola P, Jones M. Rapid object detection using a boosted cascade of simple features [C]// Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 2001: 990517. [2] Dalal N, Triggs B. Histograms of oriented gradients for human detection [C]// 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 2005: 886-893. [3] Felzenszwalb P F, Girshick R B, McAllester D, et al. Object detection with discriminatively trained part-based models [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2010, 32(9): 1627-1645. [4] Krizhevsky A, Sutskever I, Hinton G E. ImageNet classiflcation with deep convolutional neural networks [J]. Communications of the ACM, 2017, 60(6): 84-90. [5] Girshick R, Donahue J, Darrell T, et al. Rich feature hierarchies for accurate object detection and semantic segmentation [C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2014: 580-587. [6] He K, Zhang X, Ren S, et al. Spatial pyramid pooling in deep convolutional networks for visual recognition [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 37(9): 1904-1916. [7] Girshick R. Fast R-CNN [C]// Proceedings of the IEEE International Conference on Computer Vision. 2015: 1440-1448. [8] Ren S, He K, Girshick R, et al. Faster R-CNN: towards real-time object detection with region proposal networks [J]. Advances in Neural Information Processing Systems, 2015, 39(6): 1137-1149. [9] Lin T Y, Dollar P, Girshick R Á, et al. Feature pyramid networks for object detection [C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2017: 2117- 2125. [10] Cai Z, Vasconcelos N. Cascade R-CNN: delving into high quality object detection [C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2018: 6154- 6162. [11] Redmon J, Divvala S, Girshick R, et al. You Only Look Once: unifled, real-time object detection [C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2016: 779-788. [12] Wang C Y, Bochkovskiy A, Liao H Y M. YOLOv7: trainable bag-of-freebies sets new state-of-the-art for real-time object detectors [C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2023: 7464-7475. [13] Vaswani A. Attention is all you need [C]// Proceedings of the 31st International Conference on Neural Information Processing Systems. 2017: 6000-6010 [14] Carion N, Massa F, Synnaeve G, et al. End-to-end object detection with transformers [C]// European Conference on Computer Vision. 2020: 213-229. [15] Zhao Y, Lu W, Xu S, et al. DETRs beat YOLOs on real-time object detection [C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2024: 16965-16974. [16] Varghese R, Sambath M. YOLOv8: a novel object detection algorithm with enhanced performance and robustness [C]// 2024 International Conference on Advances in Data Engineering and Intelligent Computing Systems. 2024: 1-6. [17] Wang A, Chen H, Liu L, et al. YOLOv10: real-time end-to-end object detection [C]// 38th Conference on Neural Information Processing Systems. 2024: 14458. |