Compared with conventional targets, small objects are harder to detect because they occupy fewer pixels, appear at lower resolution, show weaker contrast, and suffer more background interference. To address this issue, this paper proposes PC-YOLO11s, an improved small object detection method based on the YOLO11 model. The core innovation of PC-YOLO11s lies in optimizing the structure of the detection network. First, PC-YOLO11s adjusts the hierarchical structure of the detection network and adds a P2 layer dedicated to small object detection. By extracting the features of small objects at the high-resolution stage of the network, the P2 layer helps the network capture small objects more reliably. At the same time, to reduce unnecessary computation and lower model complexity, we remove the P5 layer. Second, we introduce a coordinate spatial attention mechanism, which helps the network obtain the spatial and positional features required for small targets more accurately, further improving detection accuracy. On the VisDrone2019 dataset, experimental results show that PC-YOLO11s outperforms other existing YOLO-series models in overall performance. Compared with the baseline YOLO11s model, PC-YOLO11s raises mAP@0.5 from 39.5% to 43.8% and mAP@0.5:0.95 from 23.6% to 26.3%, while reducing the parameter count from 9.416M to 7.103M. We also applied PC-YOLO11s to tea bud datasets, where it again outperformed other YOLO-series models. These experiments show that PC-YOLO11s improves accuracy and generalizes well in small object detection tasks, and can meet the needs of small object detection in practical applications.
Keywords: VisDrone2019; YOLO11; attention mechanism; small object detection; tea bud.
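
To illustrate why adding a P2 layer and removing the P5 layer favors small objects, the following minimal sketch compares the feature-map grid at each pyramid level. It assumes the usual YOLO convention that level Pk has stride 2^k (P2 = 4, ..., P5 = 32); the 640-pixel input and the 16-pixel object are illustrative assumptions, not values taken from this paper, and the function names are hypothetical.

```python
# Illustrative sketch only. Strides follow the common YOLO pyramid convention
# (P2=4, P3=8, P4=16, P5=32). The 640 px input and 16 px object are assumed
# values chosen for the example, not settings reported in the paper.
STRIDES = {"P2": 4, "P3": 8, "P4": 16, "P5": 32}


def grid_size(input_size: int, stride: int) -> int:
    """Side length of the feature map produced at a given stride."""
    return input_size // stride


def cells_covered(object_size: float, stride: int) -> float:
    """How many feature-map cells an object of this size spans along one axis."""
    return object_size / stride


def describe(input_size: int = 640, object_size: float = 16.0):
    """Return (level, stride, grid side, cells spanned) for every pyramid level."""
    return [
        (level, stride, grid_size(input_size, stride), cells_covered(object_size, stride))
        for level, stride in STRIDES.items()
    ]


if __name__ == "__main__":
    for level, stride, grid, cells in describe():
        print(f"{level}: stride {stride:2d}, grid {grid}x{grid}, "
              f"16 px object spans {cells:.2f} cells per axis")
```

Under these assumptions, a 16-pixel object spans four cells per axis at P2 but only half a cell at P5, which is consistent with the motivation for detecting small objects at the high-resolution level and dropping the coarsest one.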