Welcome to Chinese Journal of Applied Ecology! Today is Share:

Chinese Journal of Applied Ecology ›› 2026, Vol. 37 ›› Issue (1): 263-272.doi: 10.13287/j.1001-9332.202601.036

• Original Articles • Previous Articles     Next Articles

Intelligent identification method of river and lake birds based on improved YOLOv8n model

WANG Junwen1, ZHANG Zhengyin2, LIU Chang2, LIU Zifeng3, QI Haoran3, ZHAO Zhiyao4*   

  1. 1School of Artificial Intelligence, China University of Mining &Technology-Beijing, Beijing 100083, China;
    2School of Science, China University of Mining & Technology-Beijing, Beijing 100083, China;
    3College of Information Science and Technology, Beijing University of Chemical Technology, Beijing 100029, China;
    4School of Computer and Artificial Intelligence, Beijing Technology and Business University, Beijing 100048, China
  • Received:2025-07-07 Revised:2025-12-03 Published:2026-07-18

Abstract: To address the challenge of balancing lightweight design and high accuracy in bird target recognition algorithms, we developed an improved model named YOLOv8-MAT-2H based on YOLOv8n, by fully considering the small size, sparse distribution, and complex backgrounds of birds in river and lake environments. First, we introduced the multi-scale feature module (MSBlock) to enhance the representation capability of bird features. Second, we adopted the adaptive down-sampling module (ADown) to strengthen extraction of edge and fine-grained features. Then, we employed a reduced detection head (Reduced Head) to trim the P5 branch, which significantly reduced parameter count and computational cost. Finally, we incorporated the adaptive-threshold focal loss (ATFL) into the loss function, so the model focused more on hard-to-detect targets. We conducted actual testing on a dataset containing 19003 images across 17 categories. The results showed that our model increased the mean value of average precision from 0.704 to 0.722, reduced parameters from 3.01 M to 2.41 M, and decreased giga floating-point operations per second from 8.1 to 7.3. Our approach could maintain real-time detection performance at 714.3 frames per second, and significantly enhanced responsiveness to small targets and complex backgrounds, providing an efficient and practical solution for deploying intelligent bird monitoring systems on edge devices in river and lake environments.

Key words: object detection, river and lake bird recognition, lightweight YOLOv8, data augmentation