Biodiv Sci ›› 2026, Vol. 34 ›› Issue (4): 25287.  DOI: 10.17520/biods.2025287  cstr: 32101.14.biods.2025287

• Technology and Methodology • Previous Articles     Next Articles

Wildlife pose estimation based on the SCD-HRNet model and its application in biodiversity monitoring: A case study of the Saihanwula Region, Inner Mongolia

Ziyi Kong1,2,3, Degang Wang1,2,3, Jiantao Wang4, Zhiyong Pei5, Jing Sun6, Changchun Zhang1,2,3,*(), Junguo Zhang1,2,3,*()   

  1. 1 College of Engineering, Beijing Forestry University, Beijing 100083, China
    2 National Key Laboratory of Efficient Production of Forest Resources, Beijing 100083, China
    3 Research Center for Intelligent Biodiversity Monitoring, Beijing Forestry University, Beijing 100083, China
    4 Administration of Ulanba National Nature Reserve, Chifeng, Inner Mongolia 025450, China
    5 College of Energy and Transportation Engineering, Inner Mongolia Agricultural University, Hohhot 010018, China
    6 Wulanhe Local Nature Reserve Administration, Hinggan League, Ulanhot, Inner Mongolia 137400, China
  • Received:2025-07-20 Accepted:2025-10-22 Online:2026-04-20 Published:2026-05-27
  • Contact: *E-mail: zhangchangchun@bjfu.edu.cn; zhangjunguo@bjfu.edu.cn
  • Supported by:
    National Natural Science Foundation of China(32371874);National Natural Science Foundation of China(32401569);Beijing Natural Science Foundation(6244053);the Science and Technology Program of Shaanxi Academy of Sciences(2025K-32)

Abstract:

Aims: The conservation of wild animals in the Saihanwula region of Inner Mongolia is of great significance for maintaining regional biodiversity. Behavioral analysis helps enhance the scientific basis and intelligent management of biodiversity conservation, with pose estimation serving as the prerequisite and core support for behavioral analysis.
Methods: Aiming to solve the problem of decreased pose estimation accuracy caused by illumination changes, high-speed animal movement and complex environmental occlusion factors in wildlife monitoring, this paper proposed a novel wildlife pose estimation method combining attention mechanism and dynamic confidence suppression (selective coordinate-enhanced decoupling-HRNet, SCD-HRNet). Firstly, combined with the squeeze-and-excitation (SE) attention mechanism, the channel-level context features were extracted by global average pooling to enhance the discrimination ability of the network for species morphological features and effectively solve the problem of feature distortion caused by illumination changes. Secondly, in order to deal with the positioning deviation caused by high-speed animal movement, the coordinate attention (CA) mechanism was introduced to decompose the two-dimensional coordinates into horizontal and vertical components, and the bidirectional attention mechanism was used to establish the cross-direction long-range dependence relationship to improve the joint positioning accuracy under motion blur. Finally, the dynamic confidence suppression (DCS) module was proposed to establish an adaptive threshold function based on model inference accuracy to achieve robust detection of occluded key points.
Results: This paper carried out comparative experiments to verify the performance of the model. The experimental results showed that the mean average precision of SCD-HRNet method reaches 82.61% and 69.79% on the collected and labeled wild animal dataset in Saihanwula area and on the AP-10K public animal dataset, respectively, outperforming the existing methods.
Conclusion: The proposed SCD-HRNet method significantly improves the pose estimation accuracy of wildlife images in complex ecological scenes and provides reliable technical support for wildlife behavior analysis in ecological monitoring.

Key words: wild animals, pose estimation, HRNet, squeeze-and-excitation attention mechanism, coordinate attention mechanism, dynamic confidence suppression, infrared camera monitoring, biodiversity conservation