生物多样性 ›› 2024, Vol. 32 ›› Issue (10): 24313.  DOI: 10.17520/biods.2024313  cstr: 32101.14.biods.2024313

• 综述 • 上一篇    下一篇

鸟声标注技术及其在被动声学监测中的应用

郭倩茸1, 段淑斐1,*()(), 谢捷2(), 董雪燕3, 肖治术4()   

  1. 1.太原理工大学电子信息工程学院, 太原 030600
    2.南京师范大学计算机与电子信息学院/人工智能学院, 南京 210023
    3.北京联合大学特殊教育学院, 北京 100075
    4.中国科学院动物研究所农业虫害鼠害综合治理研究国家重点实验室, 北京 100101
  • 收稿日期:2024-07-12 接受日期:2024-09-27 出版日期:2024-10-20 发布日期:2024-12-09
  • 通讯作者: *E-mail: duanshufei@tyut.edu.cn
  • 基金资助:
    国家自然科学基金(32371556);国家自然科学基金(12004275);山西省自然科学基金(202403021211098);山西省回国留学人员科研教研资助项目(2024-060)

Advances in bird sound annotation methods for passive acoustic monitoring

Qianrong Guo1, Shufei Duan1,*()(), Jie Xie2(), Xueyan Dong3, Zhishu Xiao4()   

  1. 1. College of Electronic Information Engineering, Taiyuan University of Technology, Taiyuan 030600, China
    2. College of Computer and Electronic Information/College of Artificial Intelligence, Nanjing Normal University, Nanjing 210023, China
    3. College of Special Education, Beijing Union University, Beijing 100075, China
    4. State Key Laboratory of Integrated Management of Pest Insects and Rodents in Agriculture, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
  • Received:2024-07-12 Accepted:2024-09-27 Online:2024-10-20 Published:2024-12-09
  • Contact: *E-mail: duanshufei@tyut.edu.cn
  • Supported by:
    National Natural Science Foundation of China(32371556);National Natural Science Foundation of China(12004275);Natural Science Foundation of Shanxi Province(202403021211098);Shanxi Scholarship Council of China(2024-060)

摘要:

鸟声标注用于标记声音中的鸟类信息, 如种类、声音结构等, 是鸟类被动声学监测及相关声学数据分析、物种自动识别分类的重要基础。本文以鸟声标注为重点, 比较了人工标注、自动标注和半自动标注等常用方法的优势, 点明了各自在数据质量、标注一致性和标注效率等方面面临的挑战, 同时探讨了这些标注方法在被动声学监测中的应用进展, 提出了自动标注模型优化、跨地区数据集建立和半自动标注系统完善等未来发展方向。尽管目前自动标注方法取得了显著进展, 但鸟声标注仍面临冷启动问题, 亟需更大规模的跨地区数据集和高效的质量检测半自动标注系统, 以满足标注数量和质量的双重要求。本综述有助于帮助鸟声数据集创建者和标注者更好地理解现有标注技术及其潜在的发展趋势, 为大规模鸟类声学监测数据的高效物种自动识别提供技术支撑。

关键词: 鸟声数据集, 人工标注, 半自动标注, 自动标注, 鸟声识别, 被动声学监测

Abstract

Background & Aim Bird sound annotation is essential for marking bird-related information in audio data, such as species identification and sound structure. It serves as a crucial foundation for passive acoustic monitoring, birds acoustic data analysis, as well as automatic species identification and classification. The purpose of this review is to help bird sound dataset creators and annotators better understand the existing labeling technologies and their potential development trends. It also provides technical support for improving the efficiency of automatic species identification in large-scale avian acoustic monitoring data.

Summary This paper compares the advantages of various common methods such as manual annotation, automatic annotation, and semi-automatic annotation. It highlights the challenges each method faces in terms of data quality, annotation consistency and annotation efficiency. The review also discusses recent applications of these methods in passive acoustic monitoring annotation models, establishing cross-regional datasets, and enhancing semi-automatic annotation systems.

Perspectives Despite significant progress in automatic annotation methods, challenges such as cold start remain. The field urgently needs larger-scale cross-regional datasets and efficient semi-automatic annotation systems to ensure quality control to meet the increasing demands for both annotation volume and accuracy.

Key words: bird sound dataset, manual annotation, semi-automatic annotation, automatic annotation, bird sound recognition, passive acoustic monitoring