生物多样性 ›› 2026, Vol. 34 ›› Issue (1): 25278.  DOI: 10.17520/biods.2025278  cstr: 32101.14.biods.2025278

• 生态学数据分析方法专题 • 上一篇    下一篇

α多样性指数选择: 不等量采样下的模拟比较

邹怡*   

  1. 西交利物浦大学理学院健康与环境科学系, 江苏苏州 215123
  • 收稿日期:2025-07-17 修回日期:2025-09-08 接受日期:2025-09-24 出版日期:2026-01-20 发布日期:2026-01-21
  • 通讯作者: 邹怡

Alpha-diversity index selection: Simulation comparison under unequal sampling

Yi Zou*   

  1. Department of Health and Environmental Sciences, School of Science, Xi’an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
  • Received:2025-07-17 Revised:2025-09-08 Accepted:2025-09-24 Online:2026-01-20 Published:2026-01-21
  • Contact: Yi Zou

摘要: 采样不均衡是群落生态学实地调查的普遍问题。如何选择合适的α多样性度量指标, 使其在样点间样本量差异下有稳定的表现对于生物多样性研究十分重要。本文通过模拟群落的方法, 评估了9个α多样性度量指标的表现, 包含5个直接计算的“观测型指数”: (1)物种丰富度, (2) Shannon指数, (3) Simpson指数, (4) Hurlbert稀释物种数, (5) Fisher’s α指数, 以及4个估算丰富度的“估算型指数”: (1) Chao1指数, (2)基于丰度的覆盖估计值(abundance-based coverage estimator, ACE), (3) iNEXT (interpolation/extrapolation)外推值, (4)总预期物种数(total expected species, TES)。模拟评估了各个指数在不同的采样阈值下, 其样点间的方差被环境梯度解释力(线性模型R2)的准确性与精确性。模拟构建了20个样点的虚拟群落, 假设真实物种数S与环境梯度x呈线性关系且理论R2为0.8, 然后生成一系列梯度下, 不同最小采样阈值模拟的不等量采样场景, 并计算各指数与x的线性回归R2。结果显示, 采样强度(样点记录到的个体数及与之等价的采样完整度)是决定指数有效性的首要因素。随着样本量的提升, 所有α多样性度量指标的模型R2显著提升。在极低采样场景下(样点中最低样本量低于20个个体, 采样完整度 < 20%), 稀释物种数的平均R2明显优于其余指数; 最低样本量升至100个个体后, 估算型指数整体优于观测型指数。本研究进一步明确了各个指数恢复设定R2所需要的最小样本量及对应的采样完整度。综合来看, 在样本极少的不等量采样场景中, 优先推荐采用稀释物种数。在实际研究中, 应将稀释值设定在一个相对较高的水平(如 > 40个个体), 即使因此丢弃极端不足的样点, 也能在总体上提高样点间的可比性。当样本量充足时, 可采用物种丰富度估算指数, 以获得最接近真实梯度的丰富度外推值。

关键词: 群落生态学, 生物多样性指数, 物种丰富度, 不完全探测, 环境因子, 采样完整度

Abstract

Aims: Unequal sampling is a common issue in fieldbased community ecology. Choosing αdiversity metrics that remain robust when sample sizes vary among plots is critical for reliable biodiversity assessment. This study evaluated the performance of nine diversity indices, including five “observed” indices calculated directly from the data: (1) species richness, (2) Shannon index, (3) Simpson index, (4) Hurlbert’s rarefied richness, and (5) Fisher’s α; and four “richnessestimator” indices: (1) Chao1 index, (2) abundance-based coverage estimator (ACE),  (3) the extrapolated value of iNEXT (interpolation/extrapolation), and (4) total expected species (TES). 

Methods: Using simulation, the performance of each index was evaluated under a gradient of minimumsample thresholds, and for each case the accuracy and precision of between-sites variance (linear regression R2) was recorded. The simulation built up 20 sites in which “true” species richness (S) was linearly correlated with an environmental gradient (x) with a theoretical coefficient of determination R2 = 0.80. Four unequalsampling scenarios were then generated by imposing different minimum sample sizes per site. For each scenario, linear models were fitted between every diversity index and x, recording the corresponding R2

Results: The results indicate that sample size (the number of individuals recorded at a sampling site, as well as the equivalent sampling completeness) is the primary factor determining index performance. As sample size increased, model R2 of all diversity metrics significantly improved. Under extremely low sampling (minimum < 20 individuals; sampling coverage < 20 %), rarefied richness had a higher R2 than other indices. When the minimum sample size reached 100 individuals, the estimator indices group outperformed the observed indices. This study further clarified the minimum sample size and the corresponding sampling completeness required for each index to recover the predetermined R2

Conclusion: Overall, rarefied richness is recommended for highly unequal, sample sizepoor scenarios. In practice, rarefaction threshold should be set at a relatively high level (e.g., > 40 individuals) to enhance the overall comparability among sampling sites, even if it results in the exclusion of extremely under sampled sites. Once sampling completeness is adequate, richness estimators are preferable, as they can generate extrapolated richness that are close to the true gradient.

Key words: community ecology, biodiversity metrics, species richness, incomplete detection, environmental factor, sampling completeness