Biodiv Sci ›› 2026, Vol. 34 ›› Issue (1): 25278.  DOI: 10.17520/biods.2025278  cstr: 32101.14.biods.2025278

• Special Feature: Ecological Data Analysis Methods • Previous Articles     Next Articles

Alpha-diversity index selection: Simulation comparison under unequal sampling

Yi Zou*   

  1. Department of Health and Environmental Sciences, School of Science, Xi’an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
  • Received:2025-07-17 Revised:2025-09-08 Accepted:2025-09-24 Online:2026-01-20 Published:2026-01-21
  • Contact: Yi Zou

Abstract:

Aims: Unequal sampling is a common issue in fieldbased community ecology. Choosing αdiversity metrics that remain robust when sample sizes vary among plots is critical for reliable biodiversity assessment. This study evaluated the performance of nine diversity indices, including five “observed” indices calculated directly from the data: (1) species richness, (2) Shannon index, (3) Simpson index, (4) Hurlbert’s rarefied richness, and (5) Fisher’s α; and four “richnessestimator” indices: (1) Chao1 index, (2) abundance-based coverage estimator (ACE),  (3) the extrapolated value of iNEXT (interpolation/extrapolation), and (4) total expected species (TES). 

Methods: Using simulation, the performance of each index was evaluated under a gradient of minimumsample thresholds, and for each case the accuracy and precision of between-sites variance (linear regression R2) was recorded. The simulation built up 20 sites in which “true” species richness (S) was linearly correlated with an environmental gradient (x) with a theoretical coefficient of determination R2 = 0.80. Four unequalsampling scenarios were then generated by imposing different minimum sample sizes per site. For each scenario, linear models were fitted between every diversity index and x, recording the corresponding R2

Results: The results indicate that sample size (the number of individuals recorded at a sampling site, as well as the equivalent sampling completeness) is the primary factor determining index performance. As sample size increased, model R2 of all diversity metrics significantly improved. Under extremely low sampling (minimum < 20 individuals; sampling coverage < 20 %), rarefied richness had a higher R2 than other indices. When the minimum sample size reached 100 individuals, the estimator indices group outperformed the observed indices. This study further clarified the minimum sample size and the corresponding sampling completeness required for each index to recover the predetermined R2

Conclusion: Overall, rarefied richness is recommended for highly unequal, sample sizepoor scenarios. In practice, rarefaction threshold should be set at a relatively high level (e.g., > 40 individuals) to enhance the overall comparability among sampling sites, even if it results in the exclusion of extremely under sampled sites. Once sampling completeness is adequate, richness estimators are preferable, as they can generate extrapolated richness that are close to the true gradient.

Key words: community ecology, biodiversity metrics, species richness, incomplete detection, environmental factor, sampling completeness