Biodiv Sci

Previous Articles     Next Articles

Alpha-diversity index selection: Simulation comparison under unequal sampling

Yi Zou*   

  1. Xi’an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
  • Received:2025-07-17 Revised:2025-09-08 Accepted:2025-09-24
  • Contact: Yi Zou

Abstract: Unequal sampling is a common issue in field‑based community ecology. Choosing α‑diversity metrics that remain robust when sample sizes vary among plots is critical for reliable biodiversity assessment. This study evaluated the performance of nine diversity indices, including five “observed” indices calculated directly from the data (species richness, Shannon index, Simpson index, Hurlbert’s rarefied richness and Fisher’s α) and four “richness‑estimator” indices (Chao1, ACE, iNEXT, and TES). Using simulation, the performance of each index was evaluated under a gradient of minimum‑sample thresholds, and for each case the accuracy and precision between-sites variance (linear regression R2) was recorded. The simulation build up 20 sites in which “true” species richness (S) was linearly correlated to an environmental gradient (x) with a theoretical coefficient of determination R2 = 0.80. Four unequal‑sampling scenarios were then generated by imposing different minimum sample sizes per site. For each scenario, linear models were fitted between every diversity index and x, recording the corresponding R2. The results indicate that sample size (the number of individuals recorded at a sampling site, as well as the equivalent sampling completeness) is the primary factor determining index performance. As sample size increased, model R2 of all diversity metrics significantly improved. Under extremely low sampling (minimum < 20 individuals; sampling coverage < 20 %), rarefied richness had a higher R2 than other indices. When the minimum sample size reached 100 individuals, the estimator indices group outperformed the observed indices. This study further clarified the minimum sample size and the corresponding sampling completeness required for each index to recover the predetermined R2. Overall, rarefied richness is recommended for highly unequal, sample size‑poor scenarios. In practice, rarefaction threshold should be set at a relatively high level (e.g., > 40 individuals) that can enhance the overall comparability among sampling sites, even if it results in the exclusion of extremely under sampled sites. Once sampling completeness is adequate, richness estimators are preferable, as they can generate extrapolated richness that are close to the true gradient.

Key words: community ecology, biodiversity metrics, species richness, incomplete detection, environmental gradient