Biodiversity Science ›› 2014, Vol. 22 ›› Issue (3): 293-301. DOI: 10.3724/SP.J.1003.2014.13269
Special feature: Biodiversity Informatics (II)
Xiaolei Huang*, Gexia Qiao*
Received: 2013-12-30
Accepted: 2014-05-12
Online: 2014-05-20
Published: 2014-06-04
Contact: Xiaolei Huang, Gexia Qiao
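Abstract:
Biodiversity research, conservation practice, natural resource management, and science-based decision-making increasingly depend on the sharing and integration of large amounts of data. Although calls for data sharing and examples of it are growing, many scientists still actively or passively decline to share their data. Data sharing faces a number of perceptual and technical barriers: scientists are unwilling to share data, worry about competition from peers, believe the rewards are insufficient, are unfamiliar with the relevant data repositories, lack simple data submission tools, and lack sufficient time and funding. The key to solving these problems and improving the culture of sharing is to ensure that those who share receive appropriate rewards, such as data citation. Peer-reviewed data publishing is regarded as a way not only to provide an incentive for scientists who produce, manage, and share data, but also to promote data reuse effectively. As one form of data sharing, data publishing has therefore attracted considerable attention recently, and journals dedicated to publishing data papers have appeared in the biodiversity field. In adopting the data paper model, joint data policies between data repositories and scientific journals may be a more feasible way to promote data sharing. This paper summarizes progress in data sharing and publishing, discusses the extent to which data papers can promote data sharing and the relationship between data sharing and data publishing, and offers the following suggestions: (1) individual scientists should strive to practice data sharing; (2) DOIs should be used to resolve issues of data ownership and data citation; (3) scientific journals and data repositories should jointly adopt more reasonable and stricter data archiving policies; (4) funding agencies and research institutions should play a more important role in data sharing.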
Xiaolei Huang, Gexia Qiao (2014) Sharing and publishing of biodiversity data: recent trends and future suggestions. Biodiversity Science, 22, 293-301. DOI: 10.3724/SP.J.1003.2014.13269.
Fig. 1 A sketch of the data sharing flow, including the main participants and the main factors that determine how effective sharing is. Funding agencies can play a role at every stage of the flow.
Fig. 3 The conceptual relationship between 'data sharing' and 'data publishing'. Data sharing includes sharing without publishing (e.g. e-mail, removable disks) and sharing through publishing (e.g. public databases, data papers, personal websites); the latter is data publishing.
Data repository | Data type and notes | Metadata | Citable object | Access |
---|---|---|---|---|
GenBank http://www.ncbi.nlm.nih.gov/genbank | DNA sequences; standard format | Required | Accession number | Open |
Barcode of Life Database (BOLD) http://www.boldsystems.org | DNA barcodes and related data; standard format | Required | Accession number | Open |
Global Biodiversity Information Facility (GBIF) http://www.gbif.org | Species occurrence data; standard format | Required | None | Open |
TreeBASE http://www.treebase.org | Phylogenetic trees and related data | Suggested | None | Open |
Dryad http://datadryad.org | Ecology and evolution data associated with publications; no standard format | Suggested | DOI | Open |
figshare http://figshare.com | Any type of data; no standard format | Optional | DOI | Variable |
China National Specimen Information Infrastructure http://www.nsii.org.cn | Specimen data | Optional | None | Variable |
Table 1 Some popular data repositories that archive biodiversity data, and their characteristics. For more repositories and descriptions, see Databib (http://databib.org).