A review of the heterogeneous landscape of biodiversity databases: opportunities and challenges for a synthesized biodiversity knowledge base

semanticscholar(2021)

Abstract
Aim: Addressing global environmental challenges requires access to biodiversity data across wide spatial, temporal and biological scales. Recent decades have witnessed an exponential increase in biodiversity information aggregated by biodiversity databases (hereafter ‘databases’). However, heterogeneous coverage, protocols, and standards have hampered data integration among databases. To stimulate the next stage of data integration, here we present a synthesis of major databases, and investigate i) how the coverage of databases varies across taxonomy, space, and record type; ii) the degree of integration among databases; iii) how integration of databases can increase biodiversity knowledge; iv) the barriers to database integration.

Location: Global

Time period: Contemporary

Major taxa studied: Plants and Vertebrates

Methods: We reviewed the scope of twelve well-established databases and assessed the status of their integration. We synthesized information from these databases to assess major knowledge gaps and barriers to full integration. We estimated how improved integration can increase the coverage and depth of biodiversity knowledge.

Results: Each reviewed database had a unique focus in its data coverage. Data flows were common among databases, though not always clearly documented. Functional trait databases were more isolated than those pertaining to species distributions. Poor compatibility between the taxonomic systems used by different databases posed a major challenge to integration. We demonstrated that integration of distribution databases can lead to greater taxonomic coverage, corresponding to 23 years’ advancement in knowledge accumulation, and that the improvement in taxonomic coverage could be as high as 22.4% for trait databases.

Main conclusions: A rapid increase in biodiversity knowledge can be achieved through the integration of databases, providing the data necessary to address critical environmental challenges. Our synthesis provides an overview of the integration status of databases. Full integration across databases will require tackling the major impediments to data integration: taxonomic incompatibility, lags in data exchange, barriers to effective data synchronization, and isolation of individual initiatives.