Logo image
Best practices for building and curating databases for comparative analyses
Journal article   Peer reviewed

Best practices for building and curating databases for comparative analyses

Lisa Schwanz, Alex Gunderson, Maider Iglesias-Carrasco, Michele A. Johnson, Jacinta D. Kong, Julia Riley and Nicholas Wu
Journal of Experimental Biology, Vol.225(Suppl_1), jeb243295
2022
PMID: 35258608

Abstract

Biology Life Sciences & Biomedicine Life Sciences & Biomedicine - Other Topics Science & Technology Zoology
Comparative analyses have a long history of macro-ecological and evolutionary approaches to understand structure, function, mechanism and constraint. As the pace of science accelerates, there is ever-increasing access to diverse types of data and open access databases that are enabling and inspiring new research. Whether conducting a species-level trait-based analysis or a formal meta-analysis of study effect sizes, comparative approaches share a common reliance on reliable, carefully curated databases. Unlike many scientific endeavors, building a database is a process that many researchers undertake infrequently and in which we are not formally trained. This Commentary provides an introduction to building databases for comparative analyses and highlights challenges and solutions that the authors of this Commentary have faced in their own experiences. We focus on four major tips: (1) carefully strategizing the literature search; (2) structuring databases for multiple use; (3) establishing version control within (and beyond) your study; and (4) the importance of making databases accessible. We highlight how one's approach to these tasks often depends on the goal of the study and the nature of the data. Finally, we assert that the curation of single-question databases has several disadvantages: it limits the possibility of using databases for multiple purposes and decreases efficiency due to independent researchers repeatedly sifting through large volumes of raw information. We argue that curating databases that are broader than one research question can provide a large return on investment, and that research fields could increase efficiency if community curation of databases was established.

Details

UN Sustainable Development Goals (SDGs)

This output has contributed to the advancement of the following goals:

#3 Good Health and Well-Being

Source: InCites

Metrics

InCites Highlights

These are selected metrics from InCites Benchmarking & Analytics tool, related to this output

Collaboration types
Domestic collaboration
International collaboration
Citation topics
1 Clinical & Life Sciences
1.155 Medical Ethics
1.155.611 Evidence Based Medicine
Web Of Science research areas
Biology
Zoology
ESI research areas
Biology & Biochemistry
Logo image