Data harmonization is an important method for generating the datasets needed to support big data analyses. To date, however, articles about data harmonization have been field-specific and highly technical, making it difficult for researchers to derive general principles for how to engage in and contextualize data harmonization efforts. This article provides general guidance and criteria for researchers who are considering undertaking such efforts or who seek to evaluate the quality of existing ones. We derive these guidelines from the extant literature and from our own experience in harmonizing data for the emergent field of COVID-19 public health and safety measures (PHSM). We further present the methodology we employed for this harmonization as a blueprint for researchers interested in manual data harmonization.