A pandemia da COVID-19 é uma ameaça global. Se, por um lado, contabilizamos muitas perdas de vidas, por outro lado tem-se acelerado a geração de datasets e demandas analíticas urgentes. Dentre as estratégias de combate, destacam-se a vacinação e as investigações epidemiológicas centradas em dados. Este artigo apresenta o processo de construção de datasets curados e anotados com metadados de proveniência retrospectiva, tendo como base os dados de registro da Campanha de Vacinação contra COVID-19 no Brasil. O dataset contém milhares de registros tratados até Março de 2021. Os dados foram analisados, investigados, tratados e cruzados com outras fontes, de modo a corrigi-los e complementá-los, resultando em datasets curados e alinhados aos princípios FAIR.
As the world struggles to face the challenges of vaccination against COVID-19, more attention needs to be paid to the issues related to the lack of transparency and accessibility of curated vaccination datasets. Among the strategies to combat COVID-19, vaccination and data-centered epidemiological investigations are the best ones. This paper presents the process of building cured and annotated datasets with provenance metadata. The primary dataset is based on the registration data of the Vaccination Campaign against COVID-19 in Brazil. The dataset contains thousands of records processed up to March 2021. The data were analyzed, treated, cross-checked, and linked with other sources to correct and complement them, resulting in cured datasets and aligned to the FAIR Data principles.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.