Harvard Data Science Review 2022
DOI: 10.1162/99608f92.8a3f2336

Data Inventories for the Modern Age? Using Data Science to Open Government Data

Cited by 9 publications (9 citation statements)
References: 21 publications

“…Though the results of this evaluation indicate high levels of reliability, the system currently does not feature means for distinguishing the significance of one positively identified citation from another. Previous work has shown the efficacy of applying natural language processing (NLP) techniques to the textual context surrounding individual citations [17, 18]. Similar techniques could be applied to evaluate DCE results beyond a simple validity assessment.…”
Section: Discussion (mentioning)
confidence: 99%
“…The search of the full-text search corpus employs three open source machine learning models, originally developed as part of the Kaggle competition "Coleridge Initiative - Show US the Data" (Coleridge Initiative, 2021a; Lane et al., 2022). The models are all available in the Coleridge GitHub repository.…”
Section: Searching the Full-text Corpus (mentioning)
confidence: 99%
“…This integrated approach leads to long-term outcomes in the third stage (Figure 1, right column) that include accessible federal data, knowledgeable and empowered users, an open data community ethos, sustainable data practices, replicable research, and evidence-based policy actions, each of which is a component of the core mission of democratizing data. While improving data access and usability of federal data is a primary goal of data democratization (Lane, 2024; Lane et al., 2022; Potok, 2023), another possible outcome, if strategies are implemented effectively, is a community of empowered users. Studies show that individuals who feel empowered using services remain loyal to that service provider (Vatanasombut et al., 2004) and user satisfaction has a positive impact on user retention, defined as the longevity of a user's loyalty to and interest in a product or service (Gu et al., 2022).…”
Section: Theory of Change (mentioning)
confidence: 99%
“…The recent passage of the Foundations for Evidence-Based Policymaking Act of 2018 (hereafter Evidence Act) was a signal from the federal government of its commitment to enhance the efficiency of government programs, as well as public access to government data (Potok, 2023). Its passage also served as a call to action for federal agencies to improve government data ecosystems (Lane et al., 2022; Potok, 2023). New data access platforms like Data.gov, ResearchDataGov.org, and DemocratizingData.ai have emerged to support federal agencies as they respond to federal mandates requiring them to publish their information online (Title II, Foundations for Evidence-Based Policymaking Act of 2018, Public Law 115-435).…”
Section: Introduction (mentioning)
confidence: 99%