2021
DOI: 10.48550/arxiv.2110.06595
Preprint

Refcat: The Internet Archive Scholar Citation Graph

Martin Czygan,
Helge Holzmann,
Bryan Newbold

Abstract: As part of its scholarly data efforts, the Internet Archive (IA) releases a first version of a citation graph dataset, named refcat, derived from scholarly publications and additional data sources. It is composed of data gathered by the fatcat cataloging project (the catalog that underpins IA Scholar), related web-scale crawls targeting primary and secondary scholarly outputs, as well as metadata from the Open Library project and Wikipedia. This first version of the graph consists of over 1.3B citations…

Cited by 1 publication
(3 citation statements)
References 17 publications
“…The purpose of InnoGraph is to provide data-driven support to each stage of AI innovation which could be understood as a "journey of an AI innovation", from inception to implementation. The journey of innovation can be seen as a composition non strictly in the presented order of the following stages: (1) an innovation typically appears in the academic world; (2) projects are started around the innovation; (3) the innovation gets possibly patented; (4) companies are established around the innovation; (5) companies get investments, possibly in several rounds; (6) investments influence the job market; (7) market reacts to the quality and possible impact of the innovation; (8) public and expert perception gets formed; (9) media starts publishing about the innovation and companies; (10) educational institutions integrate innovation in their curricula; (11) policymakers regulate the innovation; and (12) to close the cycle, funding agencies create new funding opportunities to create space for follow-up innovations.…”
Section: InnoGraph and AI Innovation
Citation type: mentioning
confidence: 99%
“…In addition to structured data, we are looking into science popularization websites and blogs, such as the Stanford AI Index Report and AI Vibrancy Tool, LifeArchitect, State of AI Report 2022, 2022 AI Tech Trends Report by Future Today Institute, Natural Language Processing Progress, etc. Furthermore, we are collecting an extensive Zotero bibliography.…”
Section: Data Sources
Citation type: mentioning
confidence: 99%