2021
DOI: 10.1007/s11837-021-04902-9
|View full text |Cite
|
Sign up to set email alerts
|

Challenges and Advances in Information Extraction from Scientific Literature: a Review

Abstract: Scientific articles have long been the primary means of disseminating scientific discoveries. Over the centuries, valuable data and potentially groundbreaking insights have been collected and buried deep in the mountain of publications. In materials engineering, such data are spread across technical handbooks specification sheets, journal articles, and laboratory notebooks in myriad formats. Extracting information from papers on a large scale has been a tedious and time-consuming job to which few researchers h… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
18
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
1
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 28 publications
(18 citation statements)
references
References 72 publications
0
18
0
Order By: Relevance
“…While computers generally still struggle to grasp the meaning of a text fully, NLP algorithms can endow them with knowledge about the sentence’s grammar and sentiment. A more detailed description of NLP for materials science can be found in works by Olivetti and co-workers, ,, Cole and co-workers, and Hong and co-workers . A core aspect of NLP is tokenization, which consists of breaking a larger text down into smaller subunits (Figure ).…”
Section: Case Study: Crystallization Solvents For Small Organic Molec...mentioning
confidence: 99%
See 2 more Smart Citations
“…While computers generally still struggle to grasp the meaning of a text fully, NLP algorithms can endow them with knowledge about the sentence’s grammar and sentiment. A more detailed description of NLP for materials science can be found in works by Olivetti and co-workers, ,, Cole and co-workers, and Hong and co-workers . A core aspect of NLP is tokenization, which consists of breaking a larger text down into smaller subunits (Figure ).…”
Section: Case Study: Crystallization Solvents For Small Organic Molec...mentioning
confidence: 99%
“…While there is rapid progress in the development of state-of-the-art text-mining algorithms for materials science, there remain several challenges . For instance, the MGI advocates for increased standardization of the reporting of data and metadatathe lack of which currently inhibits many text-mining activities .…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…With the development of artificial intelligence, more and more researchers pay attention to the data and phenomena in the published literature. Artificial intelligence methods are introduced in this process to help promote the collection of data from the published literature and build a scientific database in biology [29], chemistry [41,50], and many other disciplines [8,23,56]. In addition, there are also some related works focusing on literature retrieval and dissertation comprehension to extract information from scientific literature [32,52,53].…”
Section: Data Extraction In Scientific Literaturementioning
confidence: 99%
“…But this information hasn't been organized and it isn't easy to do so. The challenges on information extraction from the literature has been well documented in the article by Hong et al [32]. The ever-increasing number of scientific articles mean that different solvents and synthesis techniques are being discussed and it is difficult to keep track of them manually.…”
Section: Complexity Of Information Distributionmentioning
confidence: 99%