2022
DOI: 10.1038/s41597-022-01492-2
|View full text |Cite
|
Sign up to set email alerts
|

Materials information extraction via automatically generated corpus

Abstract: Information Extraction (IE) in Natural Language Processing (NLP) aims to extract structured information from unstructured text to assist a computer in understanding natural language. Machine learning-based IE methods bring more intelligence and possibilities but require an extensive and accurate labeled corpus. In the materials science domain, giving reliable labels is a laborious task that requires the efforts of many professionals. To reduce manual intervention and automatically generate materials corpus dur… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
6
0

Year Published

2023
2023
2025
2025

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 13 publications
(6 citation statements)
references
References 27 publications
0
6
0
Order By: Relevance
“… Automated Literature Mining—A fundamental challenge in EBT is comprehensive evidence gathering from enormous toxicology literature. AI text mining uses natural language processing (NLP) to extract facts, relationships, and reported findings from papers to populate structured databases (Yan et al 2022 ). Toxicological ontology mapping further enables mining context-specific information (Foster et al 2024 ).…”
Section: Ai To Accelerate Evidence-based Toxicologymentioning
confidence: 99%
“… Automated Literature Mining—A fundamental challenge in EBT is comprehensive evidence gathering from enormous toxicology literature. AI text mining uses natural language processing (NLP) to extract facts, relationships, and reported findings from papers to populate structured databases (Yan et al 2022 ). Toxicological ontology mapping further enables mining context-specific information (Foster et al 2024 ).…”
Section: Ai To Accelerate Evidence-based Toxicologymentioning
confidence: 99%
“…Traditionally, data has been shared through plots and tables in manuscripts, with important context embedded in manuscript text. While recent efforts make it possible to extract information from these documents using natural language processing methods, [47][48][49][50] traditional publications are not an efficient way of transmitting data. Fortunately, a slow shi towards more open data sharing is underway.…”
Section: Community Scale Data Managementmentioning
confidence: 99%
“…Here, the objective in analyzing textual ER data is to move away from supervised/semisupervised ML model analysis tools [10][11][12][13] and to instead automate the extraction of quantitative knowledge from textual data in order to assist system engineers in assessing SSC health trends and identify SSC anomalous behaviors. Knowledge extraction [20][21][22][23][24] is a very broad concept whose definition may vary depending on the application context. When applied to NPP ER textual data (i.e., IRs or WOs), the knowledge extraction approach described herein is designed to extract its syntactic and semantic elements.…”
Section: Introductionmentioning
confidence: 99%