2023
DOI: 10.1016/j.patter.2023.100843
|View full text |Cite
|
Sign up to set email alerts
|

EXSCLAIM!: Harnessing materials science literature for self-labeled microscopy datasets

Eric Schwenker,
Weixin Jiang,
Trevor Spreadbury
et al.
Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 50 publications
0
3
0
Order By: Relevance
“…† based approach, namely that nearly a third of the extracted images did not receive "a substring of text from the caption distributor", and that this "is where transformer-based NLP models stand to make the greatest contribution to the overall pipeline". 132 However, this increased exibility comes at the cost of potential hallucinations, lack of explainability, poor reproducibility and a larger computational overhead (explored further in Section 5). A synthesis of the two approaches, for example using classical NLP to ground LLM labels, could prove fruitful.…”
Section: Case Study 2: Labelled Microstructure Dataset Collectionmentioning
confidence: 99%
See 2 more Smart Citations
“…† based approach, namely that nearly a third of the extracted images did not receive "a substring of text from the caption distributor", and that this "is where transformer-based NLP models stand to make the greatest contribution to the overall pipeline". 132 However, this increased exibility comes at the cost of potential hallucinations, lack of explainability, poor reproducibility and a larger computational overhead (explored further in Section 5). A synthesis of the two approaches, for example using classical NLP to ground LLM labels, could prove fruitful.…”
Section: Case Study 2: Labelled Microstructure Dataset Collectionmentioning
confidence: 99%
“…Image-based paper data extractors also exist, like ‘ImageDataExtractor’ 131 which is capable of detecting electron microscopy images from figures, identifying their scale and segmenting any nanoparticles present. ‘EXSCLAIM’ 132 uses rule-based NLP and image processing to extract images and assign hierarchical labels based on the figure caption. These extractors have found use in generating structured datasets for nanoparticles, 133 photocatalysts 134 and self-cleaning coatings.…”
Section: Llm Workflows In Materials Science: Two Case Studiesmentioning
confidence: 99%
See 1 more Smart Citation