In this work, we designed a multimodal transformer that combines the Simplified Molecular Input Line Entry System (SMILES) and molecular graph representations to enhance the prediction of polymer properties. Three models with different embeddings (SMILES, SMILES + monomer, and SMILES + dimer) were employed to assess the benefit of incorporating multimodal features into the transformer architecture. Fine-tuning results across five properties (i.e., density, glass-transition temperature (Tg), melting temperature (Tm), volume resistivity, and conductivity) demonstrated that the multimodal transformer using the SMILES + dimer configuration outperformed the SMILES-only transformer on all five properties. Furthermore, our model facilitates in-depth analysis by examining attention scores, providing deeper insight into the relationship between the deep learning model and polymer attributes. We believe this work sheds light on the potential of multimodal transformers for polymer property prediction and opens a new direction for understanding and refining polymer properties.
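To make the multimodal idea concrete, the sketch below shows one possible way to fuse SMILES token embeddings with a graph-derived embedding (e.g., of a monomer or dimer) in a transformer encoder for property regression. This is a minimal illustration under our own assumptions, not the authors' implementation; all module names, dimensions, and the mean-pooled graph summary are hypothetical choices.

```python
# Minimal sketch (hypothetical, not the paper's model): fuse SMILES tokens with a
# graph-derived embedding inside one transformer encoder for property regression.
import torch
import torch.nn as nn


class MultimodalPolymerTransformer(nn.Module):
    def __init__(self, vocab_size=128, d_model=256, nhead=8, num_layers=4,
                 node_feat_dim=32, max_len=256):
        super().__init__()
        # SMILES branch: token + positional embeddings
        self.token_emb = nn.Embedding(vocab_size, d_model)
        self.pos_emb = nn.Embedding(max_len, d_model)
        # Graph branch: project node features, then mean-pool into one "graph token"
        self.node_proj = nn.Linear(node_feat_dim, d_model)
        encoder_layer = nn.TransformerEncoderLayer(d_model, nhead, batch_first=True)
        self.encoder = nn.TransformerEncoder(encoder_layer, num_layers)
        self.head = nn.Linear(d_model, 1)  # single-property regression (e.g., Tg)

    def forward(self, smiles_ids, node_feats):
        # smiles_ids: (B, L) token indices; node_feats: (B, N, node_feat_dim)
        B, L = smiles_ids.shape
        pos = torch.arange(L, device=smiles_ids.device).unsqueeze(0)
        tok = self.token_emb(smiles_ids) + self.pos_emb(pos)
        # Simple graph summary: mean over projected node features
        # (a GNN encoder could be substituted here)
        graph_tok = self.node_proj(node_feats).mean(dim=1, keepdim=True)  # (B, 1, d_model)
        # Prepend the graph token so self-attention can mix both modalities
        x = torch.cat([graph_tok, tok], dim=1)
        h = self.encoder(x)
        return self.head(h[:, 0])  # predict from the fused graph-token position


# Usage sketch with random inputs
model = MultimodalPolymerTransformer()
smiles_ids = torch.randint(0, 128, (2, 64))
node_feats = torch.randn(2, 10, 32)
print(model(smiles_ids, node_feats).shape)  # torch.Size([2, 1])
```

Prepending the graph embedding as an extra token is only one fusion strategy; cross-attention or concatenation at the pooled-representation level would be equally plausible, and the attention scores of such a model can be inspected for the kind of interpretability analysis described above.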