Enhanced Story Comprehension for Large Language Models through Dynamic Document-Based Knowledge Graphs

Andrus, Berkeley R; Nasiri, Yeganeh; Cui, Shilong; Cullen, Benjamin; Fulda, Nancy

doi:10.1609/aaai.v36i10.21286

Cited by 11 publications (9 citation statements)

References 21 publications (33 reference statements)


“…Benchmarking LLMs has been an important research theme in the field of natural language processing, focusing on evaluating their performance across a variety of tasks and datasets, when comprehensive benchmarks provided critical insights into the strengths and limitations of different LLM architectures [28], [29]. Comparative analysis of model performance on established benchmarks such as GLUE (General Language Understanding Evaluation) and SQuAD (Stanford Question Answering Dataset) demonstrated significant advancements in language understanding capabilities [30]- [34]. Performance metrics were used to quantify improvements in tasks such as text classification, sentiment analysis, and question answering, highlighting the progressive enhancements in model architectures and training methodologies [35], [36].…”

Section: B Benchmarking Large Language Models (mentioning)

confidence: 99%

Benchmarking Llama 3 for Chinese News Summation: Accuracy, Cultural Nuance, and Societal Value Alignment

Lu, Hu, Chen. 2024. Preprint.


Our benchmarking of Llama 3 for Chinese news summarization is a novel approach that integrates cultural and ethical considerations into model evaluation, significantly enhancing the relevance and acceptability of the generated content. The study employs a comprehensive framework to assess accuracy, cultural understanding, and societal value compliance, providing a multifaceted evaluation of Llama 3’s capabilities. The results demonstrate that Llama 3 outperforms traditional and contemporary models, achieving high scores in ROUGE metrics and specialized cultural and ethical indices. Key findings highlight the importance of fine-tuning on culturally rich datasets and the use of advanced evaluation metrics to capture the complex interplay between language, culture, and ethics. Challenges encountered during the research underscore the need for continuous dataset updates and metric refinement, suggesting directions for future studies. The insights gained from this evaluation contribute to the broader field of natural language processing by showcasing the potential of advanced models to produce high-quality, culturally aware, and ethically compliant summaries.



“…Many studies [1,2,8,10,13,28,29,32,34,38] have been proposed to incorporate external knowledge to better understand the text or generate the expected output. For example, on the language understanding task, [13] injects expanded knowledge into the language model by adding the entity and relation from the knowledge graph as additional words.…”

Section: Knowledge Informed Language Understanding and Generation (mentioning)

confidence: 99%

“…Different from the masking strategy of BERT [5], [29] proposes an entity-level masking strategy to incorporate the informative entities into the language model. [2] verbalize extracted facts aligning with input questions as natural language and incorporate them as prompts to the language model to improve story comprehension. For open-domain question answering, [8] incorporate the informative entities extracted from the input question and passage with the output of language model T5 to jointly optimize the knowledge representations based on their proposed relation-aware GNN.…”

Section: Knowledge Informed Language Understanding and Generation (mentioning)

confidence: 99%

Understand the Dynamic World: An End-to-End Knowledge Informed Framework for Open Domain Entity State Tracking

Huang. 2023. Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval.


Open domain entity state tracking aims to predict reasonable state changes of entities (i.e., [attribute] of [entity] was [before_state] and [after_state] afterwards) given the action descriptions. It's important to many reasoning tasks to support human everyday activities. However, it's challenging as the model needs to predict an arbitrary number of entity state changes caused by the action while most of the entities are implicitly relevant to the actions and their attributes as well as states are from open vocabularies. To tackle these challenges, we propose a novel end-to-end Knowledge Informed framework for open domain Entity State Tracking, namely Kiest, which explicitly retrieves the relevant entities and attributes from an external knowledge graph (i.e., ConceptNet) and incorporates them to autoregressively generate all the entity state changes with a novel dynamic knowledge grained encoder-decoder framework. To enforce the logical coherence among the predicted entities, attributes, and states, we design a new constraint decoding strategy and employ a coherence reward to improve the decoding process. Experimental results show that our proposed Kiest framework significantly outperforms the strong baselines on the public benchmark dataset OpenPI.


“…With the continual advancement in graph neural networks, graph-based neural network models have garnered significant attention in domains such as natural language processing and computer vision [20,21]. Particularly, graph models can use correlations between data to construct edges in a graph to extract structural information from the data [22,23].…”

Section: Introduction (mentioning)

confidence: 99%

Attention-Based Two-Dimensional Dynamic-Scale Graph Autoencoder for Batch Process Monitoring

Zhu, Gao, Zhang. 2024. Processes.


Traditional two-dimensional dynamic fault detection methods describe nonlinear dynamics by constructing a two-dimensional sliding window in the batch and time directions. However, determining the shape of a two-dimensional sliding window for different phases can be challenging. Samples in the two-dimensional sliding windows are assigned equal importance before being utilized for feature engineering and statistical control. This will inevitably lead to redundancy in the input, complicating fault detection. This paper proposes a novel method named attention-based two-dimensional dynamic-scale graph autoencoder (2D-ADSGAE). Firstly, a new approach is introduced to construct a graph based on a predefined sliding window, taking into account the differences in importance and redundancy. Secondly, to address the training difficulties and adapt to the inherent heterogeneity typically present in the dynamics of a batch across both its time and batch directions, we devise a method to determine the shape of the sliding window using the Pearson correlation coefficient and a high-density gridding policy. The method is advantageous in determining the shape of the sliding windows at different phases, extracting nonlinear dynamics from batch process data, and reducing redundant information in the sliding windows. Two case studies demonstrate the superiority of 2D-ADSGAE.

