This Patient Looks Like That Patient: Prototypical Networks for Interpretable Diagnosis Prediction from Clinical Text

Aken, Betty van; Papaioannou, Jens-Michalis; Naik, Marcel; Eleftheriadis, Geοrgiοs; Nejdl, Wolfgang; Gers, Felix A.; Löser, Alexander

doi:10.48550/arxiv.2210.08500

Cited by 2 publications

(4 citation statements)

References 17 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Their focus is on learning one representative prototype per class while their performance is dependent on the size of the support set in few-shot learning scenarios. Due to these limitations, there have been relatively few works that utilize prototypical networks to provide interpretability for LLM's in NLP (Garcia-Olano et al, 2022;Das et al, 2022;Van Aken et al, 2022). and Hase et al (2019) use prototypical parts networks with multiple learned prototypes per class but only apply their methods to image classification tasks.…”

Section: Related Workmentioning

confidence: 99%

“…Our work is most closely related to Das et al (2022) and Van Aken et al (2022) in terms of the approach taken. However, our work differs in that the architecture in (Das et al, 2022) only utilizes a single negative prototype for binary classification, while proto-lm enables multi-class classification by using multiple prototypes for each class.…”

Section: Related Workmentioning

confidence: 99%

“…Additionally, we have extended the work of (Das et al, 2022) by implementing token-level attention to identify not only influential samples but also influential sections of text within each sample. Moreover, different from the single-prototype-as-asummary approach in Van Aken et al (2022), by learning multiple prototypes per class, proto-lm creates a prototypical space ( §4.1), where, unlike the embedding space of LLM's, each dimension is meaningful, specifically the distance to a learned prototype, and can be used to explain a decision. Our work is also similar to Friedrich et al (2021) in terms of our loss function design.…”

Section: Related Workmentioning

confidence: 99%

“…In order to help human users understand prototypes in natural language, we identify the closest training data sample for each prototype and project the prototype onto that sample. The projection of prototypes onto the nearest sample is a well-studied and established technique (Das et al, 2022;Van Aken et al, 2022;Hase et al, 2019). We compare the quality of our projections against the projections obtained via the training procedure of Proto-tex (Das et al, 2022) and the loss function used in Protoypical Network by measuring the average normalized distance between each prototype and their projected sample's embedding.…”

Section: Projecting Prototypesmentioning

confidence: 99%

See 3 more Smart Citations

Proto-lm: A Prototypical Network-Based Framework for Built-in Interpretability in Large Language Models

Xie,

Vosoughi,

Hassanpour

2023

Findings of the Association for Computational Linguistics: EMNLP 2023

View full text Add to dashboard Cite

Large Language Models (LLMs) have significantly advanced the field of Natural Language Processing (NLP), but their lack of interpretability has been a major concern. Current methods for interpreting LLMs are post hoc, applied after inference time, and have limitations such as their focus on low-level features and lack of explainability at higherlevel text units. In this work, we introduce proto-lm, a prototypical network-based whitebox framework that allows LLMs to learn immediately interpretable embeddings during the fine-tuning stage while maintaining competitive performance. Our method's applicability and interpretability are demonstrated through experiments on a wide range of NLP tasks, and our results indicate a new possibility of creating interpretable models without sacrificing performance. This novel approach to interpretability in LLMs can pave the way for more interpretable models without the need to sacrifice performance. We release our code at https://github.com/yx131/proto-lm.

show abstract

Section: Related Workmentioning

confidence: 99%

Section: Related Workmentioning

confidence: 99%

Section: Related Workmentioning

confidence: 99%

Section: Projecting Prototypesmentioning

confidence: 99%

See 2 more Smart Citations

Proto-lm: A Prototypical Network-Based Framework for Built-in Interpretability in Large Language Models

Xie,

Vosoughi,

Hassanpour

2023

Findings of the Association for Computational Linguistics: EMNLP 2023

View full text Add to dashboard Cite

show abstract

Considerations on the basis of medical reasoning for the use in AI applications

Koumpis,

Graefe

2024

Front. Med.

View full text Add to dashboard Cite

This study discusses the integration of artificial intelligence (AI) and machine learning (ML) in medical reasoning and decision-making, with a focus on the challenges and opportunities associated with the massive consumption of data required for training AI systems, and contrasts this with the limited data typically available to medical practitioners. We advocate for a balanced approach that includes small data and emphasize the importance of maintaining the art of clinical reasoning amid technological advancements. Finally, we highlight the potential of multidisciplinary research in addressing the complexities of medical reasoning and suggest the necessity of careful abstraction and conceptual modeling in AI applications.

show abstract

This Patient Looks Like That Patient: Prototypical Networks for Interpretable Diagnosis Prediction from Clinical Text

Cited by 2 publications

References 17 publications

Proto-lm: A Prototypical Network-Based Framework for Built-in Interpretability in Large Language Models

Proto-lm: A Prototypical Network-Based Framework for Built-in Interpretability in Large Language Models

Considerations on the basis of medical reasoning for the use in AI applications

Contact Info

Product

Resources

About