2023
DOI: 10.1039/d3dd00112a
|View full text |Cite
|
Sign up to set email alerts
|

Domain-specific chatbots for science using embeddings

Kevin G. Yager

Abstract: We demonstrate how large language models (LLMs) can be adapted to domain-specific science topics by connecting them to a corpus of trusted documents.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2025
2025

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 13 publications
(2 citation statements)
references
References 76 publications
0
2
0
Order By: Relevance
“…Search functionalities also leverage vector embeddings extensively. An example is Google's reverse image search, where images are transformed into vector representations that allow for efficient and accurate retrieval based on visual similarities [22]. This method applies not only to images but also to textual content, where search engines employ vector embeddings to improve the relevance and precision of search results.…”
Section: Source: Compiled By the Authorsmentioning
confidence: 99%
“…Search functionalities also leverage vector embeddings extensively. An example is Google's reverse image search, where images are transformed into vector representations that allow for efficient and accurate retrieval based on visual similarities [22]. This method applies not only to images but also to textual content, where search engines employ vector embeddings to improve the relevance and precision of search results.…”
Section: Source: Compiled By the Authorsmentioning
confidence: 99%
“…Purely generative methods in chatbot development can have drawbacks, including hallucinations, a lack of explainability, biases, and difficulties in verifying model-generated information [13]. In academic settings, these shortcomings are considered unacceptable, and approaches such as specific prompt engineering, fine-tuning, and document embedding have been proposed to mitigate hallucinations and ensure that the model adheres to the given context [14,15]. Recent advancements, including LoRA [16] and prompt-tuning [17], as well as user-friendly frameworks such as the LLM-Adapters developed by [12] or HuggingFace's PEFT library [18], have made fine-tuning more efficient and accessible to researchers with limited computational resources.…”
Section: Review Of Chatbotsmentioning
confidence: 99%