2023
DOI: 10.21203/rs.3.rs-2884789/v1
Preprint

Between Reality and Delusion: Challenges of Applying Large Language Models to Companion Robots for Open-Domain Dialogues with Older Adults

Abstract: This work aims to provide initial guidelines towards developing companion robots with large language models (LLMs) to be part of the everyday lives of older adults. Using iterative participatory design (co-design) approaches, we analyze the challenges of applying LLMs for multi-modal open-domain dialogue, deriving from older adults' (one-to-one) interactions with a personalized companion robot, built on the Furhat robot with GPT-3.5. An initial study with 6 Swedish-speaking older adults (65 and older) showed that the …

citations
Cited by 8 publications
(22 citation statements)
references
References 277 publications
“…An example for such collaboration is a story-telling game CreativeBot [8] where a child and a robot Furhat create a story via turn-taking. On the other hand, the authors in [9] examine the difficulties associated with utilizing large language models for multi-modal open-domain dialogues, which derive from the interactions between older adults and a personalized companion robot, the Furhat robot, powered by GPT-3.5. Authors in [7] use the Pepper robot for chitchatting and enhance embodied dialogs by providing additional 'cloud-background' dialogue capabilities to complement the preexisting natural language understanding ability.…”
Section: Related Work (mentioning)
confidence: 99%
“…Our prior work (Irfan et al., 2023) (among others described in Section 2.2) provides a starting point for using a foundation model (e.g., LLM) for a conversational companion robot for older adults. Deriving from the expectations of older adults outlined in the previous sections, and the challenges of LLMs encountered in our prior work based on the interactions with older adults, we offer actionable design recommendations for developing conversational companion robots that leverage foundation models, such as LLMs, vision-language models, and state-of-the-art architectures as their core, with potential relevance for other conversational robots and agents.…”
Section: Design Recommendations (mentioning)
confidence: 99%
“…The generation of text or responses that seem plausible but factually incorrect is referred to as "hallucination" in foundation models, which is a commonly recognized challenge (Weidinger et al., 2021; Irfan et al., 2023). Attention mechanisms, regularization techniques, retrieval-based methods, evaluating…”
Section: Information Credibility and Recency (mentioning)
confidence: 99%
“…LLMs have opened up new possibilities in the field of robotics and human-robot teaming, most apparently for social robots and Socially Assistive Robots (SARs) (Alessa and Al-Khalifa, 2023; Irfan et al., 2023; Kahambing, 2023; Lee et al., 2023; Lekova et al., 2023). However, the conversational capabilities of LLMs extend beyond mere social interactions; their proficiency in handling a diverse range of textual inputs, without the need for rigidly predefined formats, marks a significant advancement for applications such as speech-controlled robotics, which have historically faced challenges with processing unstructured input.…”
Section: LLMs and Robots (mentioning)
confidence: 99%