“…Despite these potentials, it is not yet fully understood how LLMs read the prompt and use pretrained knowledge [14,59], and the development of prompts is usually conducted through iterative trial and error [61]. While the HCI community has actively explored the use of LLMs in various domains (e.g., [20,49,104]), research that leverages LLMs for powering chatbots, particularly task-oriented ones [8,67,94], is still sparse. Due to the inherent characteristics of LLMs, LLM-driven chatbots may be error-prone [44] or digress from their tasks [94].…”