“…I agree with the limitations you have pointed out, keeping in mind that this study is intended as a proof of concept for a novel feature of large language models (LLMs). Although 60 scenarios is not a large sample, most comparable studies of LLMs in gastroenterology examined a similar number of scenarios [3,4,5]. The interrogation of discrepancies, as part of a systemic approach to the development and training of LLMs, should indeed be further implemented and improved.…”