“…There are many studies on language directed navigation [38], [72], [73], [74], [75], [76], [77], [78], [79], [80], [81], [82], [83]. In the language-directed navigation task, there are many benchmark tasks that have been proposed as visuallanguage-navigation (VLN) (R2R [8] , R4R [34], REVERIE [37], NDH [35], HANNNA [36], Robo-VLN [38] ).…”