“…However, others use interpretable intermediate representations [33,34,35]. In particular, the bird's-eye-view (BEV) semantic occupancy grid is a widely used representation in modern driving approaches [36,22,23,37,26]. This representation can be inferred from images [38,39,40,41,42,26,43,44].…”