In recent years, the integration of extended reality (XR) technology into everyday life has marked a new era. As these technologies advance and innovative design concepts are introduced, the user's interactive experience continues to improve. To deepen immersion in virtual worlds, we build on the existing virtual-physical model (VPModel) platform and introduce multi-sensory modules to XR using a modular design approach, allowing users to choose different sensory modules according to their needs. This approach provides users with a more comprehensive interactive experience through multi-channel cues. In contrast to the somatosensory suits prevalent in wearable technology, we selected three commonly used sensors and perceptual stimuli to develop integrated, wearable multi-sensory modules. We then applied the modular design approach to iterate on our sensory devices, resulting in eight modular multi-sensory components. These components comprise input devices for "user behavioral data acquisition based on multi-sensor" and output devices for "multi-sensory feedback based on somatosensory simulation devices". We invited 10 users to trial both generations of the devices. Based on an analysis of the collected data and user feedback, we finally created "XR CUBE", an XR sensory device kit that is multi-sensory, wearable, modular, and intelligent.

INDEX TERMS Extended reality, human-computer interaction, integrated design, modular construction, product design.