Translation has been a part of applied linguistics that commonly focuses on verbal rendering. Language elements are explored to produce a number of concepts about equivalence, nonequivalence, and untranslatability. Texts, however, appear not only as verbal language but also in non-verbal forms such as audio, visual, gestural, and digital images. Therefore, the study of the theory and practice of translation must be capable of explaining phenomena that arise from the transfer of verbal text to non-verbal text and vice versa. This study aims to explain multimodality in the English-Indonesian translation of verbal, audio-lingual, audiovisual, and gesturo-verbal texts, and thereby to contribute to the development of multimodality in intersemiotic translation. The object of this study is English-language texts and their Indonesian translations. Even though translation can be divided into three categories, the type that people commonly understand is interlingual translation involving verbal aspects; the other two types have only recently received adequate attention. For that reason, this study focuses more on audio-lingual, audiovisual, verbo-visual, and gesturo-verbal renderings, alongside conventional verbal translation.