We present a new large-scale multilingual video description dataset, VATEX 1 , which contains over 41, 250 videos and 825, 000 captions in both English and Chinese. Among the captions, there are over 206, 000 English-Chinese parallel translation pairs. Compared to the widely-used MSR-VTT dataset [66], VATEX is multilingual, larger, linguistically complex, and more diverse in terms of both video and natural language descriptions. We also introduce two tasks for video-and-language research based on VATEX:(1) Multilingual Video Captioning, aimed at describing a video in various languages with a compact unified captioning model, and (2) Video-guided Machine Translation, to translate a source language description into the target language using the video information as additional spatiotemporal context. Extensive experiments on the VATEX dataset show that, first, the unified multilingual model can not only produce both English and Chinese descriptions for a video more efficiently, but also offer improved performance over the monolingual models. Furthermore, we demonstrate that the spatiotemporal video context can be effectively utilized to align source and target languages and thus assist machine translation. In the end, we discuss the potentials of using VATEX for other video-and-language research. * Equal contribution. 1 VATEX stands for Video And TEXt, where X also represents various languages.
The potent new antiviral inhibitor TMC-114 (UIC-94017) of HIV-1 protease (PR) has been studied with three PR variants containing single mutations D30N, I50V and L90M that provide resistance to the major clinical inhibitors. The inhibition constants (Ki) of TMC-114 for mutants PR D30N , PR I50V , and PR L90M were 30-, 9-and 0.14-fold, respectively, relative to wild type PR. The molecular basis for the inhibition was analyzed using high resolution (1.22-1.45 Å) crystal structures of PR mutant complexes with TMC-114. In PR D30N the inhibitor has a water-mediated interaction with the side chain of Asn30 rather than the direct interaction observed in PR, which is consistent with the relative inhibition. Similarly, in PR I50V the inhibitor loses favorable hydrophobic interactions with the side chain of Val50. TMC-114 has additional van der Waals contacts in PR L90M structure compared to the PR structure leading to a tighter binding of the inhibitor. The observed changes in PR structure and activity are discussed in relation to the potential for development of resistant mutants on exposure to TMC-114.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.