Proceedings of the Second Conference on Machine Translation 2017
DOI: 10.18653/v1/w17-4748
|View full text |Cite
|
Sign up to set email alerts
|

The AFRL-OSU WMT17 Multimodal Translation System: An Image Processing Approach

Abstract: This paper introduces the AFRL-OSU Multimodal Machine Translation Task 1 system for submission to the Conference on Machine Translation 2017 (WMT17). This is an atypical MT system in that the image is the catalyst for the MT results, and not the textual content.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
6
0

Year Published

2017
2017
2020
2020

Publication Types

Select...
4
1

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(6 citation statements)
references
References 10 publications
0
6
0
Order By: Relevance
“…An n-best list from their SMT is reranked using a bi-directional NMT trained on the aforementioned source/target word sequences. Finally, Duselis et al (2017) and Gwinnup et al (2018) propose a pure retrieval system without any reranking involved. For a given image, they first obtain a set of candidate captions from a pretrained image captioning system.…”
Section: Reranking and Retrieval Based Approachesmentioning
confidence: 99%
“…An n-best list from their SMT is reranked using a bi-directional NMT trained on the aforementioned source/target word sequences. Finally, Duselis et al (2017) and Gwinnup et al (2018) propose a pure retrieval system without any reranking involved. For a given image, they first obtain a set of candidate captions from a pretrained image captioning system.…”
Section: Reranking and Retrieval Based Approachesmentioning
confidence: 99%
“…An n-best list from their SMT is reranked using a bidirectional NMT trained on the aforementioned source/target word sequences. Finally, Duselis et al (2017) and Gwinnup et al (2018) propose a pure retrieval system without any reranking involved. For a given image, they first obtain a set of candidate captions from a pretrained image captioning system.…”
Section: Reranking and Retrieval Based Approachesmentioning
confidence: 99%
“…Here is an overview of the VMT system submitted to the WMT17 submission (Duselis et al, 2017). This system architecture assumes a captionator can be trained in a target language to give meaningful output in the form of a set of the most probable n target language candidate captions.…”
Section: The Afrl-ohio State Wmt17 Submissionmentioning
confidence: 99%
“…Most of the submissions to the Second Conference on Machine Translation (WMT17) Multimodal submissions for Task 1 (Elliott et al, 2017) used the visual domain to enhance machine translation of the image+caption pair. The exception was a Visual Machine Translation (VMT) system where the image is the driver for the translation (Duselis et al, 2017). While the scores for this submission did not approach baseline, except by human scoring, it did introduce the concept that the visual domain can approach parity with the traditional text based MT systems.…”
Section: Introductionmentioning
confidence: 96%