2020
DOI: 10.1007/978-3-030-63031-7_15

A Mixed Learning Objective for Neural Machine Translation

Abstract: Evaluation discrepancy and the overcorrection phenomenon are two common problems in neural machine translation (NMT). NMT models are generally trained with a word-level learning objective but evaluated by sentence-level metrics. Moreover, the cross-entropy loss function discourages the model from generating synonymous predictions and overcorrects them to the ground-truth words. To address these two drawbacks, we adopt multi-task learning and propose a mixed learning objective (MLO) which combines the strength of word-level and …
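The abstract is cut off above, but the general shape of a mixed word-level/sentence-level objective can be sketched. The function below is a hypothetical illustration of one common multi-task formulation (token-level cross-entropy combined with a REINFORCE-style sentence-level term), not necessarily the paper's actual MLO; the names mixed_loss, alpha, sampled_rewards, and pad_id are illustrative assumptions.

```python
import torch.nn.functional as F

def mixed_loss(logits, targets, sampled_logprobs, sampled_rewards,
               alpha=0.5, pad_id=0):
    """Mix a word-level and a sentence-level training signal (sketch).

    logits:           (batch, seq_len, vocab) decoder scores for the reference
    targets:          (batch, seq_len) ground-truth token ids
    sampled_logprobs: (batch,) summed log-probability of a sampled hypothesis
    sampled_rewards:  (batch,) sentence-level metric (e.g. sentence BLEU) of
                      that hypothesis against the reference
    alpha:            mixing weight between the two terms (hypothetical)
    """
    # Word-level objective: standard token-wise cross-entropy against the
    # reference translation, ignoring padding positions.
    word_level = F.cross_entropy(
        logits.reshape(-1, logits.size(-1)),
        targets.reshape(-1),
        ignore_index=pad_id,
    )
    # Sentence-level objective: REINFORCE-style term that raises the
    # probability of hypotheses scoring well on the sentence-level metric.
    sentence_level = -(sampled_rewards.detach() * sampled_logprobs).mean()
    return alpha * word_level + (1.0 - alpha) * sentence_level
```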

Cited by 3 publications (7 citation statements)
References 17 publications
“…Despite the difference in their inputs and neural models, those approaches are similar in the sense that they are all based on a typical NMT model formulated as an encoder-decoder-attention architecture optimized with cross-entropy [11], [15], [16]. As a matter of fact, all the prior works on program repair are based on the NMT architecture with a cross-entropy loss [4]–[9].…”
Section: Neural Program Repair
Mentioning (confidence: 99%)
“…The cross-entropy loss (a.k.a. log loss) is a measure from information theory, building upon entropy and calculating the difference between two probability distributions. In sequence generation, the cross-entropy loss calculates the difference between the generated tokens and the human-written patch tokens in a strict pairwise matching manner [11], [16], [17]. In program repair patches, a low cross-entropy value means that the generated patch is syntactically close to the ground-truth patch at the token level.…”
Section: Neural Program Repair
Mentioning (confidence: 99%)
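The strict pairwise (token-by-token) matching described in the statement above can be made concrete with a small sketch; the tensors, shapes, and vocabulary size below are hypothetical, not taken from the cited works.

```python
import torch
import torch.nn.functional as F

# Hypothetical example: model scores for a generated patch vs. the tokens of
# the human-written (ground-truth) patch.
batch, seq_len, vocab = 2, 6, 100
logits = torch.randn(batch, seq_len, vocab)            # model scores per position
reference = torch.randint(0, vocab, (batch, seq_len))  # ground-truth token ids

# Strict pairwise matching: position i of the generated sequence is compared
# only against the reference token at position i.
loss = F.cross_entropy(logits.reshape(-1, vocab), reference.reshape(-1))

# A low value means the generated patch is syntactically close to the
# ground-truth patch at the token level.
print(loss.item())
```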