Tianxiao Shen scite author profile

Blank Language Models

We propose Blank Language Model (BLM), a model that generates sequences by dynamically creating and filling in blanks. The blanks control which part of the sequence to expand, making BLM ideal for a variety of text editing and rewriting tasks. The model can start from a single blank or partially completed text with blanks at specified locations. It iteratively determines which word to place in a blank and whether to insert new blanks, and stops generating when no blanks are left to fill. BLM can be efficiently trained using a lower bound of the marginal data likelihood. On the task of filling missing text snippets, BLM significantly outperforms all other baselines in terms of both accuracy and fluency. Experiments on style transfer and damaged ancient text restoration demonstrate the potential of this framework for a wide range of applications.
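The generation loop described in this abstract (pick a blank, choose a word, decide whether to open new blanks, stop when none remain) can be sketched in a few lines of Python. This is only a toy illustration, not the authors' implementation: the `BLANK` symbol, the `Action` tuple, `fill_blanks`, and `make_left_to_right_policy` are all hypothetical names, and the scripted policy stands in for the trained model, which would instead score blanks, words, and blank creation from the current canvas.

```python
from typing import Callable, List, Tuple

BLANK = "__"  # hypothetical blank symbol, chosen for this sketch

# Hypothetical action format: (index of the blank to fill, word to place,
# create a new blank to the left?, create a new blank to the right?)
Action = Tuple[int, str, bool, bool]
Policy = Callable[[List[str]], Action]


def fill_blanks(canvas: List[str], policy: Policy, max_steps: int = 100) -> List[str]:
    """Repeatedly fill blanks until none are left (or max_steps is reached)."""
    canvas = list(canvas)  # work on a copy
    for _ in range(max_steps):
        if BLANK not in canvas:
            break  # generation stops when no blanks remain
        pos, word, left, right = policy(canvas)
        if canvas[pos] != BLANK:
            raise ValueError(f"position {pos} is not a blank")
        # Replace the chosen blank by the word, optionally flanked by new blanks.
        replacement = ([BLANK] if left else []) + [word] + ([BLANK] if right else [])
        canvas[pos:pos + 1] = replacement
    return canvas


def make_left_to_right_policy(target: List[str]) -> Policy:
    """Scripted stand-in for a model: rebuilds `target` from a single blank,
    always placing the next word and keeping one blank to its right."""
    def policy(canvas: List[str]) -> Action:
        pos = canvas.index(BLANK)           # the only blank sits after the prefix
        more_to_come = pos + 1 < len(target)
        return pos, target[pos], False, more_to_come
    return policy


if __name__ == "__main__":
    # Start from a single blank.
    print(fill_blanks([BLANK], make_left_to_right_policy(["the", "cat", "sat"])))
    # → ['the', 'cat', 'sat']

    # Start from partially completed text with a blank at a specified location.
    print(fill_blanks(["a", BLANK, "c"], lambda c: (c.index(BLANK), "b", False, False)))
    # → ['a', 'b', 'c']
```

Because the policy is just a callable, the same loop covers both starting conditions mentioned in the abstract: a single blank, or partially completed text with blanks at chosen positions.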

Learning to Make Generalizable and Diverse Predictions for Retrosynthesis

Chen, Shen, Jaakkola, et al. 2019. Preprint.

Style Transfer from Non-Parallel Text by Cross-Alignment

Shen, Lei, Barzilay, et al. 2017. Preprint.

This paper focuses on style transfer on the basis of non-parallel text. This is an instance of a broad family of problems including machine translation, decipherment, and sentiment modification. The key challenge is to separate the content from other aspects such as style. We assume a shared latent content distribution across different text corpora, and propose a method that leverages refined alignment of latent representations to perform style transfer. The transferred sentences from one style should match example sentences from the other style as a population. We demonstrate the effectiveness of this cross-alignment method on three tasks: sentiment modification, decipherment of word substitution ciphers, and recovery of word order. Our code and data are available at https://github.com/shentianxiao/language-style-transfer.

Mixture Models for Diverse Machine Translation: Tricks of the Trade

Shen, Ott, Auli, et al. 2019. Preprint.

Forward ultra-low emission for power plants via wet electrostatic precipitators and newly developed demisters: Filterable and condensable particulate matters

Liang, Ding, et al. 2020. Atmospheric Environment.

Tianxiao Shen

Blank Language Models
