A Lexical, Syntactic, and Semantic Perspective for Understanding Style in Text

Verma, Gaurav; Srinivasan, Balaji Vasan

doi:10.48550/arxiv.1909.08349

Cited by 6 publications

(12 citation statements)

References 35 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In terms of style consistency, existing work only measures the style intensity using classifiers (Gao et al, 2019). However, the style of text is an amalgam, and differences between two styles are reflected in multiple linguistic dimensions (Verma and Srinivasan, 2019). Thus, we propose to evaluate the style of response from three perspectives:…”

Section: Evaluation Methodologymentioning

confidence: 99%

StyleDGPT: Stylized Response Generation with Pre-trained Language Models

Yang

et al. 2020

Preprint

View full text Add to dashboard Cite

Generating responses following a desired style has great potentials to extend applications of open-domain dialogue systems, yet is refrained by lacking of parallel data for training. In this work, we explore the challenging task with pre-trained language models that have brought breakthrough to various natural language tasks. To this end, we introduce a KL loss and a style classifier to the fine-tuning step in order to steer response generation towards the target style in both a word-level and a sentence-level. Comprehensive empirical studies with two public datasets indicate that our model can significantly outperform stateof-the-art methods in terms of both style consistency and contextual coherence.

show abstract

Section: Evaluation Methodologymentioning

confidence: 99%

StyleDGPT: Stylized Response Generation with Pre-trained Language Models

Yang

et al. 2020

Preprint

View full text Add to dashboard Cite

show abstract

“…Style parameters are features of text that are stylistically expressive. These parameters can be roughly identified at lexical (vocabulary and words), syntactic (sentence structure) and semantic (abstract meaning/emotion) levels (Verma and Srinivasan, 2019). We focus primarily on lexical and semantic features.…”

Section: Style Parametersmentioning

confidence: 99%

Style Control for Schema-Guided Natural Language Generation

Tsai¹,

Oraby²,

Perera³

et al. 2021

Preprint

View full text Add to dashboard Cite

Natural Language Generation (NLG) for taskoriented dialogue systems focuses on communicating specific content accurately, fluently, and coherently. While these attributes are crucial for a successful dialogue, it is also desirable to simultaneously accomplish specific stylistic goals, such as response length, pointof-view, descriptiveness, sentiment, formality, and empathy. In this work, we focus on stylistic control and evaluation for schema-guided NLG, with joint goals of achieving both semantic and stylistic control. We experiment in detail with various controlled generation methods for large pretrained language models: specifically, conditional training, guided fine-tuning, and guided decoding. We discuss their advantages and limitations, and evaluate them with a broad range of automatic and human evaluation metrics. Our results show that while high style accuracy and semantic correctness are easier to achieve for more lexicallydefined styles with conditional training, stylistic control is also achievable for more semantically complex styles using discriminatorbased guided decoding methods. The results also suggest that methods that are more scalable (with less hyper-parameters tuning) and that disentangle content generation and stylistic variations are more effective at achieving semantic correctness and style accuracy. * Work done as an intern at Amazon Alexa AI.1. We describe how we pre-process and annotate style parameters within the Schema-guided Dialogue (SGD) dataset (Rastogi et al., 2019).

show abstract

“…There has been significant work on understanding binary stylization along dimensions like formalinformal, positive-negative sentiment (Rao and Tetreault, 2018;Kessler et al, 1997;Pavlick and Tetreault, 2016;Collins-Thompson and Callan, 2005;Hovy, 1990;Inkpen and Hirst, 2006;Kantrowitz, 2003), however, there is limited work on understanding an author's writing style (Mc-Carthy et al, 2006;Forgeard, 2008;Verma and Srinivasan, 2019). While style can be a mixture of several factors including, but not limited to, lexical preferences, syntactic/sentential choices, discourse structure, narrative style, tone, we follow Syed et al (2020) and consider an author's style at three levels:…”

Section: Related Workmentioning

confidence: 99%

“…With recent advances in language modeling techniques that have resulted in powerful language models Devlin et al, 2018;Brown et al, 2020) along with an increased interest in stylized content generation, (Hu et al, 2017;Shen et al, 2017;Subramanian et al, 2018;Fu et al, 2018;Niu and Bansal, 2018), large language models have been successfully tuned to achieve text stylization (Lample et al, 2018;Ziegler et al, 2019;Syed et al, 2020;Singh et al, 2020). Apart from transferring an input text to the target style, which has received recent interest from the community, understanding and measuring style have been persistently explored over the last few decades (Kessler et al, 1997;Garera and Yarowsky, 2009;Liu, 2012;Verma and Srinivasan, 2019). Lying at the intersection of style transfer enabled by advanced language models and a deep understanding of style as a nuanced combination of several linguistic concepts, problems like stylized generation or stylized rewriting have gained further traction.…”

Section: Introductionmentioning

confidence: 99%

DRAG: Director-Generator Language Modelling Framework for Non-Parallel Author Stylized Rewriting

Singh¹,

Verma²,

Srinivasan³

2021

Preprint

Self Cite

View full text Add to dashboard Cite

Author stylized rewriting is the task of rewriting an input text in a particular author's style. Recent works in this area have leveraged Transformer-based language models in a denoising autoencoder setup to generate author stylized text without relying on a parallel corpus of data. However, these approaches are limited by the lack of explicit control of target attributes and being entirely data-driven. In this paper, we propose a Director-Generator framework to rewrite content in the target author's style, specifically focusing on certain target attributes. We show that our proposed framework works well even with a limitedsized target author corpus. Our experiments on corpora consisting of relatively small-sized text authored by three distinct authors show significant improvements upon existing works to rewrite input texts in target author's style. Our quantitative and qualitative analyses further show that our model has better meaning retention and results in more fluent generations.

show abstract

A Lexical, Syntactic, and Semantic Perspective for Understanding Style in Text

Cited by 6 publications

References 35 publications

StyleDGPT: Stylized Response Generation with Pre-trained Language Models

StyleDGPT: Stylized Response Generation with Pre-trained Language Models

Style Control for Schema-Guided Natural Language Generation

DRAG: Director-Generator Language Modelling Framework for Non-Parallel Author Stylized Rewriting

Contact Info

Product

Resources

About