Yinpeng Guo scite author profile

Yinpeng Guo

5 Publications

18 Citation Statements Received

108 Citation Statements Given


Affiliations

Northwest Minzu University

Publications


Zero-Shot Paraphrase Generation with Multilingual Language Models

Guo¹, Liu², Jiang³ et al. 2019

Preprint


Leveraging multilingual parallel texts to automatically generate paraphrases has drawn much attention, as the size of high-quality paraphrase corpora is limited. Round-trip translation, also known as the pivoting method, is a typical approach to this end. However, we notice that the pivoting process involves multiple machine translation models and is likely to incur semantic drift during the two-step translation. In this paper, inspired by Transformer-based language models, we propose a simple and unified paraphrasing model, which is trained purely on multilingual parallel data and can conduct zero-shot paraphrase generation in one step. Compared with the pivoting approach, paraphrases generated by our model are more semantically similar to the input sentence. Moreover, since our model shares the same architecture as GPT (Radford and Sutskever, 2018), we are able to pre-train the model on a large-scale non-parallel corpus, which further improves the fluency of the output sentences. In addition, we introduce a denoising auto-encoder (DAE) mechanism to improve the diversity and robustness of the model. Experimental results show that our model surpasses the pivoting method in terms of relevance, diversity, fluency and efficiency.


Research on the Competitive Strategy of Two Sided Platform Enterprises Based on Hotelling Model

Wang, Guo 2017


PanGu-Coder: Program Synthesis with Function-Level Language Modeling

Christopoulou¹, Λάμπουρας², Gritta³ et al. 2022

Preprint


We present PANGU-CODER, a pretrained decoder-only language model adopting the PANGU-α architecture for text-to-code generation, i.e., the synthesis of programming language solutions given a natural language problem description. We train PANGU-CODER using a two-stage strategy: the first stage employs Causal Language Modelling (CLM) to pre-train on raw programming language data, while the second stage uses a combination of Causal Language Modelling and Masked Language Modelling (MLM) training objectives that focus on the downstream task of text-to-code generation and trains on loosely curated pairs of natural language program definitions and code functions. Finally, we discuss PANGU-CODER-FT, which is fine-tuned on a combination of competitive programming problems and code with continuous integration tests. We evaluate PANGU-CODER with a focus on whether it generates functionally correct programs and demonstrate that it achieves equivalent or better performance than similarly sized models, such as CodeX [16], while attending over a smaller context window and training on less data.


Training Multilingual Pre-trained Language Model with Byte-level Subwords

Wei¹, Li², Guo³ et al. 2021

Preprint


Pre-trained language models have achieved great success in various natural language understanding (NLU) tasks due to their capacity to capture deep contextualized information in text by pre-training on large-scale corpora. One of the fundamental components of pre-trained language models is the vocabulary, especially when training multilingual models on many different languages. In this technical report, we present our practices for training multilingual pre-trained language models with BBPE: Byte-Level BPE (i.e., Byte Pair Encoding). BBPE has been adopted by pre-trained language models like GPT-2/3 [1, 2] and RoBERTa [3], and its usage in machine translation has been discussed in [4]. We compared the byte-level vocabulary with the character-level vocabulary adopted in Google's multilingual BERT model through intensive case studies on tokenization in a variety of languages. In the experiments, we adopted the architecture of NEZHA [5] as the underlying pre-trained language model, and the results show that NEZHA trained with byte-level subwords consistently outperforms Google multilingual BERT and vanilla NEZHA by a notable margin in several multilingual NLU tasks. We release the source code of our byte-level vocabulary building tools and the multilingual pre-trained language models at the URLs given in the report (footnotes 1 and 2).
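The contrast between a character-level and a byte-level base vocabulary can be illustrated with a minimal sketch. This is not the report's released tooling; the function names `char_symbols`, `byte_symbols`, and `merge_pair` are hypothetical and exist only for illustration. The sketch assumes UTF-8 as the byte encoding and shows a single BPE-style merge step over byte symbols.

```python
from typing import List, Tuple


def char_symbols(text: str) -> List[str]:
    """Character-level view: one symbol per Unicode character."""
    return list(text)


def byte_symbols(text: str) -> List[str]:
    """Byte-level view: one symbol per UTF-8 byte, written as two hex digits.

    Any text in any language maps onto at most 256 distinct base symbols.
    """
    return [f"{b:02X}" for b in text.encode("utf-8")]


def merge_pair(symbols: List[str], pair: Tuple[str, str]) -> List[str]:
    """Apply one BPE-style merge: join every adjacent occurrence of `pair`."""
    merged = []
    i = 0
    while i < len(symbols):
        if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == pair:
            merged.append(symbols[i] + symbols[i + 1])
            i += 2
        else:
            merged.append(symbols[i])
            i += 1
    return merged


if __name__ == "__main__":
    text = "中文"
    print(char_symbols(text))   # two character symbols
    print(byte_symbols(text))   # six byte symbols (three bytes per character)
    print(merge_pair(byte_symbols("中"), ("E4", "B8")))
```

A character-level vocabulary needs an entry for every character it should cover, while the byte-level view starts from a fixed set of 256 symbols and builds longer subwords through merges like the one shown.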


MultiCoder: Multi-Programming-Lingual Pre-Training for Low-Resource Code Completion

Gong¹, Guo², Zhou³ et al. 2022

Preprint



