Twitter has become one of the main information-sharing platforms for millions of users worldwide. Numerous tweets are created daily, many with highly time-sensitive content such as breaking news, new multimedia content, or personal updates. Consequently, accurately recommending relevant tweets to users in a timely manner is an important and challenging problem. The 2020 ACM RecSys Challenge aims to benchmark leading recommendation models for this task. The challenge is based on a large and recent dataset of over 200M tweet engagements released by Twitter, with content in over 50 languages. In this work we present our approach, which leverages recent advances in deep language modeling and attention architectures to combine information from extracted features, user engagement history, and target tweet content. We first fine-tune the leading multilingual language models M-BERT and XLM-R on Twitter data. Embeddings from these models are used to extract tweet and user history representations. We then combine all components and jointly train them to maximize engagement prediction accuracy. Our approach achieves highly competitive performance, placing 2nd on the final private leaderboard. Full code is available here: https://github.com/layer6ai-labs/RecSys2020.
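A minimal sketch of the general idea described above, not the authors' exact pipeline: pool XLM-R token embeddings into a tweet representation and fuse it with engineered features in a small feed-forward head that predicts per-type engagement probabilities. The `EngagementHead` name, feature dimension, mean-pooling choice, and MLP sizes are illustrative assumptions; the `xlm-roberta-base` checkpoint and Hugging Face `transformers` calls are standard.

```python
import torch
import torch.nn as nn
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("xlm-roberta-base")
encoder = AutoModel.from_pretrained("xlm-roberta-base")

class EngagementHead(nn.Module):
    """Hypothetical fusion head: tweet embedding + engineered features -> engagement logits."""
    def __init__(self, text_dim=768, feat_dim=64, n_targets=4):
        super().__init__()
        # n_targets=4 corresponds to the challenge's engagement types:
        # reply, retweet, retweet-with-comment, like.
        self.mlp = nn.Sequential(
            nn.Linear(text_dim + feat_dim, 256),
            nn.ReLU(),
            nn.Linear(256, n_targets),
        )

    def forward(self, tweet_emb, features):
        return self.mlp(torch.cat([tweet_emb, features], dim=-1))

def embed_tweets(texts):
    # Mean-pool the final hidden states over non-padding tokens.
    batch = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")
    with torch.no_grad():
        hidden = encoder(**batch).last_hidden_state           # (B, T, 768)
    mask = batch["attention_mask"].unsqueeze(-1).float()       # (B, T, 1)
    return (hidden * mask).sum(1) / mask.sum(1)                # (B, 768)

head = EngagementHead()
tweet_emb = embed_tweets(["RecSys 2020 challenge results are out!"])
features = torch.randn(1, 64)   # placeholder for engineered features
probs = torch.sigmoid(head(tweet_emb, features))  # per-engagement-type probabilities
```

In the full approach, the encoder and the prediction head would be trained jointly on the engagement labels rather than using frozen embeddings as in this sketch.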
CCS CONCEPTS: • Information systems → Recommender systems; • Computing methodologies → Neural networks.