Yuzhuo Dai scite author profile

Yuzhuo Dai

4Publications

21Citation Statements Received

32Citation Statements Given

How they've been cited

How they cite others

Affiliations

Huazhong Agricultural University, Central South University, Tencent (China)

Publications

Order By: Most citations

Multi-Task Fusion via Reinforcement Learning for Long-Term User Satisfaction in Recommender Systems

Zhang

Liu

Dai

et al. 2022

View full text Add to dashboard Cite

Recommender System (RS) is an important online application that affects billions of users every day. The mainstream RS ranking framework is composed of two parts: a Multi-Task Learning model (MTL) that predicts various user feedback, i.e., clicks, likes, sharings, and a Multi-Task Fusion model (MTF) that combines the multi-task outputs into one final ranking score with respect to user satisfaction. There has not been much research on the fusion model while it has great impact on the final recommendation as the last crucial process of the ranking. To optimize long-term user satisfaction rather than obtain instant returns greedily, we formulate MTF task as Markov Decision Process (MDP) within a recommendation session and propose a Batch Reinforcement Learning (RL) based Multi-Task Fusion framework (BatchRL-MTF) that includes a Batch RL framework and an online exploration. The former exploits Batch RL to learn an optimal recommendation policy from the fixed batch data offline for long-term user satisfaction, while the latter explores potential highvalue actions online to break through the local optimal dilemma. With a comprehensive investigation on user behaviors, we model the user satisfaction reward with subtle heuristics from two aspects of user stickiness and user activeness. Finally, we conduct extensive experiments on a billion-sample level real-world dataset to show the effectiveness of our model. We propose a conservative offline policy estimator (Conservative-OPEstimator) to test our model offline. Furthermore, we take online experiments in a real recommendation environment to compare performance of different models. As one of few Batch RL researches applied in MTF task successfully, our model has also been deployed on a large-scale industrial short video platform, serving hundreds of millions of users.

show abstract

Deep learning for tracing esophageal motility function over time

Wang

Hou

et al. 2021

Computer Methods and Programs in Biomedicine

View full text Add to dashboard Cite

Attention graph convolutional nets for esophageal contraction pattern recognition in high-resolution manometries

Wang

Dai

et al. 2021

Biomedical Signal Processing and Control

View full text Add to dashboard Cite

GMFQP: An Ontology-mediated Gut Microbiota Federated Query Platform

Dai

Zhang

Tang

et al. 2022

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Yuzhuo Dai

Multi-Task Fusion via Reinforcement Learning for Long-Term User Satisfaction in Recommender Systems

Deep learning for tracing esophageal motility function over time

Attention graph convolutional nets for esophageal contraction pattern recognition in high-resolution manometries

GMFQP: An Ontology-mediated Gut Microbiota Federated Query Platform

Contact Info

Product

Resources

About