Popularity is often included in experimental evaluation to provide a reference performance for a recommendation task. To understand how the popularity baseline is defined and evaluated, we sample 12 papers from top-tier conferences including KDD, WWW, SIGIR, and RecSys, and 6 open-source toolkits. We note that the widely adopted MostPop baseline simply ranks items by the number of interactions in the training data. We argue that the current evaluation of popularity (i) does not reflect the items that are popular at the time a user interacts with the system, and (ii) may recommend items released after a user's last interaction with the system. On the widely used MovieLens dataset, we show that the performance of popularity can be improved by 70% or more if we consider the items that are popular at the time point when a user interacts with the system. We further show that, on the MovieLens dataset, users who rate fewer movies tend to follow the crowd and rate more popular movies, whereas movie lovers who rate a large number of movies do so based on their own preferences and interests. Through this study, we call for a re-visit of the popularity baseline in recommender systems to better reflect its effectiveness.
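The contrast between the conventional MostPop baseline and a time-aware popularity ranking can be illustrated with a minimal sketch. This is not the paper's implementation; we assume interactions are (user, item, timestamp) triples, and the function names `most_pop` and `time_aware_pop` are our own.

```python
from collections import Counter

def most_pop(interactions, k):
    # Conventional MostPop: rank items by total interaction count
    # over the entire training set, ignoring time.
    counts = Counter(item for _, item, _ in interactions)
    return [item for item, _ in counts.most_common(k)]

def time_aware_pop(interactions, t, k):
    # Time-aware variant (sketch): rank items by interactions observed
    # strictly before time t, so the ranking reflects what was popular
    # when the user actually interacted with the system and cannot
    # include items released after t.
    counts = Counter(item for _, item, ts in interactions if ts < t)
    return [item for item, _ in counts.most_common(k)]
```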
Recommender models are hard to evaluate, particularly in the offline setting. In this paper, we provide a comprehensive and critical analysis of the data leakage issue in recommender system offline evaluation. Data leakage is caused by not observing the global timeline when evaluating recommenders, e.g., the train/test data split does not follow the global timeline. As a result, a model learns from user-item interactions that would not be available at prediction time. We first show the temporal dynamics of user-item interactions along the global timeline, then explain why data leakage exists for collaborative filtering models. Through carefully designed experiments, we show that all models indeed recommend future items that are not yet available at the time point of a test instance, as a result of data leakage. The experiments are conducted with four widely used baseline models (BPR, NeuMF, SASRec, and LightGCN) on four popular offline datasets (MovieLens-25M, Yelp, Amazon-music, and Amazon-electronic), adopting the leave-last-one-out data split. We further show that data leakage does impact models' recommendation accuracy; their relative performance order thus becomes unpredictable under different amounts of leaked future data in training. To evaluate recommender systems realistically in the offline setting, we propose a timeline scheme, which calls for a revisit of recommendation model design.
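A minimal sketch may help contrast the leave-last-one-out split used in the experiments with a single-cutoff split that respects the global timeline. The paper's proposed timeline scheme is more general than this cutoff version; interactions are again assumed to be (user, item, timestamp) triples, and the function names are ours.

```python
from collections import defaultdict

def leave_last_one_out(interactions):
    # Per user: the last interaction (by timestamp) is the test instance;
    # all earlier ones go to training. Because test timestamps differ
    # across users, one user's training interactions can postdate another
    # user's test time -- the data leakage discussed above.
    by_user = defaultdict(list)
    for user, item, ts in interactions:
        by_user[user].append((item, ts))
    train, test = [], []
    for user, events in by_user.items():
        events.sort(key=lambda e: e[1])
        train.extend((user, item, ts) for item, ts in events[:-1])
        test.append((user, *events[-1]))
    return train, test

def timeline_split(interactions, t_split):
    # Global-timeline split (sketch): train only on interactions before
    # t_split and test on those at or after it, so no future interaction
    # can inform training.
    train = [x for x in interactions if x[2] < t_split]
    test = [x for x in interactions if x[2] >= t_split]
    return train, test
```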