“…While the importance of the distribution of ratings on RS has been long recognized, e.g., [1,2,15,36], many popular methods based on latent factor models and recently introduced neural variants [3,14,20,22,25,39] optimize for the head of these distributions, potentially leading to large estimation errors for tail ratings. As we will show in Section 3, these tail estimation errors are common across multiple domains and datasets, leading to large overestimations of the ratings of items with very low ratings, and large under-estimations of the ratings of items with very high ratings.…”