“…Early models like Matrix Factorization (MF) embed users and items in a shared latent space and model the user preference to an item as the inner product between user and item embeddings [17]. However, due to the complex interaction between users and items, the shallow representations in MF-based methods lack expressiveness to model features [11,15]. As deep learning developed, some recommendation approaches utilize neural networks to capture complex interaction behaviors, which enhance the performance of previous shallow models [4,5,9,11,15,42,46].…”