“…In recent years, the amount of searchable micro-videos has increased dramatically and exacerbated the need for recommender systems that can effectively mine users' preference and identify potentially interested micro-videos in a personalized manner. Due to the powerful representation learning capacity, the rapid development of deep learning techniques has nourished the research field of recommendation [17,24,33,41,42,57,58,62,65,67,68,70,73,74]. Such a development also gives rise to diverse models for video recommendation, which can be roughly categorized to collaborative filtering [2,29], content-based filtering [11,16,44,48,77], and hybrid ones [5,6,72].…”