Graph-Refined Convolutional Network for Multimedia Recommendation with Implicit Feedback

Wei, Yinwei; Wang, Xiang; Nie, Liqiang; He, Xiangnan; Chua, Tat-Seng

doi:10.1145/3394171.3413556

Cited by 233 publications

(223 citation statements)

References 32 publications

Supporting

Mentioning

223

Contrasting

Order By: Relevance

“…• GRCN [38] is also one of the state-of-the-arts multimodal recommendation methods. It refines user-item interaction graph by identifying the false-positive feedback and prunes the corresponding noisy edges in the interaction graph.…”

Section: Modelmentioning

confidence: 99%

“…MV-RNN [6] uses multimodal features for sequential recommendation in a recurrent framework. Recently, Graph Neural Networks (GNNs) have been introduced into recommendation systems [36,41,46] and especially multimodal recommendation systems [23,38,39]. MMGCN [39] constructs modal-specific graph and conduct graph convolutional operations, to capture the modal-specific user preference and distills the item representations simultaneously.…”

Section: Related Work 41 Multimodal Recommendationmentioning

confidence: 99%

“…In this way, the learned user representation can reflect the users' specific interests on items. Following MMGCN, GRCN [38] focuses on adaptively refining the structure of interaction graph to discover and prune potential false-positive edges.…”

Section: Related Work 41 Multimodal Recommendationmentioning

confidence: 99%

“…MMGCN [39] constructs modality-specific user-item interaction graphs to model user preferences specific to each modality. Following MMGCN, GRCN [38] utilizes multimodal features to refine user-item interaction graphs by identifying false-positive feedbacks and prunes the corresponding noisy edges.…”

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

Mining Latent Structures for Multimedia Recommendation

Zhang

Zhu

Liu

et al. 2021

Proceedings of the 29th ACM International Conference on Multimedia

107

View full text Add to dashboard Cite

Multimedia content is of predominance in the modern Web era. Investigating how users interact with multimodal items is a continuing concern within the rapid development of recommender systems. The majority of previous work focuses on modeling useritem interactions with multimodal features included as side information. However, this scheme is not well-designed for multimedia recommendation. Specifically, only collaborative item-item relationships are implicitly modeled through high-order item-user-item relations. Considering that items are associated with rich contents in multiple modalities, we argue that the latent semantic item-item structures underlying these multimodal contents could be beneficial for learning better item representations and further boosting recommendation. To this end, we propose a LATent sTructure mining method for multImodal reCommEndation, which we term LAT-TICE for brevity. To be specific, in the proposed LATTICE model, we devise a novel modality-aware structure learning layer, which learns item-item structures for each modality and aggregates multiple modalities to obtain latent item graphs. Based on the learned latent graphs, we perform graph convolutions to explicitly inject high-order item affinities into item representations. These enriched item representations can then be plugged into existing collaborative filtering methods to make more accurate recommendations. Extensive experiments on three real-world datasets demonstrate the superiority of our method over state-of-the-art multimedia recommendation methods and validate the efficacy of mining latent item-item relationships from multimodal features.

show abstract

Section: Modelmentioning

confidence: 99%

Section: Related Work 41 Multimodal Recommendationmentioning

confidence: 99%

Section: Related Work 41 Multimodal Recommendationmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Mining Latent Structures for Multimedia Recommendation

Zhang

Zhu

Liu

et al. 2021

Proceedings of the 29th ACM International Conference on Multimedia

107

View full text Add to dashboard Cite

show abstract

“…In recent years, the amount of searchable micro-videos has increased dramatically and exacerbated the need for recommender systems that can effectively mine users' preference and identify potentially interested micro-videos in a personalized manner. Due to the powerful representation learning capacity, the rapid development of deep learning techniques has nourished the research field of recommendation [17,24,33,41,42,57,58,62,65,67,68,70,73,74]. Such a development also gives rise to diverse models for video recommendation, which can be roughly categorized to collaborative filtering [2,29], content-based filtering [11,16,44,48,77], and hybrid ones [5,6,72].…”

Section: Introductionmentioning

confidence: 99%

Multi-trends Enhanced Dynamic Micro-video Recommendation

Lu,

Huang,

Zhang

et al. 2021

Preprint

View full text Add to dashboard Cite

The explosively generated micro-videos on content sharing platforms call for recommender systems to permit personalized microvideo discovery with ease. Recent advances in micro-video recommendation have achieved remarkable performance in mining users' current preference based on historical behaviors. However, most of them neglect the dynamic and time-evolving nature of users' preference, and the prediction on future micro-videos with historically mined preference may deteriorate the effectiveness of recommender systems. In this paper, we propose to explicitly model dynamic multi-trends of users' current preference and make predictions based on both the history and future potential trends. We devise the DMR framework, which comprises: 1) the implicit user network module which identifies sequence fragments from other users with similar interests and extracts the sequence fragments that are chronologically behind the identified fragments; 2) the multi-trend routing module which assigns each extracted sequence fragment into a trend group and update the corresponding trend vector; 3) the history-future trend prediction module jointly uses the history preference vectors and future trend vectors to yield the final click-through-rate. We validate the effectiveness of the proposed framework over multiple state-of-the-art micro-video recommenders on two publicly available real-world datasets. Relatively extensive analysis further demonstrate the superiority of modeling dynamic multi-trend for micro-video recommendation.

show abstract