Deep Interest Network for Click-Through Rate Prediction

Zhou, Guorui; Zhu, Xiaoqiang; Song, Chenru; Fan, Ying; Zhu, Han; Ma, Xiao; Yan, Yanghui; Jin, Junhua; Li, Han; Gai, Kun

doi:10.1145/3219819.3219823

Cited by 1,614 publications

(1,137 citation statements)

References 21 publications

Supporting

Mentioning

1,137

Contrasting

Order By: Relevance

“…Ni et al [20] adopt LSTM and the attention mechanism to model the user behavior sequence. Compared to sequence-independent approaches, these methods can significantly improve the CTR prediction accuracy and most of these techniques have been deployed in real-world applications [20,32,38,39].…”

Section: Context Aware Personalization Modelmentioning

confidence: 99%

“…Utilizing multiple modality features is often effective to improve the performance of CTR tasks. A straightforward way [20,38,39] is to concatenate the multiple modality features, which is equivalent to giving a fixed importance weight to each modality regardless of different items. A conceivable improvement [14,31] is to dynamically distinguish the contributions of different modalities through an attention mechanism.…”

Section: Multimodal Attention Networkmentioning

confidence: 99%

See 1 more Smart Citation

Adversarial Multimodal Representation Learning for Click-Through Rate Prediction

Wang

Tan

et al. 2020

Proceedings of the Web Conference 2020

View full text Add to dashboard Cite

For better user experience and business effectiveness, Click-Through Rate (CTR) prediction has been one of the most important tasks in E-commerce. Although extensive CTR prediction models have been proposed, learning good representation of items from multimodal features is still less investigated, considering an item in E-commerce usually contains multiple heterogeneous modalities. Previous works either concatenate the multiple modality features, that is equivalent to giving a fixed importance weight to each modality; or learn dynamic weights of different modalities for different items through technique like attention mechanism. However, a problem is that there usually exists common redundant information across multiple modalities. The dynamic weights of different modalities computed by using the redundant information may not correctly reflect the different importance of each modality. To address this, we explore the complementarity and redundancy of modalities by considering modality-specific and modality-invariant features differently. We propose a novel Multimodal Adversarial Representation Network (MARN) for the CTR prediction task. A multimodal attention network first calculates the weights of multiple modalities for each item according to its modality-specific features. Then a multimodal adversarial network learns modalityinvariant representations where a double-discriminators strategy is introduced. Finally, we achieve the multimodal item representations by combining both modality-specific and modality-invariant representations. We conduct extensive experiments on both public and industrial datasets, and the proposed method consistently achieves remarkable improvements to the state-of-the-art methods. Moreover, the approach has been deployed in an operational E-commerce system and online A/B testing further demonstrates the effectiveness.

show abstract

Section: Context Aware Personalization Modelmentioning

confidence: 99%

Section: Multimodal Attention Networkmentioning

confidence: 99%

Adversarial Multimodal Representation Learning for Click-Through Rate Prediction

Wang

Tan

et al. 2020

Proceedings of the Web Conference 2020

View full text Add to dashboard Cite

show abstract

“…And hereby we introduce our Interactive Attention Mechanism. Unlike the attention mechanism in [43] and [44] which uses the target item to query the interacted items sequence, we utilize dual sequences information at the same time interactively to weigh across different time slice. The attention value of each time slice β t is calculated as,…”

Section: Interactive Dual Sequence Modelingmentioning

confidence: 99%

“…The inference procedure is illustrated in Figure 4. As for the loss function, we take an end-to-end training and introduce (i) the widely used cross entropy loss L ce [25,43,44] over the whole training dataset and (ii) the parameter regularization L r . We utilize Adam algorithm for optimization.…”

Section: Final Prediction and Loss Functionsmentioning

confidence: 99%

Sequential Recommendation with Dual Side Neighbor-based Collaborative Relation Modeling

Qin

Ren

et al. 2020

Proceedings of the 13th International Conference on Web Search and Data Mining

View full text Add to dashboard Cite

Sequential recommendation task aims to predict user preference over items in the future given user historical behaviors. The order of user behaviors implies that there are resourceful sequential patterns embedded in the behavior history which reveal the underlying dynamics of user interests. Various sequential recommendation methods are proposed to model the dynamic user behaviors. However, most of the models only consider the user's own behaviors and dynamics, while ignoring the collaborative relations among users and items, i.e., similar tastes of users or analogous properties of items. Without modeling collaborative relations, those methods suffer from the lack of recommendation diversity and thus may have worse performance. Worse still, most existing methods only consider the user-side sequence and ignore the temporal dynamics on the item side. To tackle the problems of the current sequential recommendation models, we propose Sequential Collaborative Recommender (SCoRe) which effectively mines high-order collaborative information using cross-neighbor relation modeling and, additionally utilizes both user-side and item-side historical sequences to better capture user and item dynamics. Experiments on three real-world yet large-scale datasets demonstrate the superiority of the proposed model over strong baselines.

show abstract

“…Secondly, we resort to the multi-task learning with multi-modal data to handle the sparsity issue. It has become a common practice for industrial applications to leverage the useful information across related tasks to make up for the data sparsity in individual task [12,29,37]. In e-commerce, available data sources often include customer view, purchase, search, substitution records, as well as product descriptions and hierarchical category information.…”

mentioning

confidence: 99%

Product Knowledge Graph Embedding for E-commerce

Ruan

Körpeoğlu

et al. 2020

Proceedings of the 13th International Conference on Web Search and Data Mining

View full text Add to dashboard Cite

In this paper, we propose a new product knowledge graph (PKG) embedding approach for learning the intrinsic product relations as product knowledge for e-commerce. We define the key entities and summarize the pivotal product relations that are critical for general e-commerce applications including marketing, advertisement, search ranking and recommendation. We first provide a comprehensive comparison between PKG and ordinary knowledge graph (KG) and then illustrate why KG embedding methods are not suitable for PKG learning. We construct a self-attention-enhanced distributed representation learning model for learning PKG embeddings from raw customer activity data in an end-to-end fashion. We design an effective multi-task learning schema to fully leverage the multi-modal e-commerce data. The Poincaré embedding is also employed to handle complex entity structures. We use a real-world dataset from grocery.walmart.com to evaluate the performances on knowledge completion, search ranking and recommendation. The proposed approach compares favourably to baselines in knowledge completion and downstream tasks. CCS CONCEPTS• Information systems → Data mining; Web searching and information discovery; Retrieval models and ranking.

show abstract

Deep Interest Network for Click-Through Rate Prediction

Cited by 1,614 publications

References 21 publications

Adversarial Multimodal Representation Learning for Click-Through Rate Prediction

Adversarial Multimodal Representation Learning for Click-Through Rate Prediction

Sequential Recommendation with Dual Side Neighbor-based Collaborative Relation Modeling

Product Knowledge Graph Embedding for E-commerce

Contact Info

Product

Resources

About