2017 IEEE International Conference on Image Processing (ICIP) 2017
DOI: 10.1109/icip.2017.8296311
|View full text |Cite
|
Sign up to set email alerts
|

Multi-modal joint embedding for fashion product retrieval

Abstract: Finding a product in the fashion world can be a daunting task. Everyday, e-commerce sites are updating with thousands of images and their associated metadata (textual information), deepening the problem, akin to finding a needle in a haystack. In this paper, we leverage both the images and textual metadata and propose a joint multi-modal embedding that maps both the text and images into a common latent space. Distances in the latent space correspond to similarity between products, allowing us to effectively pe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
4
3
2
1

Relationship

0
10

Authors

Journals

citations
Cited by 17 publications
(8 citation statements)
references
References 22 publications
0
8
0
Order By: Relevance
“…The matched images can be recommended as design references. The retrieval input can be either text, images, or both [161][162][163][164].…”
Section: Spark Creative Inspirationmentioning
confidence: 99%
“…The matched images can be recommended as design references. The retrieval input can be either text, images, or both [161][162][163][164].…”
Section: Spark Creative Inspirationmentioning
confidence: 99%
“…Designers and users put forward their requirements through images and text, search for related product images from databases or e-commerce websites, and the matched images will be recommended to designers and users as design references. The retrieval input can be text, images, or both of them [162,163,164,165,166]. For product, the input image provided by designers and users may be taken by their phone on the street or in a store, which is quite different from image databases and e-commerce websites in terms of shooting angle, condition, background, or posture [167,168,169,170,171].…”
Section: Product Design Based On Image Datamentioning
confidence: 99%
“…Deep Metric Learning. Metric learning using deep models is a well-studied problem with many applications [3,25,28,34], especially where the output space is very large. Early approaches are based upon Siamese networks [7] with contrastive loss on pairwise data or relative triplet similarity comparisons [12,28].…”
Section: Related Workmentioning
confidence: 99%