DOI: 10.32469/10355/93997
|View full text |Cite
|
Sign up to set email alerts
|

A system for large-scale image and video retrieval on everyday scenes

Abstract: There has been a growing amount of multimedia data generated on the web todayin terms of size and diversity. This has made accurate content retrieval with these large and complex collections of data a challenging problem. Motivated by the need for systems that can enable scalable and efficient search, we propose QIK (Querying Images Using Contextual Knowledge). QIK leverages advances in deep learning (DL) and natural language processing (NLP) for scene understanding to enable large-scale multimedia retrieval o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 66 publications
(112 reference statements)
0
1
0
Order By: Relevance
“…Secondly, with the exponential growth of cross-modal data, efficiently retrieving and querying relevant information from massive datasets across different modalities becomes an arduous task. In response to these challenges, researchers both domestically and internationally have conducted crossmodal retrieval research from various perspectives, yielding significant advancements in areas such as image-text retrieval and video-image-text retrieval [7,8,9,10,11]. This paper provides a comprehensive review of the latest research achievements over the past five years by taking image-text cross-modal retrieval as an example.…”
Section: Introductionmentioning
confidence: 99%
“…Secondly, with the exponential growth of cross-modal data, efficiently retrieving and querying relevant information from massive datasets across different modalities becomes an arduous task. In response to these challenges, researchers both domestically and internationally have conducted crossmodal retrieval research from various perspectives, yielding significant advancements in areas such as image-text retrieval and video-image-text retrieval [7,8,9,10,11]. This paper provides a comprehensive review of the latest research achievements over the past five years by taking image-text cross-modal retrieval as an example.…”
Section: Introductionmentioning
confidence: 99%