Bridging the Ultimate Semantic Gap

Jiang, Lu; Yu, Shoou-I; Meng, Deyu; Mitamura, Teruko; Hauptmann, Alexander G.

doi:10.1145/2671188.2749399

Cited by 50 publications

(5 citation statements)

References 40 publications


“…To deal with out-of-vocabulary (OOV) problem, query expansion with ontology influencing [47] and webly-label learning by crawling online visual data [18] are commonly adopted. Despite numerous progress [8,19,35], automatic selection of concepts to capture query semantics as well as context remains highly difficult. Human intervention is often required in practice, for example, by manually removing undesired concepts after automatic matching [34] or by hand-picking query phrases that should be matched with concepts [45].…”

Section: Related Work (mentioning)

confidence: 99%

Interpretable Embedding for Ad-Hoc Video Search

Ngo

2020

Proceedings of the 28th ACM International Conference on Multimedia


Answering query with semantic concepts has long been the mainstream approach for video search. Until recently, its performance is surpassed by concept-free approach, which embeds queries in a joint space as videos. Nevertheless, the embedded features as well as search results are not interpretable, hindering subsequent steps in video browsing and query reformulation. This paper integrates feature embedding and concept interpretation into a neural network for unified dual-task learning. In this way, an embedding is associated with a list of semantic concepts as an interpretation of video content. This paper empirically demonstrates that, by using either the embedding features or concepts, considerable search improvement is attainable on TRECVid benchmarked datasets. Concepts are not only effective in pruning false positive videos, but also highly complementary to concept-free search, leading to large margin of improvement compared to state-of-the-art approaches.

CCS Concepts: Computing methodologies → Neural networks; Information systems → Video search.



“…In [39], the authors propose a state of art system search engine for video event search without any submitted example videos to the query which is called zero-example or 0Ex. The system consists of video semantic indexing component, semantic query generation component, multimedia search, and pseudo-relevance.…”

Section: Recent Approaches 1) Zero-shot (mentioning)

confidence: 99%

“…It orders the event concept vector scores in descending order, constructing an exponential curve, so that then selects the first k concepts. This paper uses a large concept pool that consists of 13.488 semantic concepts, the MAP of the proposed system is slightly better than the other systems such as [39]. A real-time video retrieval can be applied to this approach because the concept detection scores are previously measured to the database of videos and the semantic similarity computing is fast.…”

Section: Recent Approaches 1) Zero-shot (mentioning)

confidence: 99%

Semantic-Based Video Retrieval Survey

Toriah, Ghalwash, Youssif

2018

JCC


There is a tremendous growth of digital data due to the stunning progress of digital devices which facilitates capturing them. Digital data include image, text, and video. Video represents a rich source of information. Thus, there is an urgent need to retrieve, organize, and automate videos. Video retrieval is a vital process in multimedia applications such as video search engines, digital museums, and video-on-demand broadcasting. In this paper, the different approaches of video retrieval are outlined and briefly categorized. Moreover, the different methods that bridge the semantic gap in video retrieval are discussed in more details.


“…The use of automatic detectors means that all data is included in the search, whether the user has supplied tags or not. The search engine would enable the user to prioritize the various filters to sort the data [as in 70,141]. (Alternatively, the user could start with a similarity search, if they already had some relevant examples on hand.)…”

Section: The Proposed Dataset-building Process (mentioning)

confidence: 99%

Field Studies with Multimedia Big Data: Opportunities and Challenges (Extended Version)

Krell, Bernd, Li et al.

2017

Preprint


Social multimedia users are increasingly sharing all kinds of data about the world. They do this for their own reasons, not to provide data for field studies, but the trend presents a great opportunity for scientists. The Yahoo Flickr Creative Commons 100 Million (YFCC100M) dataset comprises 99 million images and nearly 800 thousand videos from Flickr, all shared under Creative Commons licenses. To enable scientists to leverage these media records for field studies, we propose a new framework that extracts targeted subcorpora from the YFCC100M, in a format usable by researchers who are not experts in big data retrieval and processing.

