Multimedia search and retrieval using multimodal annotation propagation and indexing techniques

Lazaridis, Michalis; Axenopoulos, Apostolos; Rafailidis, Dimitrios; Daras, Petros

doi:10.1016/j.image.2012.04.001

Cited by 37 publications

(19 citation statements)

References 31 publications

“…One class of models are based on combination strategies: (1) combining low-level features from different modalities into concise multi-modal features. In [12], a manifold learning algorithm based on Laplacian Eigenmaps is introduced to combine low-level descriptors of each separate modality and map them to a new low-dimensional multi-modal feature space. In this feature space, semantically similar multi-modal data are represented by multi-modal descriptor vectors close to each other.…”

Section: Related Work (mentioning)

confidence: 99%

Topic correlation model for cross-modal multimedia information retrieval

Qin, Cong, et al. (2015). Pattern Anal Applic

In this paper, we present a simple and effective topic correlation model (TCM) for cross-modal multimedia retrieval that jointly models the text and image components of multimedia documents. In this model, the image component is represented by a bag-of-features model based on local scale-invariant feature transform features, while the text component is described by a topic distribution learned from a latent topic model. Statistical correlations between these two mid-level features are investigated by mapping them into a semantic space. These cross-modality correlations are used to calculate the conditional probabilities of answers in one modality given a query in the other modality. The model is tested on three cross-modal retrieval benchmark problems, including Wikipedia documents in both English and Chinese. Experimental results demonstrate that TCM achieves the best performance compared to recent state-of-the-art cross-modal retrieval models on the given benchmarks.

“…Hierarchical image partitioning method is used to detect scale of a live image by matching this image with the database images. Lazaridis et al (2013) designed a method to search and retrieve multimodal data. This framework links images to semantic annotation using some similarity measure.…”

Section: Related Work (mentioning)

confidence: 99%

Augmented reality system using lidar point cloud data for displaying dimensional information of objects on mobile phones

Gupta, Lohani (2014). ISPRS Ann. Photogramm. Remote Sens. Spatial Inf. Sci.

Abstract: A mobile augmented reality system is the next-generation technology for visualising the 3D real world intelligently. The technology is expanding at a fast pace, upgrading the smartphone into an intelligent device. The research problem addressed in this work is viewing the actual dimensions of objects captured by a smartphone in real time. The proposed methodology first establishes a correspondence between a LiDAR point cloud stored on a server and the image captured by a mobile phone. This correspondence is established using the exterior and interior orientation parameters of the mobile camera and the coordinates of the LiDAR data points that lie in the viewshed of the mobile camera. A pseudo-intensity image is generated from the LiDAR points and their intensity. The mobile image and the pseudo-intensity image are then registered using the SIFT image registration method, yielding a pipeline that locates the point in the point cloud corresponding to a point (pixel) on the mobile image. The second part of the method uses the point cloud data to compute dimensional information for pairs of points selected on the mobile image and overlays the dimensions on the image. This paper describes all steps of the proposed method. It uses an experimental setup that mimics the mobile phone and server system and presents some initial but encouraging results.
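The two stages described in this abstract (relating image pixels to LiDAR points, then measuring between the matched points) can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a simple pinhole camera with known orientation parameters, skips the pseudo-intensity image and SIFT registration step entirely, and matches a pixel to the LiDAR point whose projection lands nearest to it. All function and parameter names (`project_point`, `nearest_lidar_point`, `dimension_between`, `focal_px`, `principal`) are hypothetical.

```python
import math


def project_point(point, cam_pos, rotation, focal_px, principal):
    """Project a 3D world point to pixel coordinates with a pinhole model.

    cam_pos and rotation (3x3, world-to-camera) stand in for the exterior
    orientation; focal_px and principal stand in for the interior orientation.
    Returns None when the point lies behind the camera (outside the viewshed).
    """
    # Shift into camera-centred coordinates, then rotate into the camera frame.
    shifted = [p - c for p, c in zip(point, cam_pos)]
    x, y, z = (sum(r * s for r, s in zip(row, shifted)) for row in rotation)
    if z <= 0:
        return None
    return (principal[0] + focal_px * x / z, principal[1] + focal_px * y / z)


def nearest_lidar_point(pixel, points, cam_pos, rotation, focal_px, principal):
    """Return the LiDAR point whose projection falls closest to the pixel."""
    best, best_dist = None, math.inf
    for p in points:
        uv = project_point(p, cam_pos, rotation, focal_px, principal)
        if uv is None:
            continue  # not visible from this camera pose
        dist = math.dist(uv, pixel)
        if dist < best_dist:
            best, best_dist = p, dist
    return best


def dimension_between(pixel_a, pixel_b, points, cam_pos, rotation,
                      focal_px, principal):
    """Estimate the real-world distance between two selected pixels."""
    a = nearest_lidar_point(pixel_a, points, cam_pos, rotation, focal_px, principal)
    b = nearest_lidar_point(pixel_b, points, cam_pos, rotation, focal_px, principal)
    if a is None or b is None:
        raise ValueError("no visible LiDAR point for one of the pixels")
    return math.dist(a, b)  # Euclidean distance in point-cloud units
```

With an identity rotation, a camera at the origin, `focal_px=1000.0` and `principal=(500.0, 500.0)`, the points `(0, 0, 10)` and `(1, 0, 10)` project 100 pixels apart, and selecting those two pixels recovers a distance of one unit. A real system would need the registration step to correct errors in the orientation parameters before this lookup is reliable.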

“…The main difficulty in cross-media retrieval is to define a similarity measure among heterogeneous low-level features. In order to simultaneously search and retrieve data from multiple modalities, other approaches have been considered [10,11,12,13,14,15,16,17,18,19,20]. For instance in [16], it is experimentally shown that multimodal queries achieve higher retrieval accuracy than mono-modal ones.…”

Section: Introduction (mentioning)

confidence: 99%

“…The work in [10] suggests using a combination of ontology browsing and keyword-based querying. The methods presented in [11,14,15,16] use a similar approach and rely on the assumption that every document has an equal number of nearest neighbors for each of the modalities. However, such an assumption might degrade the retrieval performance as a document containing "image+text" may have many nearest neighbors in image modality, but not as many relevant textual data.…”

Section: Introduction (mentioning)

confidence: 99%

Search and retrieval of multi-modal data associated with image-parts

Pourian, Karthikeyan, Manjunath (2015). 2015 IEEE International Conference on Image Processing (ICIP)

We present a novel framework for querying multi-modal data from a heterogeneous database containing images, textual tags, and GPS coordinates. We construct a bi-layer graph structure using localized image-parts and associated GPS locations and textual tags from the database. The first layer graphs capture similar data points from a single modality using a spectral clustering algorithm. The second layer of our multi-modal network allows one to integrate the relationships between clusters of different modalities. The proposed network model enables us to use flexible multi-modal queries on the database.
