Mayank Lohani scite author profile

Mayank Lohani

3Publications

21Citation Statements Received

34Citation Statements Given

How they've been cited

How they cite others

Affiliations

Vellore Institute of Technology University

Publications

Order By: Most citations

A generic framework for ontology-based information retrieval and image retrieval in web data

Vijayarajan

Dinakaran

Tejaswin

et al. 2016

Hum. Cent. Comput. Inf. Sci.

View full text Add to dashboard Cite

In the internet era, search engines play a vital role in information retrieval from web pages. Search engines arrange the retrieved results using various ranking algorithms. Additionally, retrieval is based on statistical searching techniques or content-based information extraction methods. It is still difficult for the user to understand the abstract details of every web page unless the user opens it separately to view the web content. This key point provided the motivation to propose and display an ontologybased object-attribute-value (O-A-V) information extraction system as a web model that acts as a user dictionary to refine the search keywords in the query for subsequent attempts. This first model is evaluated using various natural language processing (NLP) queries given as English sentences. Additionally, image search engines, such as Google Images, use content-based image information extraction and retrieval of web pages against the user query. To minimize the semantic gap between the image retrieval results and the expected user results, the domain ontology is built using image descriptions. The second proposed model initially examines natural language user queries using an NLP parser algorithm that will identify the subject-predicate-object (S-P-O) for the query. S-P-O extraction is an extended idea from the ontology-based O-A-V web model. Using this S-P-O extraction and considering the complex nature of writing SPARQL protocol and RDF query language (SPARQL) from the user point of view, the SPARQL auto query generation module is proposed, and it will auto generate the SPARQL query. Then, the query is deployed on the ontology, and images are retrieved based on the auto-generated SPARQL query. With the proposed methodology above, this paper seeks answers to following two questions. First, how to combine the use of domain ontology and semantics to improve information retrieval and user experience? Second, does this new unified framework improve the standard information retrieval systems? To answer these questions, a document retrieval system and an image retrieval system were built to test our proposed framework. The web document retrieval was tested against three key-words/bag-of-words models and a semantic ontology model. Image retrieval was tested on IAPR TC-12 benchmark dataset. The precision, recall and accuracy results were then compared against standard information retrieval systems using TREC_EVAL. The results indicated improvements over the standard systems. A controlled experiment was performed by test subjects querying the retrieval system in the absence and presence of our proposed framework. The queries were measured using two metrics, time and click-count. Vijayarajan et al. Hum. Cent. Comput. Inf. Sci. (2016) et al. Hum. Cent. Comput. Inf. Sci. (2016) 6:18 on the retrieval performed with and without our proposed framework. The results were encouraging. RESEARCHPage 2 of 30 Vijayarajan

show abstract

Ontology Based Object-Attribute-Value Information Extraction from Web Pages in Search Engine Result Retrieval

Vijayarajan

Dinakaran

Lohani

2014

View full text Add to dashboard Cite

Comparison of Sequence Models for Text Narration from Tabular Data

Lohani¹,

Dasari²,

Gunasekaran³

et al. 2022

View full text Add to dashboard Cite

This paper demonstrates our work on the survey of pre-trained transformer models for text narration from tabular data. Understanding the meaning of data from tables or any other data source requires human effort and time to interpret the content. In this era of internet where data is exponentially growing and massive improvement in technology, we propose an NLP (Natural Language Processing) based approach where we can generate the meaningful text from the table without the human intervention. In this paper we propose transformer-based models with the goal to generate natural human interpretable language text generated from the input tables. We propose transformer based pre-trained model that is trained with structured and context rich tables and their respective summaries. We present comprehensive comparison between different transformer-based models and conclude with mentioning key points and future research roadmap.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Mayank Lohani

A generic framework for ontology-based information retrieval and image retrieval in web data

Ontology Based Object-Attribute-Value Information Extraction from Web Pages in Search Engine Result Retrieval

Comparison of Sequence Models for Text Narration from Tabular Data

Contact Info

Product

Resources

About