Graph-based methods have been widely used by the document image analysis and recognition community, as the different objects and the content in document images is best represented by this powerful structural representation. Designing of novel computation tools for processing these graph-based structural representations has always remained a hot topic of research. Recently, Graph Neural Network (GNN) have been used for solving different problems in the domain of document image analysis and recognition. In this article we take forward the state of the art by presenting a new approach to gather the symbolic and numeric information from the nodes and edges of a graph. We use this information to learn a Graph Neural Network (GNN). The experimentation on the the recognition of handwritten letters and graphical symbols shows that the proposed approach is an interesting contribution to the growing set of GNN-based methods for document image analysis and recognition.
The recent success of graph neural networks (GNNs) in the area of pattern recognition (PR) has increased the interest of researchers to use these frameworks in non-euclidean structures. This noneuclidean structure includes graphs or manifolds that are called geometric deep learning (GDL). It has opened a new direction for researchers to deal with graphs using deep learning in document processing, outperforming conventional methods. We propose a Deep Graph Neural Network (DGNN) classifier-based on additive angular margin loss for the classification task in document analysis. Another contribution of this work is to investigate the performance of a DGNN as a classifier using different loss functions, which helps to minimize the loss for the document analysis problem. We compare additive angular margin loss, Cosine angular margin loss, and multiplicative angular margin loss. Furthermore, we give a comparison between the mentioned loss functions and the Softmax loss function. We also present the comparisons of results using different graph edit distance (GED) methods. Our quantitative results suggest, that by applying the additive angular marginal loss function makes more compact intra-class ability and increases the inter-class discrepancy which enhances the discriminating power of the DGNN. Enhancing the decision boundaries between the classes increase the intra-class compactness and inter-class discrimination power of the model.
The real-time availability of the Internet has engaged millions of users around the world. The usage of regional languages is being preferred for effective and ease of communication that is causing multilingual data on social networks and news channels. People share ideas, opinions, and events that are happening globally i.e., sports, inflation, protest, explosion, and sexual assault, etc. in regional (local) languages on social media. Extraction and classification of events from multilingual data have become bottlenecks because of resource lacking. In this research paper, we presented the event classification task for the Urdu language text existing on social media and the news channels by using machine learning classifiers. The dataset contains more than 0.1 million (102,962) labeled instances of twelve (12) different types of events. The title, its length, and the last four words of a sentence are used as features to classify the events. The Term Frequency-Inverse Document Frequency (tf-idf) showed the best results as a feature vector to evaluate the performance of the six popular machine learning classifiers. Random Forest (RF) and K-Nearest Neighbor (KNN) are among the classifiers that out-performed among other classifiers by achieving 98.00% and 99.00% accuracy, respectively. The novelty lies in the fact that the features aforementioned are not applied, up to the best of our knowledge, in the event extraction of the text written in the Urdu language.
In today’s world, higher security deployments are needed, as the expansion of the transportation system has accelerated with time. Road traffic disasters have become a widespread problem in recent years. With the tremendous increase in traffic accidents, the fatality rate among people is quickly expanding. Whenever a mishap occurs on the roadways, it becomes a devastating situation for the victims. As an output result of the proposed model in this article, critical notification regarding the scene of an accident and car number was successfully sent to the pre-programmed number after determining the accident scene. The relevant contacts that were configured and added to the system successfully received an emergency message, providing the exact geographic coordinates of the accident scene. Following the receipt of the message, an audio call with a recorded voice was made to the pre-defined number. Moreover, Global Positioning System (GPS) was used to get the coordinates from the satellite. For this purpose, Global System for Mobile Communications (GSM) was utilized to attain the (GPS) coordinates in the event of an accident. Following on, the current location of an automobile through (GPS) was transmitted to certain contact details that were pre-programmed within the application. The system also reported the severity of the accident, as well as whether a vehicle collided with another vehicle or a disaster occurred to the vehicle itself.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.