2015 Eighth International Conference on Advances in Pattern Recognition (ICAPR)
DOI: 10.1109/icapr.2015.7050699
Text recognition using deep BLSTM networks

Abstract: This paper presents a Deep Bidirectional Long Short Term Memory (LSTM) based Recurrent Neural Network architecture for text recognition. This architecture uses Connectionist Temporal Classification (CTC) for training to learn the labels of an unsegmented sequence with unknown alignment. This work is motivated by the results of Deep Neural Networks for isolated numeral recognition and improved speech recognition using Deep BLSTM based approaches. Deep BLSTM architecture is chosen due to its ability to access lo…

Cited by 61 publications (25 citation statements)
References 26 publications
“…Artificial Neural Network (ANN) and more complex versions of Recurrent Neural Networks (RNN) such as Long Short Term Memory (LSTM) only work with numerical values. However (Ray, Rajeswar, & Chaud, 2015) demonstrated that a Deep Bidirectional Long Short Term Memory based RNN (BLSTM-RNN) can be used which provides promising results for text recognition. (Wang, Qian, Soong, He, & Zhao, 2015) further demonstrated this potential when a BLSTM-RNN was used in conjunction with Word Embedding, in such a way that phrases and vocabulary were mapped to vectors of real numbers, which proved to be an effective method for modelling and predicting sequential text.…”
Section: Methods Two: Bidirectional Long Short Term Memory Recurrent Nmentioning
confidence: 99%
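The word-embedding step described in the statement above maps vocabulary items to dense real-valued vectors that a sequence model such as a BLSTM-RNN can consume. A minimal sketch of such a lookup (the vocabulary, dimensionality, and random initialisation are illustrative assumptions; in practice the vectors are learned during training):

```python
# Word-embedding lookup: map tokens to dense vectors of real numbers.
# Vocabulary and dimensionality are illustrative; real embeddings are
# learned jointly with the model rather than randomly initialised.
import random

random.seed(0)

vocab = ["text", "recognition", "sequence"]
dim = 4
embeddings = {w: [random.uniform(-1.0, 1.0) for _ in range(dim)] for w in vocab}

def embed(tokens):
    """Convert a list of tokens into a list of dense vectors."""
    return [embeddings[t] for t in tokens]

vectors = embed(["text", "recognition"])
```

Each token becomes one fixed-length vector, so a two-token phrase yields a sequence of two vectors that the recurrent layers process timestep by timestep.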
“…A deep recurrent neural network is trained on perfectly segmented data and tests each of the candidate segments, generating unicode sequences. This work is an extension of the work on printed text recognition using Deep BLSTM wherein Deep BLSTM architecture for text recognition was proposed [1]. In the verification stage these unicode sequences are validated using a sub-string match with the language model and best first search is used to find the best possible combination of alternative hypothesis from the tree structure.…”
Section: Introductionmentioning
confidence: 99%
“…However, segmenting (Urdu and alike) cursive scripts into characters is a challenging task in itself. Recently, implicit segmentation using deep learning has been successfully investigated for recognition of Urdu text [23][24][25][26]. These techniques, however, require large training data and employ characters as units of recognition rather than ligatures or words.…”
Section: Analytical Approachesmentioning
confidence: 99%
“…While the initial endeavors primarily focused on recognition of isolated characters [6][7][8], a number of deep learning-based robust solutions [17,[23][24][25][26] have been proposed in the recent years. These methods mainly rely on implicit segmentation of characters and report high recognition rates.…”
Section: Motivationmentioning
confidence: 99%