Scene text recognition has been a hot research topic in computer vision due to its many applications. The state of the art is the attention-based encoder-decoder framework, which learns the mapping between input images and output sequences in a purely data-driven way. However, we observe that existing attention-based methods perform poorly on complicated and/or low-quality images. One major reason is that existing methods cannot obtain accurate alignments between feature areas and targets for such images; we call this phenomenon "attention drift". To tackle this problem, in this paper we propose FAN (Focusing Attention Network), which employs a focusing attention mechanism to automatically draw back the drifted attention. FAN consists of two major components: an attention network (AN) that recognizes character targets as in existing methods, and a focusing network (FN) that adjusts attention by evaluating whether AN attends properly to the target areas in the images. Furthermore, unlike existing methods, we adopt a ResNet-based network to enrich the deep representations of scene text images. Extensive experiments on various benchmarks, including the IIIT5k, SVT and ICDAR datasets, show that FAN substantially outperforms existing methods.
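The two-component design described above can be sketched compactly. The following PyTorch code is a minimal illustration only, not the authors' implementation: the module names, layer sizes and the exact form of the focusing loss are assumptions. AttentionStep plays the AN role (a standard content-based attention decoder step), while FocusingHead plays an FN-like role by re-classifying the attended glimpse, so the auxiliary loss grows whenever the attention weights drift away from the true character region.

import torch
import torch.nn as nn
import torch.nn.functional as F

class AttentionStep(nn.Module):
    # One decoding step of a content-based attention decoder (the AN role).
    def __init__(self, feat_dim, hid_dim, num_classes):
        super().__init__()
        self.score = nn.Linear(feat_dim + hid_dim, 1)        # alignment scorer
        self.rnn = nn.GRUCell(feat_dim + num_classes, hid_dim)
        self.cls = nn.Linear(hid_dim, num_classes)

    def forward(self, feats, state, prev_onehot):
        # feats: (B, T, feat_dim) encoder features; state: (B, hid_dim)
        B, T, _ = feats.shape
        s = state.unsqueeze(1).expand(B, T, -1)
        alpha = F.softmax(self.score(torch.cat([feats, s], -1)).squeeze(-1), -1)
        glimpse = torch.bmm(alpha.unsqueeze(1), feats).squeeze(1)  # attended feature
        state = self.rnn(torch.cat([glimpse, prev_onehot], -1), state)
        return self.cls(state), alpha, state

class FocusingHead(nn.Module):
    # FN-style auxiliary head (an assumption for illustration): classify the
    # attended glimpse directly, so the loss is large when attention drifts
    # off the character region.
    def __init__(self, feat_dim, num_classes):
        super().__init__()
        self.cls = nn.Linear(feat_dim, num_classes)

    def forward(self, feats, alpha, target):
        glimpse = torch.bmm(alpha.unsqueeze(1), feats).squeeze(1)
        return F.cross_entropy(self.cls(glimpse), target)

In such a sketch, the total training loss would be the decoder's cross-entropy plus a weighted focusing term, e.g. loss = ce_loss + lam * focus_loss, with the weight lam treated as a hyper-parameter.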
Recognizing text from natural images is a hot research topic in computer vision due to its many applications. Despite several decades of research on optical character recognition (OCR), recognizing text in natural images is still a challenging task, because scene texts often appear in irregular arrangements (e.g. curved, arbitrarily oriented or seriously distorted), which have not yet been well addressed in the literature. Existing text recognition methods mainly work with regular (horizontal and frontal) texts and cannot be trivially generalized to handle irregular texts. In this paper, we develop the arbitrary orientation network (AON) to directly capture the deep features of irregular texts, which are combined into an attention-based decoder to generate character sequences. The whole network can be trained end-to-end using only images and word-level annotations. Extensive experiments on various benchmarks, including the CUTE80, SVT-Perspective, IIIT5k, SVT and ICDAR datasets, show that the proposed AON-based method achieves state-of-the-art performance on irregular datasets and is comparable to major existing methods on regular datasets.
2. We design a filter gate (FG) for fusing four-direction features with the learned placement clues; that is, FG is responsible for generating the integrated feature sequence (a hedged sketch of such a gate follows this list).
3. We integrate AON, FG and an attention-based decoder into the character recognition framework. The whole network can be trained end-to-end without any character-level bounding box annotations.
4. We conduct extensive experiments on several public irregular and regular text benchmarks, which show that our method obtains state-of-the-art performance on irregular benchmarks and is comparable to major existing methods on regular benchmarks.
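Contribution 2 above describes a fusion step that lends itself to a small sketch. The code below is a hedged illustration under assumed shapes and layer choices, not the paper's exact architecture: four direction-specific feature sequences are weighted by learned per-timestep placement clues and summed into one integrated sequence.

import torch
import torch.nn as nn
import torch.nn.functional as F

class FilterGate(nn.Module):
    # Fuse four direction-specific feature sequences with learned placement
    # clues. The clue predictor and the tanh projection are assumptions made
    # purely for illustration.
    def __init__(self, feat_dim):
        super().__init__()
        self.clue = nn.Linear(4 * feat_dim, 4)   # 4-way weight per timestep
        self.proj = nn.Linear(feat_dim, feat_dim)

    def forward(self, f_left, f_right, f_up, f_down):
        # each input: (B, T, feat_dim) features read in one direction
        stack = torch.stack([f_left, f_right, f_up, f_down], dim=2)  # (B,T,4,D)
        clues = F.softmax(self.clue(stack.flatten(2)), dim=-1)       # (B,T,4)
        fused = (clues.unsqueeze(-1) * stack).sum(dim=2)             # (B,T,D)
        return torch.tanh(self.proj(fused))   # integrated feature sequence

The integrated sequence would then be fed to the attention-based decoder mentioned in contribution 3.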
We consider the scene text recognition problem under the attention-based encoder-decoder framework, which is the state of the art. Existing methods usually employ a frame-wise maximum likelihood loss to optimize the models. During training, the misalignment between the ground-truth strings and the attention's output sequences of probability distributions, caused by missing or superfluous characters, confuses and misleads the training process, and consequently makes training costly and degrades recognition accuracy. To handle this problem, we propose a novel method called edit probability (EP) for scene text recognition. EP effectively estimates the probability of generating a string from the output sequence of probability distributions conditioned on the input image, while considering possible occurrences of missing or superfluous characters. The advantage is that the training process can focus on the missing, superfluous and unrecognized characters, so the impact of the misalignment problem can be alleviated or even overcome. We conduct extensive experiments on standard benchmarks, including the IIIT-5K, Street View Text and ICDAR datasets. Experimental results show that EP can substantially boost scene text recognition performance.
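The EP idea of scoring a string against a sequence of per-frame distributions while tolerating missing and superfluous characters is reminiscent of an edit-distance-style dynamic program. The sketch below is a simplified variant, not the paper's formulation: it replaces EP's treatment of missing/superfluous characters with fixed penalties p_miss and p_extra, which are assumptions introduced purely for illustration.

import numpy as np

def edit_probability(probs, target, p_miss=1e-3, p_extra=1e-3):
    # probs: (T, C) array, one distribution over C classes per frame.
    # target: list of ground-truth class ids.
    # dp[t][n] = probability of producing the first n target characters from
    # the first t frames, allowing extra frames and missing characters.
    T, N = len(probs), len(target)
    dp = np.zeros((T + 1, N + 1))
    dp[0][0] = 1.0
    for n in range(1, N + 1):                  # leading characters missing
        dp[0][n] = dp[0][n - 1] * p_miss
    for t in range(1, T + 1):
        dp[t][0] = dp[t - 1][0] * p_extra      # leading frames superfluous
        for n in range(1, N + 1):
            dp[t][n] = (dp[t - 1][n - 1] * probs[t - 1][target[n - 1]]  # match
                        + dp[t - 1][n] * p_extra                        # extra frame
                        + dp[t][n - 1] * p_miss)                        # missing char
    return dp[T][N]

# Example: three frames over alphabet {a, b}; ground truth "ab" = [0, 1]
probs = np.array([[0.9, 0.1], [0.2, 0.8], [0.5, 0.5]])
print(edit_probability(probs, [0, 1]))

Training would then minimize the negative log of this quantity instead of a frame-wise cross-entropy, so gradients concentrate on the frames responsible for missing, superfluous or misrecognized characters.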
Plant functional traits provide information about adaptations to climate and environmental conditions, and can be used to explore the existence of alternative plant strategies within ecosystems. Trait data are also increasingly being used to provide parameter estimates for vegetation models. Here we present a new database of plant functional traits from China. Most global climate and vegetation types can be found in China, and thus the database is relevant for global modeling. The China Plant Trait Database contains information on morphometric, physical, chemical, and photosynthetic traits from 122 sites spanning the range from boreal to tropical, and from deserts and steppes through woodlands and forests, including montane vegetation. Data collection at each site was based either on sampling the dominant species or on a stratified sampling of each ecosystem layer. The database contains information on 1,215 unique species, though many species have been sampled at multiple sites. The original field identifications have been taxonomically standardized to the Flora of China. Similarly, derived photosynthetic traits, such as electron-transport and carboxylation capacities, were calculated using a standardized method. To facilitate trait-environment analyses, the database also contains detailed climate and vegetation information for each site. The data set is released under a Creative Commons BY license. When using the data set, we kindly request that you cite this article, recognizing the hard work that went into collecting the data and the authors' willingness to make it publicly available.