Proceedings of the Third International Symposium on Women in Computing and Informatics 2015
DOI: 10.1145/2791405.2791555
|View full text |Cite
|
Sign up to set email alerts
|

Applications of Text Detection and its Challenges

Abstract: The rising need for automation of systems has effected the development of text detection and recognition from images to a large extent. Text recognition has a wide range of applications, each with scenario dependent challenges and complications. How can these challenges be mitigated? What image processing techniques can be applied to make the text in the image machine readable? How can text be localized and separated from non textual information? How can the text image be converted to digital text format? This… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
1
0

Year Published

2016
2016
2024
2024

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 13 publications
(1 citation statement)
references
References 74 publications
0
1
0
Order By: Relevance
“…continued on following page challenges,solutions,andconstraints.Further,theydiscussedcontentforms,imageminingmethods, andlanguage/scriptidentificationandclassificationschemes,whichwerecomparedforcriteriaof languagesused,scriptandlanguagedetection,featureextraction,grayscaleorcoloredimage,font variation,resolution,printerorscannertype,andclassifier.Theirobservationsfoundthemaximum contributionintextualcontentformwithmonoandmulti-lingualdocumentsalongwiththescript identification,anduseof300DPI,grayscale,andSVMclassifier.Pal(2014)discussedareview onlanguageandscriptidentificationmethodswithfont-cum-stylerecognitionandfurtherprovided languageoverview,origin,difficulties,singleandmulti-scriptidentificationtechniquesforprintedand handwrittendocuments,challenges,andfinally,fontstyle,generation,variation,andtheirrecognition methods Nevetha and Baskar (2015). demonstrated text detection applications and techniques withtheirchallengesongeneral,scientific,unconstrainedandscenedocumentimagesoftextual information.Theyfurtherdiscussedtextrecognitionphasesofpreprocessing,segmentation,feature extraction,andrecognition.Felhi,TabboneandSegovia(2014)providedamulti-scalestroke-based pagesegmentationapproachtogetthetext,lines,photosandbackground.Theyfollowedthesteps ofglobalstrokewidthvariation-basedtextandlineCCdetection,imagesegmentationintophoto andbackgroundregionswithactivecontourmodel,textclassification,lineseparation,textcandidate clustering by mean-shift analysis, and finally, horizontal and vertical text regions separation word Recognition and Spotting Thissectionillustratesvariouswordrecognitionandspottingmethods.Thescalable,statistical,script independentline-basedwordspottingmethodperformedminimumpreprocessing,nosegmentation, fillermodelcreationinnon-keywordregions,featureextraction,and,finally,HiddenMarkovModel (HMM)basedrecognition(Wshah,KumarandGovindaraju,2014).Thismethodhasbeentested onEnglishdocumentsfromIAMdatasets,ArabicdocumentsfromAMAdatasets,andDevanagari documentsfromLAWdatasetsandfoundsystemcomplexityofO(K 2 L)+(R 2 L*)usinglexicon-…”
mentioning
confidence: 99%
“…continued on following page challenges,solutions,andconstraints.Further,theydiscussedcontentforms,imageminingmethods, andlanguage/scriptidentificationandclassificationschemes,whichwerecomparedforcriteriaof languagesused,scriptandlanguagedetection,featureextraction,grayscaleorcoloredimage,font variation,resolution,printerorscannertype,andclassifier.Theirobservationsfoundthemaximum contributionintextualcontentformwithmonoandmulti-lingualdocumentsalongwiththescript identification,anduseof300DPI,grayscale,andSVMclassifier.Pal(2014)discussedareview onlanguageandscriptidentificationmethodswithfont-cum-stylerecognitionandfurtherprovided languageoverview,origin,difficulties,singleandmulti-scriptidentificationtechniquesforprintedand handwrittendocuments,challenges,andfinally,fontstyle,generation,variation,andtheirrecognition methods Nevetha and Baskar (2015). demonstrated text detection applications and techniques withtheirchallengesongeneral,scientific,unconstrainedandscenedocumentimagesoftextual information.Theyfurtherdiscussedtextrecognitionphasesofpreprocessing,segmentation,feature extraction,andrecognition.Felhi,TabboneandSegovia(2014)providedamulti-scalestroke-based pagesegmentationapproachtogetthetext,lines,photosandbackground.Theyfollowedthesteps ofglobalstrokewidthvariation-basedtextandlineCCdetection,imagesegmentationintophoto andbackgroundregionswithactivecontourmodel,textclassification,lineseparation,textcandidate clustering by mean-shift analysis, and finally, horizontal and vertical text regions separation word Recognition and Spotting Thissectionillustratesvariouswordrecognitionandspottingmethods.Thescalable,statistical,script independentline-basedwordspottingmethodperformedminimumpreprocessing,nosegmentation, fillermodelcreationinnon-keywordregions,featureextraction,and,finally,HiddenMarkovModel (HMM)basedrecognition(Wshah,KumarandGovindaraju,2014).Thismethodhasbeentested onEnglishdocumentsfromIAMdatasets,ArabicdocumentsfromAMAdatasets,andDevanagari documentsfromLAWdatasetsandfoundsystemcomplexityofO(K 2 L)+(R 2 L*)usinglexicon-…”
mentioning
confidence: 99%