18th International Conference on Pattern Recognition (ICPR'06) 2006
DOI: 10.1109/icpr.2006.169
|View full text |Cite
|
Sign up to set email alerts
|

A Robust Split-and-Merge Text Segmentation Approach for Images

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2008
2008
2024
2024

Publication Types

Select...
4
4

Relationship

1
7

Authors

Journals

citations
Cited by 17 publications
(4 citation statements)
references
References 7 publications
0
4
0
Order By: Relevance
“…The disadvantage is the long processing time and low efficiency. The representative algorithms include the connected-region algorithm [24], run-length smoothing algorithm [25], Voronoi diagram algorithm [26], etc. The main idea of the connected domain-based segmentation method is to first find all the connected domains in the image, and then merge them into a larger connected domain based on the intra-and inter-character and line spacing of the text [27,28].…”
Section: Traditional Layout Analysis Methodsmentioning
confidence: 99%
“…The disadvantage is the long processing time and low efficiency. The representative algorithms include the connected-region algorithm [24], run-length smoothing algorithm [25], Voronoi diagram algorithm [26], etc. The main idea of the connected domain-based segmentation method is to first find all the connected domains in the image, and then merge them into a larger connected domain based on the intra-and inter-character and line spacing of the text [27,28].…”
Section: Traditional Layout Analysis Methodsmentioning
confidence: 99%
“…Similarly, we must define relevant segments within audio recordings to create cohesive text units (e.g., from audio show segments). For large data, automatic segmentation procedures find cohesive text or audio units, commonly splitting or combining parts of an image or audio recording into cohesive text segments (Shinde and Chougule 2012; Smith 2007; Zhan, Wang, and Gao 2006). We therefore refer to errors related to creating coherent text segments as text-to-documents errors.…”
Section: The Total Corpus Quality Frameworkmentioning
confidence: 99%
“…Existing methods related to this problems are based on applying adaptive color reduction (ACR), Principal Component Analyzer (PCA) and Self-Organized Feature Map (SOFM) [3]. Zhan et al uses multiscale wavelet features and SVM classifier [4]. Other methods propose Globally Matched Wavelet Filters (GWM) with fisher classifiers [5] and a method insensitive to orientation, size and noise [6].…”
Section: Introductionmentioning
confidence: 99%