Proceedings of the First EAI International Conference on Computer Science and Engineering 2017
DOI: 10.4108/eai.27-2-2017.152280
|View full text |Cite
|
Sign up to set email alerts
|

Text Segmentation for Analysing Different Languages

Abstract: Abstract. Over the past several years, researchers have applied different methods of text segmentation. Text segmentation is defined as a method of splitting a document into smaller segments, assuming with its own relevant meaning. Those segments can be classified into the tag, word, sentence, topic, phrase and any information unit. Firstly, this study reviews the different types of text segmentation methods used in different types of documentation, and later discusses the various reasons for utilizing it in o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2019
2019
2019
2019

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 21 publications
0
1
0
Order By: Relevance
“…They regarded that the text segmentation is more important than word segmentation while reading a Chinese ancient text for DH research. Actually, text segmentation will also influence the accuracy of Chinese word segmentation of an ancient text (Pak and Teh, 2016). Therefore, developing a text segmentation scheme with high accuracy for Chinese ancient texts is a more urgent issue than word segmentation, particularly in using the ATAS to support reading Chinese ancient text without punctuations.…”
Section: Discussionmentioning
confidence: 99%
“…They regarded that the text segmentation is more important than word segmentation while reading a Chinese ancient text for DH research. Actually, text segmentation will also influence the accuracy of Chinese word segmentation of an ancient text (Pak and Teh, 2016). Therefore, developing a text segmentation scheme with high accuracy for Chinese ancient texts is a more urgent issue than word segmentation, particularly in using the ATAS to support reading Chinese ancient text without punctuations.…”
Section: Discussionmentioning
confidence: 99%