2009
DOI: 10.1007/978-3-642-03348-3_33
|View full text |Cite
|
Sign up to set email alerts
|

A Hybrid Statistical Data Pre-processing Approach for Language-Independent Text Classification

Abstract: Abstract. Data pre-processing is an important topic in Text Classification (TC).It aims to convert the original textual data in a data-mining-ready structure, where the most significant text-features that serve to differentiate between textcategories are identified. Broadly speaking, textual data pre-processing techniques can be divided into three groups: (i) linguistic, (ii) statistical, and (iii) hybrid (i) & (ii). With regard to language-independent TC, our study relates to the statistical aspect only. The … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2011
2011
2011
2011

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 20 publications
0
1
0
Order By: Relevance
“…The data pre-processing is an important step to improve the discernment precision of network, which can increase the forecasting capability and is similar to adjusting expression levels measured by northern analysis [15]. However, it is often neglected in the data mining process.…”
Section: Sample Nomalizationmentioning
confidence: 99%
“…The data pre-processing is an important step to improve the discernment precision of network, which can increase the forecasting capability and is similar to adjusting expression levels measured by northern analysis [15]. However, it is often neglected in the data mining process.…”
Section: Sample Nomalizationmentioning
confidence: 99%