2016
DOI: 10.1007/978-981-10-2585-3_2
|View full text |Cite
|
Sign up to set email alerts
|

A Comparative Study of Text Preprocessing Techniques for Natural Language Call Routing

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
3
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 7 publications
(3 citation statements)
references
References 17 publications
0
3
0
Order By: Relevance
“…Preprocessing (Sergienko et al, 2017) is performed on incoming email files to transform strings of characters into a representation suitable for the classification algorithm (Figure 1, Tables I and II). Figure 1 clearly depicts the structure of a spam classifier in which an email file with header and body goes for preprocessing.…”
Section: Preprocessingmentioning
confidence: 99%
“…Preprocessing (Sergienko et al, 2017) is performed on incoming email files to transform strings of characters into a representation suitable for the classification algorithm (Figure 1, Tables I and II). Figure 1 clearly depicts the structure of a spam classifier in which an email file with header and body goes for preprocessing.…”
Section: Preprocessingmentioning
confidence: 99%
“…In text categorization and sentiment analysis, Support Vector Machine is often considered as the best classifier providing the greatest performances for those tasks [25]. It"s among the class of classifiers based on kernel substitution [26].In this work, the version Sequential Minimal Optimization (SMO) developed in [27] is used.…”
Section: A Experimental Setup 1) Preprocessingmentioning
confidence: 99%
“…Tokenization process, basically practical is by recognizing the token and their check. Tokenization is a strategy of unmistakable verification of token/subjects inside data compositions and it serves to decreased interest with an immense degree [13]. In a present-day time of data/information, when data/information is broadening complex on reliably from its beginning stage, in a kind of compositions, site pages, etc., so the noteworthiness of effective and profitable tokenization count wraps up perceptibly essential for an IR system [14].…”
mentioning
confidence: 99%