2013
DOI: 10.1007/978-3-642-54105-6_1
|View full text |Cite
|
Sign up to set email alerts
|

Statistical Approach for Term Weighting in Very Short Documents for Text Categorization

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(9 citation statements)
references
References 7 publications
0
9
0
Order By: Relevance
“…Due to limited input space (character limitations) and severe time-pressure under which data is collected, maintenance records are frequently much shorter than that typically considered in literature (Mukherjee and Chakraborty, 2007;Chen and Nayak, 2007). Most text-mining research is focussed on documents with more than 100 words, such as the common Reuters-21578 dataset with an average document length of 160 words (Timonen, 2012). In contrast, Chen and Nayak (2007) report records ranging from 1-50 words and even more extreme Mukherjee and Chakraborty (2007) with 5-10 words.…”
Section: Maintenance Specific Propertiesmentioning
confidence: 99%
See 4 more Smart Citations
“…Due to limited input space (character limitations) and severe time-pressure under which data is collected, maintenance records are frequently much shorter than that typically considered in literature (Mukherjee and Chakraborty, 2007;Chen and Nayak, 2007). Most text-mining research is focussed on documents with more than 100 words, such as the common Reuters-21578 dataset with an average document length of 160 words (Timonen, 2012). In contrast, Chen and Nayak (2007) report records ranging from 1-50 words and even more extreme Mukherjee and Chakraborty (2007) with 5-10 words.…”
Section: Maintenance Specific Propertiesmentioning
confidence: 99%
“…Until recently, Twitter messages were limited to 140 characters with an average length of 34 characters per tweet (Perez, 2018), making it much shorter than more typical text-mining corpora. The growing impact of these social networking sites has made the classification of such short-form corpora an increasingly important research topic (Timonen, 2012;Bermingham and Smeaton, 2010).…”
Section: Jqme 293mentioning
confidence: 99%
See 3 more Smart Citations