2011
DOI: 10.1007/978-3-642-20161-5_31
|View full text |Cite
|
Sign up to set email alerts
|

Classifying with Co-stems

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2016
2016
2021
2021

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 17 publications
0
3
0
Order By: Relevance
“…k-top co-stems. In prior work, the ends of words (e.g., conjugations, plurals, and possessive marks) were shown to give notable signal about a variety of blog author attributes (Lipka and Stein 2011). These strings, called co-stems, can be obtained by subtracting the stem returned by the Lovins stemmer and processing only the ending that remains (i.e., the word minus the stem).…”
Section: Featuresmentioning
confidence: 99%
“…k-top co-stems. In prior work, the ends of words (e.g., conjugations, plurals, and possessive marks) were shown to give notable signal about a variety of blog author attributes (Lipka and Stein 2011). These strings, called co-stems, can be obtained by subtracting the stem returned by the Lovins stemmer and processing only the ending that remains (i.e., the word minus the stem).…”
Section: Featuresmentioning
confidence: 99%
“…A similar procedure was used to select the k mostbiased stemmed tokens and hashtags for each class. Finally, we include the k most-biased co-stems, which proved useful for inferring author attributes (Lipka and Stein 2011). The same value of k=13 was used for all four feature types after initial tuning showed it provided good performance; higher values of k did not noticeably improve performance.…”
Section: Methodsmentioning
confidence: 99%
“…They consist feature male female words team, league, bro hair, omg, cute stems scor, win, team girl, bab, feel co-stems s, e, ed n, i, a digrams in, er, re ee, ah, aa trigrams ing, ion, ent aha, eee, aaa hashtags soundcloud, mufc, np sorrynotsorry, excited, love of the k-top most discriminating words, k-top word stems 9 (e.g., "paper" representing "paper", "papers", or "papered"). k-top co-stems (Lipka and Stein [13] demonstrate that the stem-reduced words, or co-stems, yield a significant improvement over classical bag of words models; for example, "s" for "papers"), k-top digrams (the most discriminating digrams; for example, "pa", "ap", "pe", and "er"), k-top trigrams (the most discriminating trigrams; for example, "pap", "ape", and "per"), and k-top hashtags (hashtags are labels that are attached by the users to their Tweet messages; for example, "paper" for "#paper").…”
Section: Threshold Classifiermentioning
confidence: 99%