Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Informaion Retrieval - SIGIR 2003
DOI: 10.1145/860454.860456
|View full text |Cite
|
Sign up to set email alerts
|

A repetition based measure for verification of text collections and for text categorization

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
13
0

Year Published

2004
2004
2018
2018

Publication Types

Select...
2
2
2

Relationship

0
6

Authors

Journals

citations
Cited by 12 publications
(13 citation statements)
references
References 0 publications
0
13
0
Order By: Relevance
“…We use the same experimental setup that was used in [2] to compare the different methods. An experimental collection to test authorship attribution was formed from Reuters Corpus Volume 1 (RCV1), the extended Reuters corpus of over 800K news articles.…”
Section: Resultsmentioning
confidence: 99%
See 4 more Smart Citations
“…We use the same experimental setup that was used in [2] to compare the different methods. An experimental collection to test authorship attribution was formed from Reuters Corpus Volume 1 (RCV1), the extended Reuters corpus of over 800K news articles.…”
Section: Resultsmentioning
confidence: 99%
“…The second column lists results for the PPMD implementation (with exclusions), the third column for PPMD without exclusions, and the fourth column for C-measure. For comparison, results for several other methods can be found in [2]. The best performing methods were as follows: SVM (85.0%); RMeasure, a repetition-based approach (89.0%); and RAR, an off-the-shelf PPM-based compression method (89.4%).…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations