2012
DOI: 10.1007/978-3-642-31680-7_16
|View full text |Cite
|
Sign up to set email alerts
|

Use Fewer Instances of the Letter “i”: Toward Writing Style Anonymization

Abstract: Abstract. This paper presents Anonymouth, a novel framework for anonymizing writing style. Without accounting for style, anonymous authors risk identification. This framework is necessary to provide a tool for testing the consistency of anonymized writing style and a mechanism for adaptive attacks against stylometry techniques. Our framework defines the steps necessary to anonymize documents and implements them. A key contribution of this work is this framework, including novel methods for identifying which fe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
54
0
1

Year Published

2012
2012
2023
2023

Publication Types

Select...
5
2
2

Relationship

4
5

Authors

Journals

citations
Cited by 63 publications
(55 citation statements)
references
References 13 publications
0
54
0
1
Order By: Relevance
“…We experimented using a variety of features to identify the features leading to highest accuracy as measured by the true-positive rate. We utilized two authorship attribution tools, (1) JGAAP 4 developed by Juola et al [10] and (2) JStylo, a novel framework for authorship attribution that was developed by McDonald et al [11]. JGAAP is limited to analysis using one feature at a time.…”
Section: Corpus Selectionmentioning
confidence: 99%
“…We experimented using a variety of features to identify the features leading to highest accuracy as measured by the true-positive rate. We utilized two authorship attribution tools, (1) JGAAP 4 developed by Juola et al [10] and (2) JStylo, a novel framework for authorship attribution that was developed by McDonald et al [11]. JGAAP is limited to analysis using one feature at a time.…”
Section: Corpus Selectionmentioning
confidence: 99%
“…An early work in this area by Kacmarcik et al [22] applies obfuscation by modifying the most important stylometric features of the text to reduce the effectiveness of author attribution. This approach was used in Anonymouth [35], a semi-automated tool that provides feedback to authors on which features to modify to effectively anonymise their texts. A similar approach was also followed by Karadhov et al [23] as part of the PAN@Clef 2017 task.…”
Section: Related Workmentioning
confidence: 99%
“…See [14] for a thorough summary of stylometric studies in each of these three problem domains along with their study parameters and the resulting accuracy. These studies traditionally use large sets of features (see Table II in [15]) in combination with support vector machines (SVMs) that have proven to be effective in high dimensional feature space [16], even in cases when the number of features exceeds the number of samples. Nevertheless, with these approaches, often more than 500 words are required in order to achieve adequately low error rates [17].…”
Section: Stylometry Web Browsing Application Usage Locationmentioning
confidence: 99%