2019
DOI: 10.31235/osf.io/2rzsg
Preprint

Pull out all the stops: Textual analysis via punctuation sequences

Abstract: "I'm tired of wasting letters when punctuation will do, period." -Steve Martin, Twitter, 2011

Whether enjoying the lucid prose of a favorite author or slogging through some other writer's cumbersome, heavy-set prattle (full of parentheses, em-dashes, compound adjectives, and Oxford commas), readers will notice stylistic signatures not only in word choice and grammar, but also in punctuation itself. Indeed, visual sequences of punctuation from different authors produce marvelously different (and visually striking…
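As a rough illustration of what a "punctuation sequence" could mean in practice, the sketch below strips a text down to its punctuation marks, in order. It is only a minimal example under stated assumptions: the function name `punctuation_sequence` and the set of marks in `PUNCTUATION` are hypothetical choices made here, not the preprint's actual method or mark inventory.

```python
import string

# Assumption: ASCII punctuation plus a few common typographic marks
# (em-dash, en-dash, curly quotes, ellipsis). The preprint's own set of
# marks is not stated in this excerpt.
PUNCTUATION = set(string.punctuation) | {
    "\u2014", "\u2013", "\u201c", "\u201d", "\u2018", "\u2019", "\u2026"
}

def punctuation_sequence(text):
    """Return the punctuation marks of `text` in order, dropping all other characters."""
    return [ch for ch in text if ch in PUNCTUATION]

sample = "Whether enjoying prose (or slogging through it), readers notice style."
print("".join(punctuation_sequence(sample)))  # prints "(),."
```

Note that this naive filter also counts apostrophes inside contractions and hyphens inside compound words; whether those should count as punctuation is a design choice the sketch leaves open.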

Cited by 3 publications
(2 citation statements)
References 30 publications
“…Text tokenisation: Each sentence is transformed into a sequence of tokens, i.e., words, punctuation and special characters (e.g., emojis). Despite punctuation being relevant in some cases, e.g., writing style analysis [85], we discard it here and focus only on words and other tokens with emotional content like emojis.…”
Section: Text Analysis In Emoatlas (mentioning)
confidence: 99%
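The tokenisation step described in the statement above (split into words, punctuation and special characters, then discard punctuation) might look like the following sketch. This is not the citing paper's code: the names `tokenise`, `is_punctuation` and `content_tokens` are hypothetical, and the regular expression is a simplifying assumption that treats every non-word, non-space character as its own token.

```python
import re
import unicodedata

# Assumption: a token is either a run of word characters or a single
# non-word, non-space character (punctuation mark, emoji, other symbol).
TOKEN_RE = re.compile(r"\w+|[^\w\s]")

def tokenise(sentence):
    """Split a sentence into word, punctuation and symbol tokens."""
    return TOKEN_RE.findall(sentence)

def is_punctuation(token):
    """True if every character belongs to a Unicode punctuation category (P*)."""
    return all(unicodedata.category(ch).startswith("P") for ch in token)

def content_tokens(sentence):
    """Keep words and symbols such as emojis; drop punctuation tokens."""
    return [tok for tok in tokenise(sentence) if not is_punctuation(tok)]

print(content_tokens("I love it, really! \U0001F600"))
```

Emojis built from several code points (for example with skin-tone modifiers) would be split into separate tokens by this simple pattern; a dedicated emoji library would be needed to handle them as single units.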
“…In order to calculate the influence of word frequency on vocabulary difficulty, this paper uses the Gutenberg plan, which counts the words of all electronic publications with a large vocabulary and is more persuasive for the provided word frequency, as the source of word frequency [17]. Download and save the word frequencies of all words in the Gutenberg plan.…”
Section: Classification Of Vocabulary Difficulty Level (mentioning)
confidence: 99%