2015
DOI: 10.1088/1742-5468/2015/03/p03005
|View full text |Cite
|
Sign up to set email alerts
|

Authorship recognition via fluctuation analysis of network topology and word intermittency

Abstract: Abstract. Statistical methods have been widely employed in many practicalnatural language processing applications. More specifically, complex network concepts and methods from dynamical systems theory have been successfully applied to recognize stylistic patterns in written texts. Despite the large number of studies devoted to representing texts with physical models, only a few studies have assessed the relevance of attributes derived from the analysis of stylistic fluctuations. Because fluctuations represent … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

3
34
0
5

Year Published

2015
2015
2023
2023

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 42 publications
(42 citation statements)
references
References 70 publications
(135 reference statements)
3
34
0
5
Order By: Relevance
“…This result is consistent with similar studies showing that the topology plays a relevant role in characterising complex systems, especially those conveying information [2,12,37,44]. Because the proposed representation is complementary to the traditional approaches, we advocate that the combination of features of distinct nature (traditional and topological) could lead to the improvement of similar tasks relying on the accurate characterisation of stylistic marks.…”
Section: Comparison With Traditional Methodssupporting
confidence: 89%
See 1 more Smart Citation
“…This result is consistent with similar studies showing that the topology plays a relevant role in characterising complex systems, especially those conveying information [2,12,37,44]. Because the proposed representation is complementary to the traditional approaches, we advocate that the combination of features of distinct nature (traditional and topological) could lead to the improvement of similar tasks relying on the accurate characterisation of stylistic marks.…”
Section: Comparison With Traditional Methodssupporting
confidence: 89%
“…According to equation 3, the betweenness centrality can be interpreted as the network flow [39][40][41], which is a relevant quantity for the analysis of robustness of power-grid networks [42,43]. When applied to the analysis of text networks, this measurement has been interpreted as being useful to quantify the generality of words in which the word appears [44], which is in part motivated by the use of this measurement in community detection methods [45]. Unlike the clustering coefficient, the betweenness centrality uses the global connectivity information to quantify the specificity/generality of concepts [44].…”
Section: Complex Network Measurementsmentioning
confidence: 99%
“…[With Suju Takano (1893-1976 and Seiho Awano (1899-1992) the four poets are referred to as "Four S of Little Cuckoo" or more simply "Four S, " due to the common initial in their pen names.] Finally, it should be mentioned that similarly to the artful fluctuation analysis of network topology and word intermittency [45] the present statistical method might be The distribution of sounds for the 203 arrangements is /u/: 30457, /o/: 55912, /a/: 59299, /e/: 11352, /i/: 42353, and /n/: 2779.…”
Section: Comparing Pattern Distributionsmentioning
confidence: 99%
“…removed stop words from a text to work only with semantically meaningful words, which they further mapped to their singular and infinitive forms before linking adjacent words to form a word-to-word network. They then combined network features with linguistic features such as word frequency, intermittency, n -grams, and used machine learning techniques to identify the authors of various texts with reasonable accuracy [28, 29]. …”
Section: Complex-network Approaches To the Study Of Languagesmentioning
confidence: 99%