2015
DOI: 10.1080/09296174.2015.1037159
|View full text |Cite
|
Sign up to set email alerts
|

Classification of Literary Style that takes Order into Consideration

Abstract: The statistical analysis of the heterogeneity of the style of a text often leads to the analysis of contingency tables of ordered rows. When multiple authorship is suspected, one can explore that heterogeneity through either a change-point analysis of these rows, consistent with sudden changes of author, or a cluster analysis of them, consistent with authors contributing exchangeably, without taking order into consideration. Here an analysis is proposed that strikes a compromise between change-point and cluste… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
4
0

Year Published

2016
2016
2018
2018

Publication Types

Select...
2
1

Relationship

2
1

Authors

Journals

citations
Cited by 3 publications
(5 citation statements)
references
References 31 publications
0
4
0
Order By: Relevance
“…The second extension can be modeled by letting the weights in the mixture distribution for the random effects of the i ‐th row, ν i = ( ν i 2 , …, ν i J ), vary from row to row, false(ν1,,νnfalse)false|ω,σ2,ktruei=1nfalse(false(1wifalse)truej=2J.2emNormalfalse(0,σ2false)1em+witruej=2J.2emNormalfalse(0,k2σ2false)false), where ω = ( ω 1 ,…, ω n ). This idea was proposed by Fernandez and Green for Poisson mixtures for spatial data, and it was used by Puig et al for multinomial cluster analysis. As a consequence of , the probability that the i th row is allocated to the outlier group, ω i , will change from row to row, and the set of allocation variables, ζ = ( ζ 1 , …, ζ n ), will not be identically distributed because π ( ζ i = 1| ω ) = ω i .…”
Section: Outlier Detection Model For the Rows Of A Tablementioning
confidence: 99%
See 2 more Smart Citations
“…The second extension can be modeled by letting the weights in the mixture distribution for the random effects of the i ‐th row, ν i = ( ν i 2 , …, ν i J ), vary from row to row, false(ν1,,νnfalse)false|ω,σ2,ktruei=1nfalse(false(1wifalse)truej=2J.2emNormalfalse(0,σ2false)1em+witruej=2J.2emNormalfalse(0,k2σ2false)false), where ω = ( ω 1 ,…, ω n ). This idea was proposed by Fernandez and Green for Poisson mixtures for spatial data, and it was used by Puig et al for multinomial cluster analysis. As a consequence of , the probability that the i th row is allocated to the outlier group, ω i , will change from row to row, and the set of allocation variables, ζ = ( ζ 1 , …, ζ n ), will not be identically distributed because π ( ζ i = 1| ω ) = ω i .…”
Section: Outlier Detection Model For the Rows Of A Tablementioning
confidence: 99%
“…where ω = (ω 1 ,…,ω n ). This idea was proposed by Fernandez and Green 13 for Poisson mixtures for spatial data, and it was used by Puig et al 14 for multinomial cluster analysis. As a consequence of (2.5), the probability that the ith row is allocated to the outlier group, ω i , will change from row to row, and the set of allocation variables, ζ = (ζ 1 , …, ζ n ), will not be identically distributed because π(ζ i = 1|ω) = ω i .…”
Section: Outlier Detection Model For a Structured Tablementioning
confidence: 99%
See 1 more Smart Citation
“…Puig, Font & Ginebra (2015) propose an alternative analysis that treats the two stylometric variables separately, but incorporates the fact that chapters close together are more likely to belong to the same author than chapters that are far apart. In this case, the results of the analysis are similar.…”
Section: Case Study 2: Tirant Lo Blancmentioning
confidence: 99%
“…Finally, note that different from the previous case study, in this example texts (chapters) are ordered sequentially, and that order is not taken into consideration in the cluster analysis model used here. Puig, Font and Ginebra (2014) proposes an alternative analysis that treats the two stylometric variables separately, but incorporates the fact that chapters close together are more likely to belong to the same author than chapters that are far apart. In that way, one strikes a compromise between change-point analysis, as-suming all neighboring chapters to belong to the same cluster except the boundary ones, and the kind of cluster analysis considered here, that treat all chapters exchangeably, as if order did not matter whatsoever.…”
Section: Case Study 2: Tirant Lo Blancmentioning
confidence: 99%