2015
DOI: 10.1371/journal.pone.0114825
|View full text |Cite
|
Sign up to set email alerts
|

Interactions of Cultures and Top People of Wikipedia from Ranking of 24 Language Editions

Abstract: Wikipedia is a huge global repository of human knowledge that can be leveraged to investigate interwinements between cultures. With this aim, we apply methods of Markov chains and Google matrix for the analysis of the hyperlink networks of 24 Wikipedia language editions, and rank all their articles by PageRank, 2DRank and CheiRank algorithms. Using automatic extraction of people names, we obtain the top 100 historical figures, for each edition and for each algorithm. We investigate their spatial, temporal, and… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

8
94
1

Year Published

2015
2015
2020
2020

Publication Types

Select...
5
3

Relationship

2
6

Authors

Journals

citations
Cited by 73 publications
(103 citation statements)
references
References 31 publications
8
94
1
Order By: Relevance
“…The fact that, as stated above, the English Wikipedia has more articles and these tend to be longer, and arguably because English is commonly perceived as a universal language, affects the perception of this particular edition as being the best both in terms of reliability and coverage of topics. Despite this preference for English, there is no indication in our results that this is due to perceiving the smaller language editions to be more likely biased, as suggested by some literature (Pfeil et al, 2006;Massa and Scrinzi, 2013;Eom et al, 2015), but it rather seems to be related to the extension and completion in the more widely used language.…”
Section: Discussioncontrasting
confidence: 70%
See 1 more Smart Citation
“…The fact that, as stated above, the English Wikipedia has more articles and these tend to be longer, and arguably because English is commonly perceived as a universal language, affects the perception of this particular edition as being the best both in terms of reliability and coverage of topics. Despite this preference for English, there is no indication in our results that this is due to perceiving the smaller language editions to be more likely biased, as suggested by some literature (Pfeil et al, 2006;Massa and Scrinzi, 2013;Eom et al, 2015), but it rather seems to be related to the extension and completion in the more widely used language.…”
Section: Discussioncontrasting
confidence: 70%
“…For a variety of reasons, there are significant differences in coverage, approaches and even internal policies among the different editions, which impact on how one can participate in the writing of articles. Additionally, there are the possible biases, which have been identified in the literature as being more likely to occur in smaller language communities, as it is expected that they would have a smaller group of people involved in the curation process (Pfeil et al, 2006;Massa and Scrinzi, 2013;Eom et al, 2015).…”
Section: Introductionmentioning
confidence: 99%
“…As such, G qrnd represents indirect (hidden) links between the N r nodes appearing via the global network. We note that certain matrix elements of G qr can be negative, which is possible due to the negative terms in Q c = 1 − P c appearing in (13). The total weight of negative elements is however much smaller than W qr (at least 6 times smaller and even non-existing in ArWiki).…”
Section: Decomposition Of G Rmentioning
confidence: 95%
“…At present directed networks of real systems can be very large (about 4.2 million articles for the English Wikipedia edition in 2013 [13] or 3.5 billion web pages (called also nodes) for a publicly accessible web crawl that was gathered by the Common Crawl Foundation in 2012 [18]). For some studies, one might be interested only in the particular interactions between a very small subset of nodes compared to the full network size.…”
Section: Introductionmentioning
confidence: 99%
“…These codes had been developed in the frame of EC FET Open NADINE project (2012)(2013)(2014)(2015) [22] and used for Wikipedia (2013) data in [4]. This work was granted access to the HPC resources of CALMIP (Toulouse) under the allocation 2017-P0110.…”
Section: Acknowledgmentsmentioning
confidence: 99%