2016 IEEE 2nd International Conference on Collaboration and Internet Computing (CIC) 2016
DOI: 10.1109/cic.2016.044
|View full text |Cite
|
Sign up to set email alerts
|

Measuring Quality of Collaboratively Edited Documents: The Case of Wikipedia

Abstract: Abstract-Wikipedia is a great example of large scale collaboration, where people from all over the world together build the largest and maybe the most important human knowledge repository in the history. However, a number of studies showed that the quality of Wikipedia articles is not equally distributed. While many articles are of good quality, many others need to be improved. Assessing the quality of Wikipedia articles is very important for guiding readers towards articles of high quality and suggesting auth… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
50
0

Year Published

2017
2017
2021
2021

Publication Types

Select...
3
2
1
1

Relationship

2
5

Authors

Journals

citations
Cited by 39 publications
(52 citation statements)
references
References 48 publications
2
50
0
Order By: Relevance
“…Another form of bias is the presentation by Wikipedia authors of information regarding members of their own national group in a more positive way than information about other groups (ingroup bias) [15]. In line with previous research [16,17], we hypothesize that a certain proportion of such distortions is attributable to a lack of experience on the side of the respective article's contributors.…”
Section: Wikipedia As a Resourcesupporting
confidence: 68%
See 1 more Smart Citation
“…Another form of bias is the presentation by Wikipedia authors of information regarding members of their own national group in a more positive way than information about other groups (ingroup bias) [15]. In line with previous research [16,17], we hypothesize that a certain proportion of such distortions is attributable to a lack of experience on the side of the respective article's contributors.…”
Section: Wikipedia As a Resourcesupporting
confidence: 68%
“…These tags were neutral point of view policy violation (42 cases in our sample), contradictory content (13), unbalanced content (12), confusing content (17), and inaccurate content (23). Out of all the sampled articles, 99 (0.53%) articles had received at some point at least one of the aforementioned tags.…”
Section: Methodsmentioning
confidence: 99%
“…Deep learning approaches to predicting Wikipedia article quality have also been proposed. For example, Dang and Ignat (2016b) use a version of doc2vec (Le and Mikolov 2014) to represent articles, and feed the document embeddings into a four hidden layer neural network. Shen, Qi, and Baldwin (2017) first obtain sentence representations by averaging words within a sentence, and then apply a biLSTM (Hochreiter and Schmidhuber 1997) to learn a documentlevel representation, which is combined with hand-crafted features as side information.…”
Section: Related Workmentioning
confidence: 99%
“…Table 11 shows that the structure feature outperforms the other feature sets; its precision is even better than the model with all of feature sets by 3.6%. Compared with the models in existing research (Dang & Ignat, 2016a;Dalip & Cristo, 2017), stacked LSTMs with structure features demonstrate considerably better performance. Intuitively, a well-organized article tends to have high quality.…”
Section: Feature Set Importance Analysismentioning
confidence: 81%
“…Some methods are not automatic and require excessive human effort. Thus, machine-learning models, such as support vector regression (SVR) or k-nearest neighbor (KNN), have been adopted (Bykau, Korn, Srivastava, & Velegrakis, 2015;Dalip & Cristo, 2017;Dalip, Lima, Gonçalves, Cristo, & Calado, 2014;Dang & Ignat, 2016a;Ganjisaffar, Javanmardi, & Lopes, 2009;Kapugama, Lorensuhewa, & Kalyani, 2017;Sinanc & Yavanoglu, 2013). Moreover, existing methods fail to implement a comprehensive feature framework.…”
Section: Introductionmentioning
confidence: 99%