On Spammer Detection in Crowdsourcing Pairwise Comparison Tasks: Case Study on Two Multimedia QoE Assessment Scenarios

Ak, Ali; Abid, Mona; Perreira Da Silva, Matthieu; Le Callet, Patrick

doi:10.1109/icmew53276.2021.9455992

Cited by 5 publications

(8 citation statements)

References 10 publications

“…for ACR or Double Stimulus Impairment Scale (DSIS) scores. In case of binary answers, such as the expression of preference during a Pair Comparison (PC) test, recent works [16,17] favored the use of dissimilarity metrics (Rogers-Tanimoto (RT) dissimilarity [18], Cohen's Kappa Coefficient [19]) to estimate inter-observer agreement.…”

Section: Outlier Detection Methods (mentioning)

confidence: 99%

When is the Cleaning of Subjective Data Relevant to Train UGC Video Quality Metrics?

Perrin, Dormeval, Wang, et al. 2022

2022 IEEE International Conference on Image Processing (ICIP)

Outlier analysis and spammer detection recently gained momentum in order to reduce uncertainty of subjective ratings in image & video quality assessment tasks. The large proportion of unreliable ratings from online crowdsourcing experiments and the need for qualitative and quantitative large-scale studies in the deep-learning ecosystem played a role in this event. We study the effect that data cleaning has on trainable models predicting the visual quality for videos, and present results demonstrating when cleaning is necessary to reach higher efficiency. To this end, we present and analyze a benchmark on clean and noisy User Generated Content (UGC) large-scale datasets on which we re-trained models, followed by an empirical exploration of the constraint of data removal. Our results show that a dataset presenting between 7 and 30% of outliers benefits from cleaning before training.

“…An early example of subjective IQA on crowdsourcing shows promise by comparing crowdsourcing and laboratory experiment results [12]. Recent works raise concerns on the effects of QoE tasks on crowdsourcing subjective experiments [13]. LIVE In the Wild [14] IQA dataset consists of over 350000 opinion scores on 1162 images.…”

Section: Related Work (mentioning)

confidence: 99%

“…Although several methodologies have been proposed for interobserver agreement and outlier detection in rating experiments [29], [30], there are not many well-established methodologies for ranking experiments. Ak et al [13] showed that inter-observer agreement in pairwise comparison experiments can be measured based on Rogers-Tanimoto (RT) dissimilarity measure. A similar variation to such metric, known as Jaccard index [31], has been developed by Paul Jaccard.…”

Section: Inter-observer Agreement (mentioning)

confidence: 99%
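
The citation statement above refers to measuring inter-observer agreement in pairwise comparison experiments with the Rogers-Tanimoto (RT) dissimilarity. As a rough illustration only, the sketch below computes the standard RT dissimilarity between two observers' binary preference vectors and averages it per observer. It is not the spammer-detection procedure of Ak et al.; the vote encoding (1 = first stimulus preferred, 0 = second), the observer names, and the helper `mean_dissimilarity` are assumptions made for this example.

```python
from itertools import combinations
from typing import Dict, List, Sequence


def rogers_tanimoto(u: Sequence[int], v: Sequence[int]) -> float:
    """Rogers-Tanimoto dissimilarity between two equal-length binary vectors.

    With d = number of positions where the votes differ and a = number of
    positions where they agree, the dissimilarity is 2d / (a + 2d).
    """
    if len(u) != len(v):
        raise ValueError("vote vectors must have the same length")
    if len(u) == 0:
        raise ValueError("vote vectors must not be empty")
    disagreements = sum(1 for a, b in zip(u, v) if bool(a) != bool(b))
    agreements = len(u) - disagreements
    return 2 * disagreements / (agreements + 2 * disagreements)


def mean_dissimilarity(votes: Dict[str, List[int]]) -> Dict[str, float]:
    """Average RT dissimilarity of each observer to every other observer.

    Hypothetical helper: a high average suggests an observer disagrees with
    the rest of the panel, but no decision threshold is implied here.
    """
    if len(votes) < 2:
        raise ValueError("need at least two observers")
    totals = {name: 0.0 for name in votes}
    for a, b in combinations(votes, 2):
        d = rogers_tanimoto(votes[a], votes[b])
        totals[a] += d
        totals[b] += d
    return {name: total / (len(votes) - 1) for name, total in totals.items()}


# Hypothetical votes of three observers on the same four pairs.
example_votes = {
    "obs_a": [1, 0, 1, 1],
    "obs_b": [1, 0, 1, 1],
    "obs_c": [0, 1, 0, 0],
}
example_scores = mean_dissimilarity(example_votes)
```

In this toy data, `obs_c` votes opposite to the other two on every pair and therefore gets the highest average dissimilarity. How such a score would be turned into an accept or reject decision is left open here.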

RV-TMO: Large-Scale Dataset for Subjective Quality Assessment of Tone Mapped Images

Goswami, Hauser, et al. 2023

IEEE Trans. Multimedia

Self Cite

“…The main characteristics of these datasets are presented in Table I. 1) Exp-TMO Dataset: The Exp-TMO dataset has been recently published in [26]. This dataset contains originally 20 HDR sources processed with 4 different TMOs (see as Hypothetical Reference Circuits HRC) leading to a total of 120 pairs of tone mapped stimuli used in our experiment.…”

Section: A Datasets (mentioning)

confidence: 99%

A machine-learning framework to predict TMO preference based on image and visual attention features

Ellahi, Vigier, Callet 2021

2021 IEEE 23rd International Workshop on Multimedia Signal Processing (MMSP)

Self Cite
