Abstract:The subjective tests used to evaluate image and video quality estimators (QEs) are expensive and time consuming. More problematic, the majority of subjective testing is not designed to find systematic weaknesses in the evaluated QEs. As a result, a motivated attacker can take advantage of these systematic weaknesses to gain unfair monetary advantage. In this paper, we draw on some lessons of software testing to propose additional testing procedures that target a specific QE under test. These procedures supplem… Show more
“…The strategies in [10,11] can only identify potential false ties. Various strategies in [6] create either potential FTs or potential FOs. The goal of this paper is to create as many False Orderings as possible.…”
Section: Challenges Of Subjective Test Designmentioning
confidence: 99%
“…The boundaries of the regions W o , E o and B o are easy to identify using the QE we are testing; it is straightforward to select an image that has a potential FT [10,11,6]. However, a FT does not provide much information about the severity of a systematic weakness; it only informs us that one exists.…”
Section: Challenges Of Identifying Fosmentioning
confidence: 99%
“…In [6], we introduce the notion of using one more accurate QE as a proxy for subjective quality when designing test pairs for a second less accurate QE. Occasionally the proxy QE identifies its own weakness.…”
Section: Proxies For Subjective Qualitymentioning
confidence: 99%
“…The following principles lie at the core of software testing [5] and are directly applicable to testing image and video QEs [6]. The goal of testing should be to find errors, not demonstrate that the system satisfies its specifications.…”
Section: Introductionmentioning
confidence: 99%
“…In [6], we apply these principles to systematically create targeted, small-scale subjective tests. The test cases are com-prised of image pairs designed to have a high probability of creating potential misclassification errors.…”
We present an automated algorithm to design subjective tests that have a high likelihood of finding misclassification errors in many image quality estimators (QEs). In our algorithm, a collection of existing QEs collaboratively determine the best pairs of images that will test the accuracy of each individual QE. We demonstrate that the resulting subjective test provides valuable information regarding the accuracy of the cooperating QEs. The proposed strategy is particularly useful for comparing efficacy of QEs across multiple distortion types and multiple reference images.
“…The strategies in [10,11] can only identify potential false ties. Various strategies in [6] create either potential FTs or potential FOs. The goal of this paper is to create as many False Orderings as possible.…”
Section: Challenges Of Subjective Test Designmentioning
confidence: 99%
“…The boundaries of the regions W o , E o and B o are easy to identify using the QE we are testing; it is straightforward to select an image that has a potential FT [10,11,6]. However, a FT does not provide much information about the severity of a systematic weakness; it only informs us that one exists.…”
Section: Challenges Of Identifying Fosmentioning
confidence: 99%
“…In [6], we introduce the notion of using one more accurate QE as a proxy for subjective quality when designing test pairs for a second less accurate QE. Occasionally the proxy QE identifies its own weakness.…”
Section: Proxies For Subjective Qualitymentioning
confidence: 99%
“…The following principles lie at the core of software testing [5] and are directly applicable to testing image and video QEs [6]. The goal of testing should be to find errors, not demonstrate that the system satisfies its specifications.…”
Section: Introductionmentioning
confidence: 99%
“…In [6], we apply these principles to systematically create targeted, small-scale subjective tests. The test cases are com-prised of image pairs designed to have a high probability of creating potential misclassification errors.…”
We present an automated algorithm to design subjective tests that have a high likelihood of finding misclassification errors in many image quality estimators (QEs). In our algorithm, a collection of existing QEs collaboratively determine the best pairs of images that will test the accuracy of each individual QE. We demonstrate that the resulting subjective test provides valuable information regarding the accuracy of the cooperating QEs. The proposed strategy is particularly useful for comparing efficacy of QEs across multiple distortion types and multiple reference images.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.