How to Benchmark Objective Quality Metrics from Paired Comparison Data?

The procedures commonly used to evaluate objective quality metrics rely on ground truth mean opinion scores and their associated confidence intervals, which are usually obtained via direct scaling methods. However, indirect scaling methods, such as paired comparison, can also be used to collect ground truth preference scores. Indirect scaling methods have higher discriminatory power and are gaining popularity, for example in crowdsourcing evaluations. In this paper, we show how classification errors, an existing analysis tool, can also be used with subjective preference scores. Additionally, we propose a new analysis tool based on receiver operating characteristic (ROC) analysis, which can be used to further assess the performance of objective metrics against ground truth preference scores. We provide a MATLAB script implementing the proposed tools and demonstrate them on one example application.
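
To make the idea of an ROC analysis on preference scores more concrete, the following is a minimal Python sketch. It is not the MATLAB script mentioned above, and it does not claim to reproduce the paper's exact procedure. It assumes a hypothetical data layout in which each pair carries two objective metric values and the fraction of subjects who preferred the first stimulus, and it assumes that a fraction above 0.5 means "first stimulus preferred". The function names `auc` and `better_vs_worse_auc` are illustrative only.

```python
def auc(scores_pos, scores_neg):
    """Area under the ROC curve, computed as the probability that a
    positive-class score exceeds a negative-class score (ties count 0.5)."""
    if not scores_pos or not scores_neg:
        raise ValueError("both classes need at least one score")
    wins = 0.0
    for p in scores_pos:
        for n in scores_neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(scores_pos) * len(scores_neg))


def better_vs_worse_auc(pairs):
    """pairs: iterable of (metric_a, metric_b, pref_a), where pref_a is the
    fraction of subjects preferring stimulus A over B (assumed format).

    The signed metric difference (metric_a - metric_b) is used as the
    classifier score; pairs with pref_a == 0.5 are skipped as undecided."""
    pos = [ma - mb for ma, mb, p in pairs if p > 0.5]  # A preferred
    neg = [ma - mb for ma, mb, p in pairs if p < 0.5]  # B preferred
    return auc(pos, neg)


# Toy data, invented purely for illustration.
toy_pairs = [
    (0.90, 0.50, 0.8),
    (0.40, 0.70, 0.2),
    (0.60, 0.65, 0.7),
    (0.30, 0.80, 0.1),
]
toy_auc = better_vs_worse_auc(toy_pairs)
```

In this sketch, an AUC close to 1 would indicate that the sign and size of the metric difference agree well with the subjects' preferences, while a value near 0.5 would indicate no better than chance agreement. A fuller analysis would also need a statistical test to decide which pairs are significantly different before labeling them.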

Published in:
2016 Eighth International Conference on Quality of Multimedia Experience (QoMEX)
Presented at:
8th International Conference on Quality of Multimedia Experience (QoMEX), Lisbon, Portugal, June 6-8, 2016
New York, IEEE

 Record created 2016-04-30, last modified 2019-12-05