Exploring Outliers in Crowdsourced Ranking for QoE
Qianqian Xu^1, Ming Yan^2, Chendi Huang^3, Jiechao Xiong^{3,4}, Qingming Huang^{5,6,7}, Yuan Yao^{8∗}
^1 State Key Laboratory of Information Security (SKLOIS), Institute of Information Engineering, CAS, Beijing, 100093, China
^2 Department of Computational Mathematics, Science and Engineering and Department of Mathematics, Michigan State University, East Lansing, MI, 48824, USA
^3 BICMR-LMAM-LMEQF-LMP, School of Mathematical Sciences, Peking University, Beijing, 100871, China
^4 Tencent AI Lab, Shenzhen, 518057, China
^5 University of Chinese Academy of Sciences, Beijing, 100049, China
^6 Key Lab of Intell. Info. Process., Inst. of Comput. Tech., CAS, Beijing, 100190, China
^7 Key Lab of Big Data Mining and Knowledge Management, CAS, Beijing, 100190, China
^8 Department of Mathematics, Hong Kong University of Science and Technology, Hong Kong
xuqianqian@iie.ac.cn, yanm@math.msu.edu, cdhuang@pku.edu.cn, jcxiong@tencent.com, qmhuang@ucas.ac.cn, yuany@ust.hk
ABSTRACT
Outlier detection is a crucial part of robust evaluation for crowdsourceable assessment of Quality of Experience (QoE) and has attracted much attention in recent years. In this paper, we propose simple and fast algorithms for outlier detection and robust QoE evaluation based on the principle of nonconvex optimization. Several iterative procedures are designed that operate with or without knowledge of the number of outliers in the samples. Theoretical analysis shows that such procedures can reach statistically good estimates under mild conditions. Finally, experimental results on simulated and real-world crowdsourcing datasets show that the proposed algorithms produce performance comparable to the Huber-LASSO approach in robust ranking, yet with nearly an 8-fold or 90-fold speed-up, without or with prior knowledge of the sparsity size of outliers, respectively. The proposed methodology therefore provides a set of helpful tools for robust QoE evaluation with crowdsourcing data.
CCS CONCEPTS
• Information systems → Data cleaning; Rank aggregation;
KEYWORDS
HodgeRank; Outlier Detection; $\ell_0$-regularization; Iterative Hard Thresholding; Iterative Least Trimmed Squares; Adaptive Algorithms
∗ Corresponding author.
Permission to make digital or hard copies of all or part of this work
for personal or classroom use is granted without fee provided that
copies are not made or distributed for profit or commercial advantage
and that copies bear this notice and the full citation on the first page.
Copyrights for components of this work owned by others than ACM
must be honored. Abstracting with credit is permitted. To copy
otherwise, or republish, to post on servers or to redistribute to lists,
requires prior specific permission and/or a fee. Request permissions
from permissions@acm.org.
MM’17, October 23–27, 2017, Mountain View, CA, USA.
© 2017 ACM. ISBN 978-1-4503-4906-2/17/10...$15.00
DOI: https://doi.org/10.1145/3123266.3123267
1 INTRODUCTION
In recent years, Quality of Experience (QoE) [20, 23] has become a major research theme within the multimedia community. QoE measures a user's subjective expectation, feeling, perception, and satisfaction with respect to multimedia content. Measuring and ensuring good QoE of multimedia content is highly subjective in nature.
A variety of approaches can be employed to conduct subjective tests, among which Mean Opinion Score (MOS) [1] and paired comparison are the two most popular. In the MOS test, individuals are asked to grade the quality of a stimulus by specifying a rating from "Bad" to "Excellent" (e.g., Bad-1, Poor-2, Fair-3, Good-4, and Excellent-5); in the paired comparison approach, raters are instead asked to make intuitive comparative judgments rather than mapping their perception onto a categorical or numerical scale. Between these approaches there is a tradeoff in the amount of information a preference label contains and the bias associated with obtaining it. For example, while a graded relevance judgment on a five-point scale may contain more information than a binary judgment, raters may also make more errors due to the complexity of assigning finer-grained judgments.
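To make the two protocols concrete, the following minimal Python sketch contrasts how the two kinds of labels are typically encoded and naively aggregated; the toy data and variable names are our own illustration, not part of this paper's experiments or datasets.

```python
# Illustrative sketch with assumed toy data: MOS vs. paired-comparison labels.
import numpy as np

# MOS: each rater maps perception onto a 5-point scale
# (rows = raters, columns = items; entries in {1,...,5}, Bad=1 ... Excellent=5).
mos_ratings = np.array([[4, 2, 5],
                        [3, 1, 4],
                        [5, 2, 4]])
mos_scores = mos_ratings.mean(axis=0)      # naive aggregate: mean opinion score per item

# Paired comparison: each record is a binary judgment (winner, loser).
comparisons = [(0, 1), (2, 1), (2, 0), (0, 1)]   # e.g., item 0 preferred over item 1
wins = np.zeros(3)
for winner, _loser in comparisons:
    wins[winner] += 1                      # naive aggregate: win count per item

print("MOS per item: ", mos_scores)        # finer-grained, but scale-dependent
print("wins per item:", wins)              # coarser, but free of personal scale bias
```

More principled aggregation of paired comparisons, such as HodgeRank, replaces the naive win count with a globally consistent ranking score.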
It is shown in [5] that MOS may suffer from three fundamental problems: (i) it is unable to concretely define the concept of scale; (ii) the interpretations of the scale differ substantially among raters; (iii) it is difficult to verify whether a rater gives false ratings, either intentionally or carelessly. The paired comparison method is therefore gaining growing attention. It not only promises assessments that are easier and faster to obtain, with a less demanding task for raters, but also yields more reliable data with less personal scale bias in practice. However, a shortcoming of paired comparison is its higher sampling complexity compared with the MOS test, since the number of pairs grows quadratically with the number of items to be ranked.
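To quantify this quadratic growth: a complete paired-comparison design over $n$ items requires

\[
\binom{n}{2} = \frac{n(n-1)}{2}
\]

distinct pairs, so, for example, $n = 20$ items already call for 190 comparisons per rater, whereas an MOS test needs only 20 ratings.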
To tackle the cost problem, with the growth of crowdsourcing platforms such as MTurk, InnoCentive, CrowdFlower, CrowdRank, and AllOurIdeas, researchers who wish to