Models Versus Satisfaction

Fan Zhang,Jiaxin Mao,Yiqun Liu,Shaoping Ma,Xiaohui Xie,Weizhi Ma,Min Zhang

doi:10.1145/3397271.3401162

Abstract

Evaluation metrics play an important role in the batch evaluation of IR systems. Based on a user model that describes how users interact with the rank list, an evaluation metric is defined to link the relevance scores of a list of documents to an estimation of system effectiveness and user satisfaction. Therefore, the validity of an evaluation metric has two facets: whether the underlying user model can accurately predict user behavior and whether the evaluation metric correlates well with user satisfaction. While a tremendous amount of work has been undertaken to design, evaluate, and compare different evaluation metrics, few studies have explored the consistency between these two facets of evaluation metrics. Specifically, we want to investigate whether the metrics that are well calibrated with user behavior data can perform as well in estimating user satisfaction. To shed light on this research question, we compare the performance of various metrics with the C/W/L Framework in estimating user satisfaction when they are optimized to fit observed user behavior. Experimental results on both self-collected and public available user search behavior datasets show that the metrics optimized to fit users' click behavior can perform as well as those calibrated with user satisfaction feedback. We also investigate the reliability in the calibration process of evaluation metrics to find out how much data is required for parameter tuning. Our findings provide empirical support for the consistency between user behavior modeling and satisfaction measurement, as well as guidance for tuning the parameters in evaluation metrics.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Models Versus Satisfaction

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Incorporating Query Reformulating Behavior into Web Search Evaluation
Jia Chen ... Jiaxin Mao
-
Jia Chen, et. al.Jia Chen ... Jiaxin Mao
26 Oct 2021
26 Oct 2021

The study of information users satisfaction model based on the user behavior
Zou Jin ... Yan Yu
-
Zou Jin, et. al. Zou Jin ... Yan Yu
01 Aug 2012
01 Aug 2012

Constructing Better Evaluation Metrics by Incorporating the Anchoring Effect into the User Model
Nuo Chen ... Fan Zhang
-
Nuo Chen, et. al.Nuo Chen ... Fan Zhang
06 Jul 2022
06 Jul 2022

User behavior modeling for Web search evaluation
Fan Zhang ... Shaoping Ma
AI Open | VOL. 1
Fan Zhang, et. al.Fan Zhang ... Shaoping Ma
01 Jan 2020
AI Open | VOL. 1

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Models Versus Satisfaction

Abstract

Talk to us

Similar Papers