Conflicts to Harmony: A Framework for Resolving Conflicts in Heterogeneous Data by Truth Discovery

Yaliang Li,Bo Zhao,Qi Li,Lu Su,Jiawei Han,Jing Gao,Wei Fan

doi:10.1109/tkde.2016.2559481

Abstract

In many applications, one can obtain descriptions about the same objects or events from a variety of sources. As a result, this will inevitably lead to data or information conflicts. One important problem is to identify the true information (i.e., the truths ) among conflicting sources of data. It is intuitive to trust reliable sources more when deriving the truths, but it is usually unknown which one is more reliable a priori . Moreover, each source possesses a variety of properties with different data types. An accurate estimation of source reliability has to be made by modeling multiple properties in a unified model. Existing conflict resolution work either does not conduct source reliability estimation, or models multiple properties separately. In this paper, we propose to resolve conflicts among multiple sources of heterogeneous data types. We model the problem using an optimization framework where truths and source reliability are defined as two sets of unknown variables. The objective is to minimize the overall weighted deviation between the truths and the multi-source observations where each source is weighted by its reliability. Different loss functions can be incorporated into this framework to recognize the characteristics of various data types, and efficient computation approaches are developed. The proposed framework is further adapted to deal with streaming data in an incremental fashion and large-scale data in MapReduce model. Experiments on real-world weather, stock, and flight data as well as simulated multi-source data demonstrate the advantage of jointly modeling different data types in the proposed framework.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Conflicts to Harmony: A Framework for Resolving Conflicts in Heterogeneous Data by Truth Discovery

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Knowledge and Data Engineering

Lead the way for us

Journal: IEEE Transactions on Knowledge and Data Engineering	Publication Date: Aug 1, 2016
Citations: 116

Similar Papers

Resolving conflicts in heterogeneous data by truth discovery and source reliability estimation
Qi Li ... Bo Zhao
-
Qi Li, et. al.Qi Li ... Bo Zhao
18 Jun 2014
18 Jun 2014

Better Weather Forecasting through truth discovery Analysis
Zhiqiang Zhang ... Xiangbing Huang
-
Zhiqiang Zhang, et. al.Zhiqiang Zhang ... Xiangbing Huang
17 Jul 2017
17 Jul 2017

Truth Discovery in Data Streams
Zhou Zhao ... Wilfred Ng
-
Zhou Zhao, et. al.Zhou Zhao ... Wilfred Ng
03 Nov 2014
03 Nov 2014

Dynamic Truth Discovery on Numerical Data
Shi Zhi ... Zheyi Zhu
-
Shi Zhi, et. al.Shi Zhi ... Zheyi Zhu
01 Nov 2018
01 Nov 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Conflicts to Harmony: A Framework for Resolving Conflicts in Heterogeneous Data by Truth Discovery

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Knowledge and Data Engineering