Abstract

The total knowledge contained within a collective supersedes the knowledge of even its most intelligent member. Yet the collective knowledge will remain unaccessible to us unless we are able to find efficient knowledge aggregation methods which produce reliable decisions based on the behavior or opinions of the collective's members. It is often stated that simple averaging of a pool of opinions is a good and in many cases the optimal way to extract knowledge from a crowd. The method of averaging has been applied to analysis of decision-making in very different fields such as forecasting, collective animal behavior, individual psychology and machine learning. Two mathematical theorems, Condorcet's theorem and Jensen's inequality, provide a general theoretical justification for the averaging procedure. Yet the necessary conditions which guarantee the applicability of these theorems are often not met in practice. Under such circumstances, averaging can lead to suboptimal and sometimes very poor performance. Practitioners in many different fields have independently developed procedures to counteract the failures of averaging. We review such knowledge aggregation procedures and interpret the methods in the light of a statistical decision theory framework to explain when their application is justified. Our analysis indicates that in the ideal case, there should be a matching between the aggregation procedure and the nature of the knowledge distribution, correlations and associated error costs. This leads us to explore how machine learning techniques can be used to extract near-optimal decision rules in a data-driven manner. We end with a discussion of open frontiers in the domain of knowledge aggregation and collective intelligence in general.

Highlights

  • Decisions must be grounded on a good understanding of the state of the world (Green and Swets, 1988)

  • We begin our review of collective intelligence with a brief survey of statistical decision theory (Green and Swets, 1988; Bishop, 2006; Trimmer et al, 2011)

  • The collection of methodologies grouped under the umbrella term wisdom of crowds (WOC) has found widespread application and continues to generate new research at a considerable pace

Read more

Summary

INTRODUCTION

Decisions must be grounded on a good understanding of the state of the world (Green and Swets, 1988). Rather than estimating the numeric value of a quantity, the group needs to choose the best option among a set of alternatives In such cases, the majority vote can be seen as the analog of averaging. Conradt and Roper (2003) have presented a theoretical treatment where the majority vote emerges as a good solution to the problem of resolving conflicts of interest within a group (such applications may in turn suffer from other problems such as the absence of collective rationality (List, 2011)) These issues remain outside the scope of the present review. We refer the interested reader to dedicated review articles on this topic (Bonabeau et al, 1999; Couzin and Krause, 2003; Garnier et al, 2007; Vicsek and Zafeiris, 2012; Valentini et al, 2017)

A BRIEF PRIMER ON STATISTICAL DECISION THEORY
THE RELATIONSHIP BETWEEN INDIVIDUAL OPINIONS AND THE TRUTH
Leveraging Information about Biases and Shapes
Individuality and Expertise
THE ROLE OF DEPENDENCIES
Correlations Can Improve Performance in Voting Models
Correlations and Continuous Variables
Measures of Intelligence
Beyond Convexity
EMBRACING COMPLEXITY: A MACHINE LEARNING APPROACH
DISCUSSION
Convexity
Bias and Variance
Derivation of the Many-Eyes Model
Modeling the Influence of Distracting
Findings
Training Neural Networks

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.