Fundamental Limits of Data Analytics in Sociotechnical Systems

Lav R. Varshney

doi:10.3389/fict.2016.00002

Abstract

In the Big Data era, informational systems involving humans and machines are being deployed in multifarious societal settings. Many use data analytics as subcomponents for descriptive, predictive, and prescriptive tasks, often trained using machine learning. Yet when analytics components are placed in large-scale sociotechnical systems, it is often difficult to characterize how well the systems will act, measured with criteria relevant in the world. Here, we propose a system modeling technique that treats data analytics components as "noisy black boxes" or stochastic kernels, which together with elementary stochastic analysis provides insight into fundamental performance limits. An example application is helping prioritize people's limited attention, where learning algorithms rank tasks using noisy features and people sequentially select from the ranked list. This paper demonstrates the general technique by developing a stochastic model of analytics-enabled sequential selection, derives fundamental limits using concomitants of order statistics, and assesses limits in terms of system-wide performance metrics like screening cost and value of objects selected. Connections to sample complexity for bipartite ranking are also made.
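
As a rough illustration of the "noisy black box" idea in the abstract, the sketch below simulates ranking objects by a noisy score and selecting the top few. The true values carried along with the sorted scores play the role of concomitants of order statistics. Everything specific here is an assumption made for illustration and is not taken from the paper: the function name `simulate_selection`, the standard Gaussian values, the additive Gaussian noise, and the parameter choices.

```python
import random


def simulate_selection(n, k, noise_sd, trials=2000, seed=0):
    """Return (noisy, oracle): average total true value of the k objects
    selected by noisy score and by true value, respectively.

    Hypothetical model: values ~ N(0, 1), score = value + N(0, noise_sd^2).
    """
    rng = random.Random(seed)
    total_noisy = 0.0
    total_oracle = 0.0
    for _ in range(trials):
        values = [rng.gauss(0.0, 1.0) for _ in range(n)]
        scores = [v + rng.gauss(0.0, noise_sd) for v in values]
        # Sort by score, highest first; the values that travel with the
        # sorted scores are the concomitants of the score order statistics.
        ranked = sorted(zip(scores, values), reverse=True)
        total_noisy += sum(v for _, v in ranked[:k])
        # Oracle baseline: select directly on the true values.
        total_oracle += sum(sorted(values, reverse=True)[:k])
    return total_noisy / trials, total_oracle / trials


if __name__ == "__main__":
    for sd in (0.0, 0.5, 1.0, 2.0):
        noisy, oracle = simulate_selection(n=100, k=10, noise_sd=sd)
        print(f"noise_sd={sd}: noisy={noisy:.2f}, oracle={oracle:.2f}")
```

In this toy model, the gap between the two averages is one way to see how noise in the analytics component limits the value of the objects selected, whatever algorithm produced the scores.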

Highlights

There is an emerging ubiquity to data analytics that have multifarious machine learning and data mining algorithm subcomponents and that are embedded in sociotechnical systems, such as firms and cities.
Data analytics have emerged as a key driver of value in business operations and allow firms to differentiate themselves in competitive markets (Apte et al., 2003; Davenport and Harris, 2007; Varshney and Mojsilović, 2011).
In the remainder of this paper, we demonstrate the approach of treating machine learning components as stochastic kernels in analyzing the performance of sociotechnical systems, through an example of sequential selection.

Summary

INTRODUCTION

There is an emerging ubiquity to data analytics that have multifarious machine learning and data mining algorithm subcomponents and that are embedded in sociotechnical systems, such as firms and cities. The easy theoretical approach is meant to yield insights for consumption by potential users of data systems, such as business executives or city government officials. Such users are interested in understanding the basic trade-offs present in these systems under metrics they care about, knowing how much value an algorithm deployment effort can provide, and determining whether it is worthwhile to spend time and energy developing specific advanced algorithms. They are typically not interested in detailed evaluation of specific algorithm performance. We describe how the approach was successfully used by human resource executives in a large multinational corporation and by government officials in a medium-sized American city.

DATA ANALYTICS TO PRIORITIZE HUMAN ATTENTION

A MODEL OF SOCIOTECHNICAL SEQUENTIAL SELECTION SYSTEMS

ANALYSIS OF ANALYTICS-BASED PRIORITIZATION

Findings

CONCLUSION

Journal: Frontiers in ICT	Publication Date: Feb 18, 2016
Citations: 32	License type: cc-by
