Eligible Voters Research Articles

Statisticians are increasingly posed with thought-provoking and even paradoxical questions, challenging our qualifications for entering the statistical paradises created by Big Data. By developing measures for data quality, this article suggests a framework to address such a question: “Which one should I trust more: a 1% survey with 60% response rate or a self-reported administrative dataset covering 80% of the population?” A 5-element Euler-formula-like identity shows that for any dataset of size $n$, probabilistic or not, the difference between the sample average $\overline{X}_{n}$ and the population average $\overline{X}_{N}$ is the product of three terms: (1) a data quality measure, $\rho_{{R,X}}$, the correlation between $X_{j}$ and the response/recording indicator $R_{j}$; (2) a data quantity measure, $\sqrt{(N-n)/n}$, where $N$ is the population size; and (3) a problem difficulty measure, $\sigma_{X}$, the standard deviation of $X$. This decomposition provides multiple insights: (I) Probabilistic sampling ensures high data quality by controlling $\rho_{{R,X}}$ at the level of $N^{-1/2}$; (II) When we lose this control, the impact of $N$ is no longer canceled by $\rho_{{R,X}}$, leading to a Law of Large Populations (LLP), that is, our estimation error, relative to the benchmarking rate $1/\sqrt{n}$, increases with $\sqrt{N}$; and (III) the “bigness” of such Big Data (for population inferences) should be measured by the relative size $f=n/N$, not the absolute size $n$; (IV) When combining data sources for population inferences, those relatively tiny but higher quality ones should be given far more weights than suggested by their sizes. Estimates obtained from the Cooperative Congressional Election Study (CCES) of the 2016 US presidential election suggest a $\rho_{{R,X}}\approx-0.005$ for self-reporting to vote for Donald Trump. Because of LLP, this seemingly minuscule data defect correlation implies that the simple sample proportion of the self-reported voting preference for Trump from $1\%$ of the US eligible voters, that is, $n\approx2\mbox{,}300\mbox{,}000$, has the same mean squared error as the corresponding sample proportion from a genuine simple random sample of size $n\approx400$, a $99.98\%$ reduction of sample size (and hence our confidence). The CCES data demonstrate LLP vividly: on average, the larger the state’s voter populations, the further away the actual Trump vote shares from the usual $95\%$ confidence intervals based on the sample proportions. This should remind us that, without taking data quality into account, population inferences with Big Data are subject to a Big Data Paradox: the more the data, the surer we fool ourselves.

Read full abstract

AbstractThe public discussion of executive compensation often centres on ‘fair’ and ‘unfair’ amounts and the public outrage over compensation that is deemed too high. The academic literature states that such outrage can lead to outrage costs, pressuring firms to adjust compensation levels. However, it is unclear what a ‘fair’ compensation is for various stakeholders and how their fairness concerns relate to outrage constraints. Based on surveys among two key stakeholder groups (representative eligible voters and investment professionals), we provide evidence that fairness is an important criterion for both groups but that opinions on how large a fair compensation amount should be are widely dispersed. Moreover, personality traits systematically influence fairness opinions through self‐serving interpretations of distributive justice and personal risk attitudes, indicating that a ‘fair’ amount of executive compensation may strongly depend on the involved stakeholders. Investigating thresholds for outrage, i.e., amounts above which compensation is judged ‘unfairly’ high, we show that even though investment professionals care for fairness as well, ‘capital market outrage’ might not equate to ‘public outrage’. Our paper contributes to the literature on outrage constraints by linking individual fairness concerns to outrage potential and has implications for transparency of executive compensation and research on shareholder activism.

Read full abstract

Eligible Voters Research Articles

Related Topics

Articles published on Eligible Voters

The Groundswell Political Campaign Strategy of Anies-Sandi in the 2017 DKI Jakarta Governor Election

Do Surveys Overestimate or Underestimate Socioeconomic Differences in Voter Turnout? Evidence from Administrative Registers

SIGHPC elections

Do direct elections matter? Quasi-experimental evidence from Germany

The Elephant in the Room: Intentional Voter Suppression

On the Political Economy of Felon Disenfranchisement

Congressional districts: How “equal” are they?

A new campaign strategy informed by pragmatism: Running on a platform of expanding voting accessibility

Compulsory Voting: A Defence

Assessing Employee Support during Union Organizing Campaigns

Naysaying and negativity promote initial power establishment and leadership endorsement.

Narcissistic Women and Cash-Strapped Men: Who Can Be Encouraged to Consider Running for Political Office, and Who Should Do the Encouraging?

Blockchain-Enabled E-Voting

Statistical paradises and paradoxes in big data (I): Law of large populations, big data paradox, and the 2016 US presidential election

Connecting young adults to democracy via government social network sites

A novel approach to estimating the demand value of public safety

What is a fair amount of executive compensation? Outrage potential of two key stakeholder groups

Let’s stop the enemy!

Civic Engagement in the Americas

On the independence referendum in the Kurdistan Region of Iraq and disputed territories in 2017

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Eligible Voters Research Articles

Related Topics

Articles published on Eligible Voters

The Groundswell Political Campaign Strategy of Anies-Sandi in the 2017 DKI Jakarta Governor Election

Do Surveys Overestimate or Underestimate Socioeconomic Differences in Voter Turnout? Evidence from Administrative Registers

SIGHPC elections

Do direct elections matter? Quasi-experimental evidence from Germany

The Elephant in the Room: Intentional Voter Suppression

On the Political Economy of Felon Disenfranchisement

Congressional districts: How “equal” are they?

A new campaign strategy informed by pragmatism: Running on a platform of expanding voting accessibility

Compulsory Voting: A Defence

Assessing Employee Support during Union Organizing Campaigns

Naysaying and negativity promote initial power establishment and leadership endorsement.

Narcissistic Women and Cash-Strapped Men: Who Can Be Encouraged to Consider Running for Political Office, and Who Should Do the Encouraging?

Blockchain-Enabled E-Voting

Statistical paradises and paradoxes in big data (I): Law of large populations, big data paradox, and the 2016 US presidential election

Connecting young adults to democracy via government social network sites

A novel approach to estimating the demand value of public safety

What is a fair amount of executive compensation? Outrage potential of two key stakeholder groups

Let’s stop the enemy!

Civic Engagement in the Americas

On the independence referendum in the Kurdistan Region of Iraq and disputed territories in 2017