Analysis of Categorical Data with the R Package confreq

Jörg-Henrik Heine, Mark Stemmler

doi:10.3390/psych3030034

Open Access

Abstract

The person-centered approach in categorical data analysis is introduced as a complement to the variable-centered approach. The former groups persons, animals, or objects on the basis of their combination of characteristics, which can be displayed in multiway contingency tables. Configural Frequency Analysis (CFA) and log-linear modeling (LLM) are the two most prominent (and related) statistical methods for such tables. Both compare observed frequencies ($f_{o_{i \ldots k}}$) with expected frequencies ($f_{e_{i \ldots k}}$). While LLM primarily uses a model-fitting approach, CFA analyzes the residuals of non-fitting models. Residuals with significantly more observed than expected frequencies ($f_{o_{i \ldots k}} > f_{e_{i \ldots k}}$) are called types, while residuals with significantly fewer observed than expected frequencies ($f_{o_{i \ldots k}} < f_{e_{i \ldots k}}$) are called antitypes. The R package confreq is presented and its use is demonstrated with several data examples. Results of contingency table analyses can be displayed in tables and also in graphics that represent the size and type of each residual. The expected frequencies represent the null hypothesis, and different null hypotheses result in different expected frequencies. Different kinds of CFA are presented: the first-order CFA based on the null hypothesis of independence, CFA with covariates, and the two-sample CFA. The calculation of the expected frequencies can be controlled through the design matrix, which can be easily handled in confreq.
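The comparison of observed and expected frequencies described in the abstract can be illustrated with a minimal sketch of a first-order CFA. This is not the confreq API: the function name `first_order_cfa`, the made-up 2 × 2 counts, the normal approximation z = (f_o - f_e) / sqrt(f_e), and the Bonferroni adjustment of alpha are all assumptions chosen for illustration. Expected frequencies are computed under the null hypothesis of independence as N times the product of the marginal proportions.

```python
from itertools import product
from math import prod, sqrt
from statistics import NormalDist


def first_order_cfa(observed, alpha=0.05):
    """Illustrative first-order CFA (hypothetical helper, not part of confreq).

    observed: dict mapping configuration tuples, e.g. (1, 2), to counts.
    Assumes every category of every variable occurs at least once,
    so that all expected frequencies are positive.
    """
    n = sum(observed.values())
    n_vars = len(next(iter(observed)))

    # Marginal totals for each variable.
    margins = [dict() for _ in range(n_vars)]
    for config, count in observed.items():
        for v, level in enumerate(config):
            margins[v][level] = margins[v].get(level, 0) + count

    # All possible configurations, including ones with zero observed count.
    configs = list(product(*(sorted(m) for m in margins)))

    # Bonferroni adjustment: one test per configuration (an assumed choice).
    alpha_adj = alpha / len(configs)

    results = {}
    for config in configs:
        f_o = observed.get(config, 0)
        # Expected frequency under independence of all variables.
        f_e = n * prod(margins[v][lvl] / n for v, lvl in enumerate(config))
        z = (f_o - f_e) / sqrt(f_e)
        p = 2 * (1 - NormalDist().cdf(abs(z)))  # two-sided p-value
        if p < alpha_adj:
            label = "type" if f_o > f_e else "antitype"
        else:
            label = "-"
        results[config] = {
            "observed": f_o, "expected": f_e, "z": z, "p": p, "label": label,
        }
    return results


# Made-up counts for two binary variables (illustration only).
observed = {(1, 1): 30, (1, 2): 10, (2, 1): 10, (2, 2): 50}
results = first_order_cfa(observed)
for config, row in results.items():
    print(config, row["observed"], round(row["expected"], 2),
          round(row["z"], 2), row["label"])
```

A different null hypothesis, such as one including interaction terms or covariates, would change only the computation of `f_e`; the classification into types and antitypes stays the same.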

Highlights

Data that include categorical variables are often seen in the social sciences and psychological research
The term categorical variables typically refers to variables that, according to Stevens's [1] influential taxonomy of scale levels, have at least a nominal or ordinal scale level
Although Stevens's taxonomy was criticized almost as soon as it was introduced (see, e.g., [2], but see [3]) and can be regarded as the initial spark for a controversy about scale levels and the measurement of social science variables as such (e.g., [3,4,5,6,7]), it can at least provide a useful heuristic for the practice of data analysis

Summary

Introduction

Data that include categorical variables are often seen in the social sciences and psychological research. Although Stevens's taxonomy was criticized almost as soon as it was introduced (see, e.g., [2], but see [3]) and can be regarded as the initial spark for a (still ongoing) controversy about scale levels and the measurement of social science variables as such (e.g., [3,4,5,6,7]), it can at least provide a useful heuristic for the practice of data analysis. From such a practice perspective, the term categorical variables can be used to characterize variables that comprise few distinct trait expressions or attributes that result from the classification of any type of observation into “one of a set of mutually exclusive and collectively exhaustive categories” [8] p. […] Analysis (CFA), and provides a link to the R package vcd [14,15] for the visualization of cross-tabulated categorical data.

A Person-Centered Perspective on Data

Introduction to the confreq Framework in R

Working with confreq

A First Look on a Classical Data Example

The CFA Main Effect Model of Independency

Modifying the CFA-Model Design Matrices

Introducing Covariates into the CFA-Model

Comparing Pattern Frequencies for Two Samples with CFA

Summary and Conclusions

Journal: Psych
Publication Date: Sep 7, 2021
Citations: 3
License type: CC BY 4.0

Similar Papers

Using Configural Frequency Analysis as a Person-centered Analytic Approach with Categorical Data
Mark Stemmler ... Jörg-Henrik Heine
International Journal of Behavioral Development | VOL. 41
09 Jul 2016

Person-centered data analysis with covariates and the R-package confreq
Mark Stemmler ... Jörg-Henrik Heine
Methodology | VOL. 17
30 Jun 2021

Categorical Data Analysis
Alan Agresti
03 Jul 2002

Temporal patterns of variable relationships in person-oriented research: Longitudinal models of configural frequency analysis.
Alexander Von Eye ... G Anne Bogat
Developmental Psychology | VOL. 44
01 Mar 2008
