Abstract

Quasi-orders, that is, reflexive and transitive binary relations, have numerous applications. In educational theories, the dependencies of mastery among the problems of a test can be modeled by quasi-orders. Methods such as item tree or Boolean analysis that mine for quasi-orders in empirical data are sensitive to the underlying quasi-order structure. These data mining techniques have to be compared based on extensive simulation studies, with unbiased samples of randomly generated quasi-orders at their basis. In this paper, we develop techniques that can provide the required quasi-order samples. We introduce a discrete doubly inductive procedure for incrementally constructing the set of all quasi-orders on a finite item set. A randomization of this deterministic procedure allows us to generate representative samples of random quasi-orders. With an outer level inductive algorithm, we consider the uniform random extensions of the trace quasi-orders to higher dimension. This is combined with an inner level inductive algorithm to correct the extensions that violate the transitivity property. The inner level correction step entails sampling biases. We propose three algorithms for bias correction and investigate them in simulation. It is evident that, on even up to 50 items, the new algorithms create close to representative quasi-order samples within acceptable computing time. Hence, the principled approach is a significant improvement to existing methods that are used to draw quasi-orders uniformly at random but cannot cope with reasonably large item sets.

Highlights

  • We present simulation results for the absolute rejection method (ARM), simple resampling method (SIRM), and stratified resampling method (STRM) used to sample quasi-orders

  • We can see that the points in the P-P plots all fall on the straight lines. This indicates that the sampling methods ARM, SIRM, STRM, and uniform extension method (UEM) give virtually the “same” size distributions for quasi-orders being randomly and representatively generated by any of these methods

  • Schrepp and Ünlü (2015) and Ünlü and Schrepp (2015) discussed the importance of representative random quasi-order samples needed in extensive simulation studies for the reliable comparison of data mining algorithms used to reconstruct relational dependencies among behavioral test items

Read more

Summary

INTRODUCTION

We address why discrete modeling with quasi-orders is useful and why we need to sample quasi-orders we want to be representative. This section gives an overview of the main contributions and organization of this paper

Why Discrete Order Structures Are Important
Why Representative Quasi-Order Samples Are Important
Content and Structure of This Work
Flexible but Non-representative Ad hoc Strategies
Representative but Infeasible Direct Methods
DOUBLY INDUCTIVE PROCEDURE FOR QUASI-ORDER CONSTRUCTION
Description of the Deterministic Construction Procedure
Properties of the Doubly Inductive Procedure
RANDOMIZATION OF THE DISCRETE DOUBLY INDUCTIVE CONSTRUCTION PROCEDURE
Description of the Probabilistic Sampling Procedure
Induced Sampling Biases and Bias Correction Factors
PROCEDURAL VARIANTS FOR BIAS CORRECTION
Absolute Rejection Method
Simple and Stratified Resampling Methods
SIRM Approach
SIMULATION RESULTS
Summary and Final Remarks
Further Research
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call