Abstract

In several applications, information is drawn from quali- tative variables. In such cases, measures of central tendency and dis- persion may be highly inappropriate. Variability for categorical data can be correctly quantied by the so-called diversity measures. These measures can be modied to quantify heterogeneity between groups (or subpopulations). Pinheiro et al. (2005) shows that Hamming distance can be employed in such way and the resulting estimator of hetero- geneity between populations will be asymptotically normal under mild regularity conditions. Pinheiro et al. (2009) proposes a class of weighted U-statistics based on degenerate kernels of degree 2, called quasi U-statistics, with the property of asymptotic normality under suitable conditions. This is generalized to kernels of degree m by Pinheiro et al. (2011). In this work we generalize this class to an innite order degenerate kernel. We then use this powerful tools and the reverse martingale nature of U-statistics to study the asymptotic behavior of a collection of trans- formed classic diversity measures. We are able to estimate them in a common framework instead of the usual individualized estimation procedures. MSC 2000: primary - 62G10; secondary - 62G20, 92D20.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.