Abstract

In an asymmetric population, individuals are concentrated toward one tail of the distribution. An estimator of the population mean for this asymmetric case is constructed on the basis of median ranked-set sampling: the population is divided into subsets of equal size, and the intersections of these sets depend on the chosen order of ranking according to a known auxiliary variable. Ranking individuals according to this auxiliary variable should approximate their ranking with respect to the unknown variable of interest. This procedure is a cost-effective way of selecting the sample when the variable of interest is unknown. For it to work, the auxiliary variable must be at least weakly correlated with the variable of interest. The proposed estimator extends the one constructed with extreme ranked-set sampling, whose principle is to divide the population into subsets whose intersections depend on the extreme values of the auxiliary variable. The mean square error of the estimator is expressed analytically. A simulation compares the proposed estimator with estimators based on simple random sampling and on extreme ranked-set sampling. It shows that when the response variable is correlated with both auxiliary variables, even if these correlations are weak (around 0.5 in absolute value), the mean square error of the proposed estimator is at least 175% lower than that of estimators based on either simple random or extreme ranked-set sampling. A first application focuses on household incomes in the Iranian provinces of Fars and Khuzestan in 2022, first with gross income (the total income that an individual or household earns before tax) as the single auxiliary variable, and then with two auxiliary variables: total gross household income and wages paid year-round to heads of households through the banking network.
In this application, the mean square error of the proposed estimator with median ranked-set sampling is at least 60% lower than that obtained with simple random and extreme ranked-set sampling. A second application concerns the physical preparation score of 160 Iranian athletes in 2022, with runners' track records as the auxiliary variable and sample sizes of 6, 8, 10, 25, and 30; here the mean square error of the proposed estimator with median ranked-set sampling is at least 50% lower than that obtained with simple random and extreme ranked-set sampling. A third application estimates the mean COVID-19 mortality rate in 2022 in the USA, Iran, Turkey, and Germany, with sample sizes of 6, 8, 10, 25, and 30 and with new cases as the auxiliary variable. In each of the four countries, the mean square error of the proposed estimator under median ranked-set sampling is at least 60% lower than that obtained with simple random and extreme ranked-set sampling.
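
The selection step described above, ranking small sets on an auxiliary variable and keeping only the middle-ranked unit, can be illustrated with a short Python sketch. It follows the textbook form of median ranked-set sampling and is not taken from the paper: the function name `median_ranked_set_sample`, the parameters `set_size` and `cycles`, and the synthetic lognormal population are all assumptions made for illustration. The plain sample mean computed at the end is only a stand-in and is not the estimator proposed in the paper.

```python
import numpy as np

def median_ranked_set_sample(x, set_size, cycles, rng):
    """Return indices selected by (textbook) median ranked-set sampling.

    x        : auxiliary variable, known for every population unit
    set_size : number of units per ranked set (hypothetical parameter name)
    cycles   : number of times the whole procedure is repeated
    Ranking uses x only; the variable of interest is never inspected here.
    """
    x = np.asarray(x)
    chosen = []
    for _ in range(cycles):
        # Draw set_size**2 distinct units and split them into set_size sets.
        # Simplification: units may be drawn again in a later cycle.
        block = rng.choice(len(x), size=set_size * set_size, replace=False)
        for i, candidates in enumerate(block.reshape(set_size, set_size)):
            ranked = candidates[np.argsort(x[candidates])]
            if set_size % 2 == 1:
                pos = (set_size - 1) // 2          # the median rank
            elif i < set_size // 2:
                pos = set_size // 2 - 1            # lower middle rank
            else:
                pos = set_size // 2                # upper middle rank
            chosen.append(ranked[pos])
    return np.array(chosen)

# Synthetic right-skewed population (illustrative data, not from the paper).
rng = np.random.default_rng(0)
y = rng.lognormal(mean=0.0, sigma=1.0, size=5000)        # variable of interest
x = y * np.exp(rng.normal(0.0, 0.5, size=y.size))        # correlated auxiliary

idx = median_ranked_set_sample(x, set_size=3, cycles=2, rng=rng)
estimate = y[idx].mean()   # naive stand-in, not the paper's estimator
print(len(idx), estimate)
```

Only the `set_size * cycles` selected units need to have the variable of interest measured, which is where the cost saving comes from. In a skewed population the plain mean of median-ranked units can be biased for the population mean, which is one reason a dedicated estimator is needed.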
