BackgroundIndirect sampling methods are not only inexpensive but also efficient for establishing reference intervals (RIs) using clinical data. This study was conducted to select fully normal records to establish ageand gender-specific RIs for common biochemical analytes by laboratory data mining.MethodsIn total, 280,206 records from 2014 to 2018 were obtained from Peking Union Medical College Hospital. Common biochemical analytes total protein, albumin, total bilirubin (TBil), direct bilirubin (DBil), alanine aminotransferase (ALT), glutamyltranspeptidase (GGT), alkaline phosphatase (ALP), aspartate aminotransferase (AST), lactate dehydrogenase (LDH), potassium, sodium, chlorine, calcium, urea, glucose, uric acid (UA), inorganic phosphorus, creatinine (Cr), total cholesterol, triglyceride, high-density lipoprotein cholesterol, and low-density lipoprotein cholesterol] were measured using an automatic analyzer. Sources of variation were identified by multiple regression analysis. The 2.5th and 97.5th percentiles were calculated as the lower and upper limits of the RIs, respectivelyResultsGender was the major source of variation among the 13 common biochemical analytes with an rp > 0.15. In contrast to the value listed in the WS/T 404, nearly all RIs established in this study were significantly narrower. Furthermore, age-specific RIs should be determined for DBil, LDH, and urea, whereas gender-specific RIs are suggested for GGT, LDH, and urea.ConclusionsWe recommend that gender-specific RIs should be established for ALT, AST, GGT, DBil, TBil, UA, and Cr as well as genderand age-specific RIs for urea and ALP. Through indirect sampling, ageand gender-specific RIs for common biochemical analytes were established and analyzed.