Measurement of glucose in multiple Field Laboratories requires rigorous standardization when patients and caregivers are masked, unless predefined thresholds are met. Local misclassification of participants at the thresholds can introduce recruitment bias and adversely affect the integrity of study findings. To describe the challenges and the approach to meeting them in measuring glucose, HbA1c, and C-peptide in the Hyperglycemia and Adverse Pregnancy Outcome (HAPO) Study. HAPO is an observational epidemiologic study of 25 000 pregnant women from 15 centres in 10 countries, designed to clarify unanswered questions on associations of maternal glycemia, less severe than overt diabetes mellitus, with risks of adverse pregnancy outcome. Glucose tolerance (75 g two-hour OGTT) is assessed locally at 24-32 weeks' gestation, with results masked if fasting and two-hour plasma glucose are </=5.8 mmol/L and </=11.1 mmol/L, respectively. For analysis of outcomes HAPO utilizes measurements of glucose, C-peptide, and HbA1c on frozen samples in a Central Laboratory; measurement of HbA1c on stored, frozen blood also creates special challenges. A common external quality assessment scheme was used to standardize glucose measurements between Field Centre and Central Laboratories before and during data collection. Agreement between Field Centres and the Central Laboratory in rates above the masking thresholds has been excellent (Kappa = 0.79, P < 0.001), with rates of 1.8 and 1.7% respectively. Percent technical error for Central Laboratory OGTT measurements of glucose, C-peptide, and HbA1c, were 2.0, 4.2, and 2.0, respectively. It is not possible to assess if comparable results could have been obtained with a less rigorous approach to standardization. HAPO has been successful in ensuring comparability of glucose measurement between its Central Laboratory and Field Centre laboratories and in measuring its key metabolites with accuracy and precision. Recruitment bias based on glucose measurement has been avoided.