The diagnosis of disorders of gut-brain interaction (DGBI) in children is based exclusively on clinical criteria known as the Rome criteria. Inter-rater reliability (IRR) measures how well two raters agree on a diagnosis when using the same diagnostic tool. Previous versions of the Rome criteria showed only fair to moderate IRR, and no study has assessed the IRR of the current pediatric edition (Rome IV). This study sought to investigate the IRR of the pediatric Rome IV criteria and to compare it with that of previous versions. We hypothesized that the changes made in Rome IV would result in higher IRR than in previous versions. The study used the same methodology as the earlier studies on Rome II and III, including identical clinical vignettes, number of raters, and levels of expertise. Participants included 10 pediatric gastroenterology fellows and 10 pediatric gastroenterology specialists. IRR was assessed using the percentage of agreement and Cohen's kappa coefficient, which accounts for agreement expected by chance. The average percentage of agreement using the Rome IV criteria was 55% for pediatric gastroenterologists and 48.5% for fellows, indicating moderate agreement (κ = 0.54 for specialists, κ = 0.47 for fellows). Both the percentages of agreement and the kappa coefficients were higher than those reported for the Rome II and III criteria. These findings demonstrate improved reliability in Rome IV compared to Rome II and III, suggesting that the changes incorporated into the Rome IV criteria have enhanced diagnostic consistency. Despite this improvement, reliability remains only moderate, indicating the need for further refinement in future versions of the Rome criteria.
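For readers unfamiliar with the two agreement measures named above, the following is a minimal sketch of how percentage of agreement and Cohen's kappa can be computed for a single pair of raters. The function names, the generic diagnosis labels, and the example ratings are all hypothetical and are not taken from the study; how the study averaged agreement across its raters is not described in the abstract and is not reproduced here.

```python
from collections import Counter


def percent_agreement(rater_a, rater_b):
    """Fraction of cases on which two raters gave the same label."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("rating lists must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(rater_a, rater_b))
    return matches / len(rater_a)


def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa: observed agreement corrected for chance agreement."""
    n = len(rater_a)
    p_observed = percent_agreement(rater_a, rater_b)
    counts_a = Counter(rater_a)
    counts_b = Counter(rater_b)
    # Chance agreement: probability that both raters pick the same label
    # if each labels independently at their own observed rates.
    labels = counts_a.keys() | counts_b.keys()
    p_expected = sum(counts_a[c] * counts_b[c] for c in labels) / (n * n)
    if p_expected == 1.0:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (p_observed - p_expected) / (1.0 - p_expected)


if __name__ == "__main__":
    # Hypothetical ratings of ten vignettes with placeholder labels.
    rater_a = ["A", "A", "B", "B", "C", "C", "A", "B", "C", "A"]
    rater_b = ["A", "B", "B", "B", "C", "A", "A", "B", "C", "C"]
    print(f"agreement = {percent_agreement(rater_a, rater_b):.2f}")
    print(f"kappa     = {cohens_kappa(rater_a, rater_b):.2f}")
```

Kappa is lower than raw agreement here because some matches would be expected even if both raters assigned labels at random, which is why the two measures are typically reported together.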