As professionals in data processing, analysis and information technology, we read with interest the Bulletin’s coverage of the barriers to data sharing in public health.1 We contend that there are many existing solutions for low-cost, high-quality data collection, management and analysis. Many of these systems are built on open-source technologies and thus are more amenable to receiving input for their design and operation from information technologists and researchers in low-income countries. The information technology gap between rich and poor countries may not be as large as some may think. For example, a recent map of Facebook “friend” linkages shows areas of high internet connectivity in low-income countries, with specific interest to us being the connection into Rwanda from Mombasa, Kenya.2 In our institutions, after some consideration,3 we adopted a web-based, open source clinical trials package called OpenClinica (Akaza Research, Waltham, MA, United States of America) for a large (n > 3000) multi-site trial (more information available at: http://www.feast-trial.org). For observational epidemiological and clinical studies that rely on, or are derived from, surveillance systems there are several free, easy-to-install systems such as RedCap,4 OpenMRS5 and OpenXdata. Indeed, there are novel technologies that have come from low-income countries that are now in use in high-income countries, e.g. software developer Ushahidi.6 As Tom Smith of the Swiss Tropical Institute said at the Pan-African Malaria Conference in 2009 when introducing a presentation on a mobile phone-based system for malaria surveillance in Zanzibar, United Republic of Tanzania: “Africa is in the vanguard in the use of such technology.” Indeed the Kenyan mobile phone-based money transfer system, M-Pesa, is highly regarded and has spread very quickly.7 Given the ubiquity of mobile phones, the use of developing technologies, such as lens-free microscopy,8 are likely to have major impacts soon on disease surveillance in low-income countries. With such technologies comes the need for a tool to effectively analyse and disseminate the data generated. One such tool is the open-source software package, simply named R, arguably9 the most rapidly evolving and powerful data analytical engine with which the major statistics software packages can integrate. A most useful input from the World Health Organization and the Special Programme for Research and Training in Tropical Diseases has been sponsorship of the online R course at Thailands' Prince of Songkla University and publication of a free book. R’s potential for data storage, use and analysis is well illustrated by Zack Almquist10 who has created a freely available set of tools that interrogates data from the year 2000 census in the United States of America. Pisani & AbouZahr discuss how collaboration allows researchers to stand on the shoulders of giants.1 We think this R package does just that. The field of genetics, which they cite as one that public health and epidemiological researchers should emulate, has been greatly assisted by R through the BioConductor project. Robert Gentleman, who developed R with Ross Ihaka,11 is a notable contributor to this project.12 With powerful data programmes freely available, data can be shared more widely, but this also depends on skilled personnel. In east Africa alone, we need to train 100 new data managers and statisticians every year so that researchers, data managers and analysts can benefit from the wider availability of data and to ensure its quality. Public health researchers need to be trained in good data collection methods, mentoring with researchers and partnering with data analysis experts from developed countries. Systems are required that extend beyond research projects into the collection of routine data for public health systems that can be used to monitor the health of the whole population. We think technology and shared data have the potential to radically transform the health systems of low-income countries within our lifetime. We believe that technology and data should be shared equitably across all countries and that everyone should be enabled to use the results from the acquired knowledge.
Read full abstract