ABSTRACT Water supply pipes age, deteriorate and break, which puts at risk the continuous provision of safe potable water endangering the public health in cities. Risk management methods are increasingly applied to optimise the capital investment for pipe replacement and rehabilitation, taking into account the probability and hydraulic impact of pipe breaks. As part of this process, however, historic pipe break data and statistical methods should be utilised to gather causal insights for past breaks to inform operational changes and/or capital investment decisions in order to reduce future breaks. This paper presents a comparative study of statistical and machine learning methods to carry out an exploratory causal analysis for historic pipe breaks in an operational water supply network. Regression models for count data and probabilistic models have been developed. The performance of these models was assessed and enhanced with the introduction of interactions and the inclusion of different network representations.