Abstract
Data often consist of discrete observations of events that are sparse and yet concentrated (e.g. road accidents or murders). Traditional methods for computing summary statistics often place the data in discrete bins, but for this type of data that approach frequently produces a large number of empty bins for which no function or summary statistic can be computed.
Here, a method for dealing with sparse and concentrated observations is constructed. It is based on a sequence of non-overlapping bins of varying size and gives a continuous interpolation of the data from which summary statistics, such as the mean, can be computed. The method overcomes the problem that sparsity and concentration present when computing functions to represent the data. Implementation of the method is facilitated by open access to the code.
• A new method for computing functions over sparse and concentrated data is constructed.
• The method allows straightforward functions, such as the mean, to be computed over partitions of the data, as well as more complicated functions, such as coefficients, ratios, correlations, regressions and others.
Highlights
The method presented here overcomes the problem that sparsity and concentration present when computing functions to represent the data.
As an example, a cafe wants to investigate how much money customers spend at different times of the day.
The cafe has records that give the time of each purchase and the amount paid by each customer.
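The cafe example can be sketched in a few lines of Python. This is only an illustration of the problem and of the general idea of non-overlapping bins of varying size; it is not the authors' published algorithm. The records are invented, the function names are hypothetical, and grouping a fixed number `k` of consecutive observations per bin is an assumed stand-in for whatever rule the paper uses to size its bins.

```python
from bisect import bisect_right
from statistics import mean

# Hypothetical cafe records: (hour of day, amount paid). Invented values.
records = [
    (8.1, 3.5), (8.2, 4.0), (8.3, 2.5), (8.4, 5.0), (8.6, 3.0), (8.9, 4.5),
    (12.1, 9.0), (12.2, 11.0), (12.4, 8.5), (12.5, 10.5), (12.8, 12.0), (13.0, 9.5),
    (17.5, 6.0), (21.0, 7.0),
]


def fixed_bin_means(records, start, stop, width):
    """Mean amount per fixed-width bin; None marks an empty bin."""
    n_bins = int(round((stop - start) / width))
    sums = [0.0] * n_bins
    counts = [0] * n_bins
    for t, amount in records:
        i = int((t - start) // width)
        if 0 <= i < n_bins:
            sums[i] += amount
            counts[i] += 1
    return [s / c if c else None for s, c in zip(sums, counts)]


def equal_count_bins(records, k):
    """Group time-sorted records into consecutive, non-overlapping bins of k
    observations each (the last bin may hold fewer), so bin widths vary.
    Returns a list of (bin midpoint, mean amount) pairs.
    Assumption: equal-count grouping is a stand-in, not the paper's rule."""
    ordered = sorted(records)
    bins = []
    for i in range(0, len(ordered), k):
        group = ordered[i:i + k]
        midpoint = (group[0][0] + group[-1][0]) / 2
        bins.append((midpoint, mean(a for _, a in group)))
    return bins


def interpolate(bins, t):
    """Piecewise-linear interpolation between bin midpoints, held constant
    outside the first and last midpoint. Assumes distinct midpoints."""
    xs = [x for x, _ in bins]
    ys = [y for _, y in bins]
    if t <= xs[0]:
        return ys[0]
    if t >= xs[-1]:
        return ys[-1]
    j = bisect_right(xs, t)
    x0, x1, y0, y1 = xs[j - 1], xs[j], ys[j - 1], ys[j]
    return y0 + (y1 - y0) * (t - x0) / (x1 - x0)


# Hourly bins between 08:00 and 22:00: most of them contain no customers.
hourly = fixed_bin_means(records, 8, 22, 1)
empty_hours = sum(m is None for m in hourly)

# Variable-width bins never come out empty, and the interpolation gives an
# estimate at 10:00 even though no customer paid during that hour.
bins = equal_count_bins(records, k=4)
estimate_at_10 = interpolate(bins, 10.0)
```

With hourly bins most of the day has no mean at all, whereas the variable-width bins are narrow around the morning and lunchtime rushes and wide in the quiet afternoon, so every bin contributes a value and the interpolated curve is defined at every time of day.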
Summary
Rafael Prieto Curiel (a,*), Carmen Cabrera Arnau (b), Mara Torres Pinedo (c), Humberto González Ramírez (d), Steven Richard Bishop (b)
(a) Mathematical Institute, University of Oxford, United Kingdom
(b) Mathematics Department, University College London, United Kingdom
(c) Institute for Global Prosperity, University College London, United Kingdom
(d) École Nationale des Travaux Publics de l'État, ENTPE, Université de Lyon 2, France