Abstract

The objective of this research is to analyze the electricity billing data in Uruguay during the period 2000-2022, using machine learning algorithms. It is a descriptive-explanatory type of research, and the Data Science methodology is used to achieve the stated objective. The K-Means algorithm is used to group the data according to the types of clients, the K-NN algorithm to generate a model that allows predicting the type of client of the new records, the PCA technique to reduce the dimensionality of the data, prior to the application of the Linear Regression algorithm to obtain a model to predict the total electric energy billed from the new records. The model obtained with K-Means generated a cluster for each type of customer, perfectly grouping the data. The model obtained through K-NN made it possible to predict whether the client was residential or non-residential, with 100% accuracy. Combining the correlation analysis with the PCA analysis, the dimensionality was reduced until only three explanatory variables were obtained. The linear regression model had a high coefficient of determination R2 of 0.981, and the residuals were normally distributed.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.