One method of demand response (DR) is time-of-use planning, which assigns a fee to the use of low-load (LL), middle-load (ML), and peak-load (PL) hours. This strategy can be implemented on an agent graph (AG) of several clusters of a season instead of daily graphs of a season. In fact, the extraction of LL, ML, and PL hours of clusters from a season and the implementation of the demand response program (DRP) have been discussed in a few studies, which is known as the research gap. In this regard, this paper proposes a method based on unsupervised learning and reinforcement learning for clustering the days of each season and extracting the AG of each season from real data. First, missing and outlier data are corrected using a preprocessing step. Then, using the K-means algorithm, the data of each season are categorized into desired clusters, and the AG of each cluster is extracted using the ε-greedy algorithm. After extracting the AG of each cluster, different load hours are checked on the AG, and the DRP is implemented based on different load hours for each cluster, and the improved days are determined in terms of load factor. The simulation results show that for the exploration rate factor of 0.1 (ε = 10%), 93.39 percent, 95.69 percent, 97.77 percent, and 91.01 percent of days in spring, summer, autumn, and winter improve in terms of load factor. The proposed method can be a suitable tool for extracting the AGs of each cluster and implementing the DRP to improve the load factor of power consumers.
Read full abstract