Abstract

Data classification storage has emerged as an effective strategy, harnessing the diverse performance attributes of storage devices and orchestrating a harmonious equilibrium between energy consumption, cost considerations, and user accessibility. As research on emerging storage media (e.g. Non-Volatile Memory (NVM)) delves deeper, in scenarios characterized by dynamically evolving storage demands, conventional paradigms of rigid classification strategy and Solid-State Drive (SSD) and Hard Disk Drive (HDD) storage architectures fall short of addressing such complex situations. In this paper, we propose an effective data classification storage method using text seasonal features based on the traditional access frequency analysis strategy. First, to procure richer semantic information, we employ external knowledge actualizing the short-text feature expansion. Then, we leverage the ensemble learning stacking method optimized models to improve the accuracy of data classification based on seasonal features. Additionally, following the seasonal feature, we further classify the data into hot, warm, and cold and place them separately on the NVM, SSD, and HDD, which conserves storage energy consumption and operational costs while ensuring the quality of user access. The experimental results demonstrate that data classification accuracy can reach more than 95.10%, and the energy consumption and operating cost can be reduced by more than 30.22% and 8.73%, respectively.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call