Abstract
Hadoop distributed file system (HDFS) has been widely used in deep reinforcement learning due to its high reliability and high performance. However, the dataset for training deep reinforcement learning models is usually composed of a large number of small files, which will cause the severe performance degradation of HDFS. Although many methods have been proposed to address small files problem in HDFS, none of them considers correlations between small files for training deep reinforcement learning models. To improve the learning performance for deep reinforcement learning, this paper proposes a new HDFS architecture, which uses a distributed cache mechanism to improve data access efficiency for training deep reinforcement learning models. The experimental results show that the proposed HDFS with distributed cache mechanism can achieve better learning performance compared to the traditional HDFS without cache and with centralized caching mechanism.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.