Abstract

In complex sea environments, underwater bionic flapping foils exhibit significant nonlinear characteristics across a high-dimensional parameter space. This study introduces a novel optimisation paradigm, G-TD3, integrating Twin-Delayed Deep Deterministic (TD3) reinforcement learning with Gaussian process regression (GPR). An automatic experimental device was developed, using which a series of flapping foil propulsion experiments were conducted, exploring both symmetric and asymmetric flutter modes. Utilizing GPR, we characterized the flapping foil's operational environment from experimental data. In addition, the TD3 algorithm was deployed to exploit this environment, enabling interactive learning in high-dimensional, continuous parameter spaces. Our integrated approach facilitates comprehensive exploration-exploitation dynamics. The optimized propulsion strategy, fine-tuned through maximizing acquisition rewards, demonstrates G-TD3's superiority over traditional brute-force methods, requiring less data for optimising both symmetric and asymmetric flutter strategies. This research provides a new optimisation method for nonlinear systems and offers valuable insights into the practical applications of bionic flapping foils.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.