The computational exploration of reactive processes is challenging due to the requirement of thorough sampling across the free energy landscape using accurate ab initio methods. To address these constraints, machine learning potentials are employed, yet their training for this kind of problem is still a laborious and tedious task. In this study, we present an efficient approach to train these potentials by cleverly using a single batch of unbiased trajectories that avoid the pitfalls of trajectories artificially biased along a suboptimal collective variable. This strategy, when integrated with current enhanced sampling techniques, allows to obtain free energy profiles and kinetic rates of ab initio quality, yet dramatically reducing the computational cost.