Abstract Information-theoretic motion planning and machine learning through Bayesian inference are exploited to localize and track a dynamic radio frequency (RF) emitter with unknown waveform (uncooperative target). A target-state estimator handles non-Gaussian distributions, while mutual information is utilized to coordinate the motion control of a network of mobile sensors (agents) to minimize measurement uncertainty. The mutual information is computed for pairs of sensors through a four-permutation-with-replacement process. The information surfaces are combined to create a composite map, which is then used by agents to plan their motion for more efficient and effective target estimation and tracking. Simulations and physical experiments involving micro-aerial vehicles with time difference of arrival (TDOA) measurements are performed to evaluate the performance of the algorithm. Results show that when two or three agents are used, the algorithm outperforms state-of-the-art methods. Results also show that for four or more agents, the performance is as competitive as an idealized static sensor network.