Ultra-wideband technology has become increasingly prevalent in localization systems, particularly with the emergence of multi-antenna systems capable of estimating both distance and angle of arrival (AoA) for incoming signals. However, most scientific research analyzes the accuracy of AoA in one specific environment for which the estimator is trained. In this paper, we analyze the performance of various AoA estimation algorithms, such as deep convolutional neural networks (DCNN), multiple signal classification (MUSIC), and phase difference of arrival (PDoA), in unseen environments. Prior work already demonstrated the superior performance of ML for AoA estimation compared to PDoA or MUSIC. We show that MUSIC, PDoA and ML solutions suffer from degradation in unseen environments at the 90th percentile of error, with ML-based AoA estimation degrading by about 14 degrees in unseen environments compared to 4 degrees for PDoA. We demonstrate that while PDoA more effectively corrects AoA at the median level in unseen environments, ML-based methods excel at correcting higher-percentile AoA errors, including outliers. Finally, we propose a novel framework to further improve the correction of AoA outliers for DCNN-based AoA estimators using data augmentation and transfer learning, resulting in a median angular error of only 5 degrees in unseen environments, even considering a field of view up to 90 degrees.