As a mobile platform, unmanned aerial vehicles (UAV) can carry optics and acoustics sensors recording visual and audio information from ground. Over the past decade, UAVs are widely used in different applications, e.g., search and rescue, pipe leakage detection, etc. Compared with visual information, audio information is challenging to capture from UAVs due to the strong ego-noise generated by the rotating motors, propellers on the UAV as well as the motion of the UAV, which result in a low signal to noise ratio (SNR) of the received acoustics signals. With the contaminated signals, the performance of sound source localization algorithms, e.g., conventional beamforming, is severely degraded. To improve the quality of sound source localization result, different sound source enhancement algorithms have been proposed by researchers. In this paper, simulation experiments were conducted to study the potential of apply convolutional neural networks (CNN) to improve the beamforming result. It was found that with CNN, the estimation of a single point sound source location and source strength can be improved. The model's localization accuracy and source strength estimation accuracy were evaluated respectively.