InSAR and optical techniques represent two principal approaches for the generation of large-scale Digital Elevation Models (DEMs). Due to the inherent limitations of each technology, a single data source is insufficient to produce high-quality DEM products. The increasing deployment of satellites has generated vast amounts of InSAR and optical DEM data, thereby providing opportunities to enhance the quality of final DEM products through the more effective utilization of the existing data. Previous research has established that complete DEMs generated by InSAR technology can be combined with optical DEMs to produce a fused DEM with enhanced accuracy and reduced noise. Traditional DEM fusion methods typically employ weighted averaging to compute the fusion results. Theoretically, if the weights are appropriately selected, the fusion outcome can be optimized. However, in practical scenarios, DEMs frequently lack prior information on weights, particularly precise weight data. To address this issue, this study adopts a fully connected artificial neural network for elevation fusion prediction. This approach represents an advancement over existing neural network models by integrating local elevation and terrain as input features and incorporating curvature as an additional terrain characteristic to enhance the representation of terrain features. We also investigate the impact of terrain factors and local terrain feature as training features on the fused elevation outputs. Finally, three representative study areas located in Oregon, USA, and Macao, China, were selected for empirical validation. The terrain data comprise InSAR DEM, AW3D30 DEM, and Lidar DEM. The results indicate that compared to traditional neural network methods, the proposed approach improves the Root-Mean-Squared Error (RMSE) ranges, from 5.0% to 12.3%, and the Normalized Median Absolute Deviation (NMAD) ranges, from 10.3% to 26.6%, in the test areas, thereby validating the effectiveness of the proposed method.