The issue of air pollution from transportation sources remains a major concern, particularly the emissions from heavy-duty diesel vehicles, which pose serious threats to ecosystems and human health. China VI emission standards mandate On-Board Diagnostics (OBD) systems in heavy-duty diesel vehicles for real-time data transmission, yet the current data quality, especially concerning crucial parameters like NOx output, remains inadequate for effective regulation. To address this, a novel approach integrating Multimodal Feature Fusion with Particle Swarm Optimization (OBD-PSOMFF) is proposed. This network employs Long Short-Term Memory (LSTM) networks to extract features from OBD indicators, capturing temporal dependencies. PSO optimizes feature weights, enhancing prediction accuracy. Testing on 23 heavy-duty vehicles demonstrates significant improvements in predicting NOx and CO2 mass emission rates, with mean squared errors reduced by 65.205 % and 70.936 % respectively compared to basic LSTM models. This innovative multimodal fusion method offers a robust framework for emission prediction, crucial for effective vehicle emission regulation and environmental preservation.