Autonomous navigation in dense traffic scenarios, such as on-ramp forced merging, still poses significant challenges for autonomous vehicles to prevent accidents and alleviate traffic congestion. This paper introduces a novel motion planning framework that combines Interactive Safe Reinforcement Learning (IntSRL) with Nonlinear Model Predictive Control (NMPC). This framework develops an interactive merging planning policy that accounts for the uncertainty of traffic participants, multi-objective optimization and heterogeneous vehicle interactions, in which the upper planner, i.e., IntSRL, furnishes the lower planner, NMPC, with global guidance path and velocity guidance. An Adaptive Safety Governor (ASG) module within IntSRL adjusts potentially unsafe actions by incorporating prior knowledge and driving experience. And a coupling evaluation mechanism for multi-objective optimization is embedded into reward shaping with integration of driving safety and strategy efficiency. We evaluate the proposed controller on various dense traffic scenarios using the proposed Heterogeneous Intelligent Driver Model (H-IDM) considering different driving styles and cooperative willingness of other vehicles. The test results indicate that the proposed method surpasses existing optimization-based and learning-based baselines in qualitative and quantitative measures.