As cities progress into high-quality developments, the demand for urban parks that enhance residents’ well-being and sustainability is increasing. Traditional visual-centric design methods no longer suffice. Given that vision and hearing are the primary sensory pathways through which people perceive their environment, exploring their relationship with landscape experiences offers a novel perspective for optimizing the audiovisual perception quality of urban parks. This study explores the relationship between visual and auditory elements and landscape experiences to optimize urban parks’ sensory quality. Using visual perception, soundscape perception, sound source perception, and behavioral vitality, this study evaluates the audiovisual perception quality of a representative wetland park in Chengdu’s ring ecological zone. By quantifying relationships between audiovisual characteristics, behavioral vitality, and emotional feedback, several emotional assessment models were constructed. The results show that lawns, pavements, and sound pressure levels significantly impact vitality. A sound pressure level of 77 dB has been identified as a critical threshold in emotional perception models. Consequently, distinct emotional prediction models can be employed to enhance landscape design across various sound pressure level zones. This research provides scientific evidence and flexible strategies for designing urban open spaces that improve landscape experiences based on multisensory perception.