Decentralized baseband processing (DBP) architecture, which partitions the base station antennas into multiple antenna clusters, has been recently proposed to alleviate the excessively high interconnect bandwidth, chip input/output data rates, and detection complexity for massive multi-user multiple-input multiple-output (MU-MIMO) systems. In this paper, we propose a novel decentralized Gaussian message passing (GMP) detection for the DBP architecture. Based on the message passing rule, each antenna cluster iteratively calculated the local means and variances which are fused to generate the global symbol beliefs for demodulation. The state evolution framework of the decentralized GMP algorithm is presented under the assumptions of large-system limit and Gaussian sources. Analytical results corroborated by simulations demonstrate that the nonuniform antenna cluster partition scheme exhibits a higher convergence rate than the uniform counterpart. Simulation results illustrate that the proposed decentralized GMP detection outperforms the recently proposed decentralized algorithms.