Suppressing random noise is critical for revealing real subsurface structures. Convolutional neural networks (CNNs), the leading seismic data denoising methods, excel at extracting local features but struggle to capture global representations. Unet can extract and reuse multi-scale features, aiding in the precise detection of details and semantic information; however, being based on convolutional operations, it struggles to capture global information. To capture global representations, researchers normally employ Transformers in high-level visual tasks, owing to their self-attention mechanisms. This paper introduces a method for mining multi-scale local and global information based on hybrid-gated Unet (HGUnet), which integrates Transformer, CNN, and Unet architectures to enhance the feature representation capability for seismic random noise suppression tasks. HGUnet comprises hybrid-gated blocks (HGB) embedded within a U-shaped architecture, employing a concurrent structure of Octave convolution and lightweight multi-head self-attention mechanism to efficiently extract multi-scale local and global features simultaneously. Moreover, at the conclusion of the HGB, to precisely leverage information and reduce computing costs, a gated feedforward network is designed to retain valuable information and prune redundancies for feature fusion. Synthetic and field experimental results demonstrate that HGUnet improves denoising quality over traditional and CNN methods without adding significant computing costs.