Published in last 50 years
Articles published on Multi-scale Features
- New
- Research Article
- 10.3390/s25216688
- Nov 1, 2025
- Sensors
- Xiaohui Li + 4 more
Cross-scene classification of hyperspectral images poses significant challenges due to the lack of a priori knowledge and the differences in data distribution across scenes. While traditional studies have had limited use of a priori knowledge from other modalities, recent advancements in pre-trained large-scale language-vision models have shown strong performance on various downstream tasks, highlighting the potential of cross-modal assisted learning. In this paper, we propose a Semantic-aware Collaborative Parallel Network (SCPNet) to mitigate the impact of data distribution differences by incorporating linguistic modalities to assist in learning cross-domain invariant representations of hyperspectral images. SCPNet uses a parallel architecture consisting of a spatial–spectral feature extraction module and a multiscale feature extraction module, designed to capture rich image information during the feature extraction phase. The extracted features are then mapped into an optimized semantic space, where improved supervised contrastive learning clusters image features from the same category together while separating those from different categories. Semantic space bridges the gap between visual and linguistic modalities, enabling the model to mine cross-domain invariant representations from the linguistic modality. Experimental results demonstrate that SCPNet significantly outperforms existing methods on three publicly available datasets, confirming its effectiveness for cross-scene hyperspectral image classification tasks.
- New
- Research Article
- 10.1016/j.engappai.2025.111993
- Nov 1, 2025
- Engineering Applications of Artificial Intelligence
- Liyuan Pan + 8 more
MambaRSIS: Context-aware multi-scale feature aggregation with selective state space model for remote sensing instance segmentation
- New
- Research Article
- 10.2352/j.imagingsci.technol.2025.69.6.060401
- Nov 1, 2025
- Journal of Imaging Science and Technology
- Shang Xinping + 3 more
Low-Light Image Enhancement Method based on Multiscale Feature Fusion
- New
- Research Article
- 10.1016/j.engappai.2025.111608
- Nov 1, 2025
- Engineering Applications of Artificial Intelligence
- Hanyu Zhang + 4 more
An attention-guided multi-scale feature cascade network for underwater fish counting
- New
- Research Article
- 10.1109/tpami.2025.3593283
- Nov 1, 2025
- IEEE transactions on pattern analysis and machine intelligence
- Zhaokai Wang + 10 more
Image pyramids are widely adopted in top-performing methods to obtain multi-scale features for precise visual perception and understanding. However, current image pyramids use the same large-scale model to process multiple resolutions of images, leading to significant computational cost. To address this challenge, we propose a novel network architecture, called Parameter-Inverted Image Pyramid Networks (PIIP). Specifically, PIIP uses pretrained models (ViTs or CNNs) as branches to process multi-scale images, where images of higher resolutions are processed by smaller network branches to balance computational cost and performance. To integrate information from different spatial scales, we further propose a novel cross-branch feature interaction mechanism. To validate PIIP, we apply it to various perception models and a representative multimodal large language model called LLaVA, and conduct extensive experiments on various tasks such as object detection, segmentation, image classification and multimodal understanding. PIIP achieves superior performance compared to single-branch and existing multi-resolution approaches with lower computational cost. When applied to InternViT-6B, a large-scale vision foundation model, PIIP can improve its performance by 1%-2% on detection and segmentation with only 40%-60% of the original computation, finally achieving 60.0 box AP on MS COCO and 59.7 mIoU on ADE20 K. For multimodal understanding, our PIIP-LLaVA achieves 73.0% accuracy on TextVQA and 74.5% on MMBench with only 2.8 M training data.
- New
- Research Article
- 10.1049/icp.2025.3575
- Nov 1, 2025
- IET Conference Proceedings
- Wei Dai + 3 more
A YOLOv11-based crack detection method for stamped parts with the use of multiscale feature fusion and adaptive downsampling
- New
- Research Article
- 10.1016/j.patrec.2025.08.005
- Nov 1, 2025
- Pattern Recognition Letters
- Zhanqiang Huo + 4 more
VMamba-Crowd: Bridging multi-scale features from Visual Mamba for weakly-supervised crowd counting
- New
- Research Article
- 10.1016/j.epsr.2025.111911
- Nov 1, 2025
- Electric Power Systems Research
- Haijun Xiong + 4 more
Multiscale feature decoupling via VMD and dual-channel networks for dissolved gas prediction in transformers
- New
- Research Article
- 10.1016/j.energy.2025.138586
- Nov 1, 2025
- Energy
- Wenyang Wang + 4 more
A deep learning framework for global transportation energy carbon emission forecasting: integrating generative pre-trained transformer with multi-scale feature analysis
- New
- Research Article
- 10.3390/s25216664
- Nov 1, 2025
- Sensors
- Zhao Zhang + 5 more
Accurate pose estimation of non-cooperative space objects is crucial for applications such as satellite maintenance, space debris removal, and on-orbit assembly. However, monocular pose estimation methods face significant challenges in environments with limited visibility. Different from the traditional pose estimation methods that use images from a single band as input, we propose a novel deep learning-based pose estimation framework for non-cooperative space objects by fusing visible and infrared images. First, we introduce an image fusion subnetwork that integrates multi-scale features from visible and infrared images into a unified embedding space, preserving the detailed features of visible images and the intensity information of infrared images. Subsequently, we design a robust pose estimation subnetwork that leverages the rich information from the fused images to achieve accurate pose estimation. By combining these two subnetworks, we construct the Visible and Infrared Fused Pose Estimation Framework (VIPE) for non-cooperative space objects. Additionally, we present a Bimodal-Vision Pose Estimation (BVPE) dataset, comprising 3,630 visible-infrared image pairs, to facilitate research in this domain. Extensive experiments on the BVPE dataset demonstrate that VIPE significantly outperforms existing monocular pose estimation methods, particularly in complex space environments, providing more reliable and accurate pose estimation results.
- New
- Research Article
- 10.1016/j.jag.2025.104874
- Nov 1, 2025
- International Journal of Applied Earth Observation and Geoinformation
- Jushuang Qin + 6 more
3D-M2C-ResNet: A Multi-Scale feature enhancement and fusion model for Fine-Scale tree species classification in urban forests
- New
- Research Article
- 10.1016/j.eswa.2025.130335
- Nov 1, 2025
- Expert Systems with Applications
- Shaokang Dong + 1 more
LMFENet: A Hybrid Local-Global and Multi-Scale Feature Extraction Network for Oil Spill Type Classification Using Sentinel-1 Imagery
- New
- Research Article
- 10.1016/j.engappai.2025.111496
- Nov 1, 2025
- Engineering Applications of Artificial Intelligence
- Sheng Xie + 5 more
Multi-scale Feature Refinement via Perspective Scaling and Adaptive Regularization for text-based person search
- New
- Research Article
- 10.1016/j.neucom.2025.131081
- Nov 1, 2025
- Neurocomputing
- Yu Chen + 8 more
Cascaded embedded-FPN: A cross-modality multi-scale feature fusion network for varied-sized objects semantic segmentation
- New
- Research Article
- 10.1016/j.neucom.2025.131039
- Nov 1, 2025
- Neurocomputing
- Wei Guo + 5 more
SMAFusion: Multimodal medical image fusion based on spatial registration and local-global multi-scale feature adaptive fusion
- New
- Research Article
- 10.1016/j.medengphy.2025.104409
- Nov 1, 2025
- Medical engineering & physics
- Zixuan Zhai + 2 more
Fast geometric deep learning for intraoperative soft tissue deformation estimation: Towards real-time AR guidance in liver surgery.
- New
- Research Article
- 10.1016/j.bios.2025.117830
- Nov 1, 2025
- Biosensors & bioelectronics
- Huquan Zhu + 4 more
Dark-field intelligent detection of V. parahaemolyticus using T4 bacteriophage displaying tail spike proteins and gold nanoparticles.
- New
- Research Article
- 10.1016/j.neunet.2025.107782
- Nov 1, 2025
- Neural networks : the official journal of the International Neural Network Society
- Zengnan Wang + 4 more
S-YOLO: An enhanced small object detection method based on adaptive gating strategy and dynamic multi-scale focus module.
- New
- Research Article
- 10.1016/j.engappai.2025.111900
- Nov 1, 2025
- Engineering Applications of Artificial Intelligence
- Quanyu Zhong + 5 more
A cross-domain multi-scale feature fusion network based on graph convolution for intelligent fault diagnosis
- New
- Research Article
- 10.1016/j.compag.2025.110785
- Nov 1, 2025
- Computers and Electronics in Agriculture
- Hao Bai + 3 more
A self-supervised model based on sequence information and multi-scale features advances agricultural soil CT image segmentation