Information In Fused Image Research Articles

In both civil and military fields, remote sensing image fusion is a popular method for improving images. Multimodal image fusion is typically employed in remote sensing image fusion. Multimodal image fusion acquires synthetic images containing rich image information by fusing image information from different wavebands. Current fusion networks concentrate on fusing features at a single image scale, and the resulting image lacks either spatial or texture features. To address these problems and achieve high-quality fusion, we propose MCnet (multiscale network). MCnet is a new method for multimodal image fusion of visible remote sensing images and infrared remote sensing images in a multiscale framework. Specifically, MCnet first deals with the fusion of image features at different scales in a coarse-to-fine manner. Second, MCnet adaptively provides the amount of information in each image at different scales and images for fine fusion feature supplementation in the coarse fusion stage. In the step of fine fusion, the outcomes of the previous stage will be supplied with the missing characteristics. Finally, we design an objective function with three components: structure loss, region loss, and image quality loss. Structure loss and region loss maintain convergence on overall image similarity and region feature similarity. The image quality constraint partially mitigates the effect of low-quality results on model convergence. MCnet emphasizes the texture features and edge contours of the results, which not only boost the quality of the fusion results but also cause the images to show better discriminative properties. We conduct sufficient experiments based on VIS (visible) and IR (infrared) datasets. The results demonstrate that our proposed model achieves state-of-the-art performance. We also conduct generalizability studies on the proposed method, which likewise yield positive results, demonstrating that MCnet is successful and applicable in a variety of situations.

Read full abstract

We consider the problem of estimating 3-D human body pose from visual signals within a discriminative framework. It is challenging because there is a wide gap between complex 3-D human motion and planar visual observation, which makes this a severely ill-conditioned problem. In this paper, we focus on three critical factors to tackle human body pose estimation, namely, feature extraction, learning algorithm, and camera utilization. On the feature level, we describe images using the salient interest points represented by scale-invariant feature transform (SIFT)-like descriptors, in which the position, appearance, and local structural information are encoded simultaneously. On the learning algorithm level, we propose to use Gaussian processes and multiple linear (ML) regression to model the mapping between poses and features. Fusing image information from multiple cameras in different views is of great interest to us on the camera level. We make a comprehensive evaluation on the HumanEva database and get two meaningful insights into the three crucial aspects for human pose estimation: 1) although the choice of feature is very important to the problem, once the learning algorithm becomes efficient, the choice of feature is no longer critical, and 2) the impact of information combination from multiple cameras on pose estimation is closely related to not only the quantity of image information, but also its quality. In most cases, it is true that the more information is involved, the better results can be achieved. But when the information quantity is the same, the differences in quality will lead to totally different performance. Furthermore, dense evaluations demonstrate that our approach is an accurate and robust solution to the human body pose estimation problem.

Read full abstract

Information In Fused Image Research Articles

Related Topics

Articles published on Information In Fused Image

Semi-supervised learning advances species recognition for aquatic biodiversity monitoring

Multiple feature fusion transformer for modeling penicillin fermentation process with unequal sampling intervals.

MCnet: Multiscale visible image and infrared image fusion network

3D Object Detection from Point Cloud Based on Deep Learning

融合图片信息的“标题党”新闻识别研究

Maximum a posteriori fusion method based on gradient consistency constraint for multispectral/panchromatic remote sensing images

A probabilistic framework for image information fusion with an application to mammographic analysis

Human Pose Regression Through Multiview Visual Fusion

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Information In Fused Image Research Articles

Related Topics

Articles published on Information In Fused Image

Semi-supervised learning advances species recognition for aquatic biodiversity monitoring

Multiple feature fusion transformer for modeling penicillin fermentation process with unequal sampling intervals.

MCnet: Multiscale visible image and infrared image fusion network

3D Object Detection from Point Cloud Based on Deep Learning

融合图片信息的“标题党”新闻识别研究

Maximum a posteriori fusion method based on gradient consistency constraint for multispectral/panchromatic remote sensing images

A probabilistic framework for image information fusion with an application to mammographic analysis

Human Pose Regression Through Multiview Visual Fusion