Video Super-resolution Research Articles

Abstract Visual vibration measurement has emerged in the field of structural health monitoring in recent years, but it still has some shortcomings in terms of resolution, recognition rate and real-time performance. Considering the three aspects of recovering high-frequency image details, improving the compactness of the target bounding box, and reducing the computational time, we use the constructed image super-resolution reconstruction model and target detection model to measure the vibration displacement of the bridge structural model. First, we integrate the Transformer module into the Unet network with a simple structure. The Swin and Global Transformer Unet (SGTU) module constructed in this form can reduce the computational cost while reconstructing the large-resolution feature map target, and it can sharply edge information of the vibration target. We use the framework of the YOLOv5 algorithm as the backbone, and use the GhostBottleneck (GB) module to reduce the time for convolution operations to generate similar features. In addition, the proposed DWCBottleneck (DWCB) fusion module is also able to achieve high-level semantic fusion and network depth expansion with minimal computational cost. Finally, the center point offset of the bounding box predicted by the model can be used to obtain the displacement offset of the object in the image sequence. The position information of the target in the first frame image is used as the reference frame for calculating the offset, and the vibration displacement of the flexible structure in the image coordinate system is obtained by calculating the deviation of the displacement between the remaining frames and the first frame. We perform qualitative and quantitative comparisons in three aspects: video super-resolution reconstruction, visual detection robustness, and sensor vibration measurement displacement using a homemade vibration image dataset. The time-frequency domain displacement curves regressed by the visual vibration measurement algorithm are compared with the curves acquired after accelerometer acquisition, indicating the necessity of super-resolution reconstruction in visual vibration measurement.&#xD;

AbstractIn video streaming, bandwidth constraints significantly affect client-side video quality. Addressing this, deep neural networks offer a promising avenue for implementing video super-resolution (VSR) at the user end, leveraging advancements in modern hardware, including mobile devices. The principal challenge in VSR is the computational intensity involved in processing temporal/spatial video data. Conventional methods, uniformly processing entire scenes, often result in inefficient resource allocation. This is evident in the over-processing of simpler regions and insufficient attention to complex regions, leading to edge artifacts in merged regions. Our innovative approach employs semantic segmentation and spatial frequency-based categorization to divide each video frame into regions of varying complexity: simple, medium, and complex. These are then processed through an efficient incremental model, optimizing computational resources. A key innovation is the sparse temporal/spatial feature transformation layer, which mitigates edge artifacts and ensures seamless integration of regional features, enhancing the naturalness of the super-resolution outcome. Experimental results demonstrate that our method significantly boosts VSR efficiency while maintaining effectiveness. This marks a notable advancement in streaming video technology, optimizing video quality with reduced computational demands. This approach, featuring semantic segmentation, spatial frequency analysis, and an incremental network structure, represents a substantial improvement over traditional VSR methodologies, addressing the core challenges of efficiency and quality in high-resolution video streaming.

Video Super-resolution Research Articles

Related Topics

Articles published on Video Super-resolution

TempDiff: Enhancing Temporal‐awareness in Latent Diffusion for Real‐World Video Super‐Resolution

A lightweight distillation recurrent convolution network on FPGA for real-time video super-resolution

Continuous Space-Time Video Super-Resolution with Multi-Stage Motion Information Reorganization

Vibration displacement measurement of bridge structural models using image super-resolution reconstruction and visual object detection network

FDDCC-VSR: a lightweight video super-resolution network based on deformable 3D convolution and cheap convolution

SR4KVQA: Video quality assessment database and metric for 4K super-resolution

Real-world video superresolution enhancement method based on the adaptive down-sampling model

Combining optical flow and Swin Transformer for Space-Time video super-resolution

Recurrent feature supplementation network for video super-resolution

Space-time video super-resolution via multi-scale feature interpolation and temporal feature fusion

STDAN: Deformable Attention Network for Space-Time Video Super-Resolution.

A ‘deep’ review of video super-resolution

Semantic guidance incremental network for efficiency video super-resolution

Self-Supervised Deep Blind Video Super-Resolution.

Bidirectional scale-aware upsampling network for arbitrary-scale video super-resolution

MSTG: Multi-Scale Transformer with Gradient for joint spatio-temporal enhancement

Deep Compressed Video Super-Resolution With Guidance of Coding Priors

A Database and Model for the Visual Quality Assessment of Super-Resolution Videos

Looking beyond input frames: Self-supervised adaptation for video super-resolution

Enhanced Video Super-Resolution Network towards Compressed Data

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Video Super-resolution Research Articles

Related Topics

Articles published on Video Super-resolution

TempDiff: Enhancing Temporal‐awareness in Latent Diffusion for Real‐World Video Super‐Resolution

A lightweight distillation recurrent convolution network on FPGA for real-time video super-resolution

Continuous Space-Time Video Super-Resolution with Multi-Stage Motion Information Reorganization

Vibration displacement measurement of bridge structural models using image super-resolution reconstruction and visual object detection network

FDDCC-VSR: a lightweight video super-resolution network based on deformable 3D convolution and cheap convolution

SR4KVQA: Video quality assessment database and metric for 4K super-resolution

Real-world video superresolution enhancement method based on the adaptive down-sampling model

Combining optical flow and Swin Transformer for Space-Time video super-resolution

Recurrent feature supplementation network for video super-resolution

Space-time video super-resolution via multi-scale feature interpolation and temporal feature fusion

STDAN: Deformable Attention Network for Space-Time Video Super-Resolution.

A ‘deep’ review of video super-resolution

Semantic guidance incremental network for efficiency video super-resolution

Self-Supervised Deep Blind Video Super-Resolution.

Bidirectional scale-aware upsampling network for arbitrary-scale video super-resolution

MSTG: Multi-Scale Transformer with Gradient for joint spatio-temporal enhancement

Deep Compressed Video Super-Resolution With Guidance of Coding Priors

A Database and Model for the Visual Quality Assessment of Super-Resolution Videos

Looking beyond input frames: Self-supervised adaptation for video super-resolution

Enhanced Video Super-Resolution Network towards Compressed Data