Conventional Video Codecs Research Articles

Several real-time visual monitoring applications such as surveillance, mental state monitoring, driver drowsiness and patient care, require equipping high-quality cameras with wireless sensors to form visual sensors and this creates an enormous amount of data that has to be managed and transmitted at the sensor node. Moreover, as the sensor nodes are battery-operated, power utilization is one of the key concerns that must be considered. One solution to this issue is to reduce the amount of data that has to be transmitted using specific compression techniques. The conventional compression standards are based on complex encoders (which require high processing power) and simple decoders and thus are not pertinent for battery-operated applications, i.e., VSN (primitive hardware). In contrast, compressive sensing (CS) a distributive source coding mechanism, has transformed the standard coding mechanism and is based on the idea of a simple encoder (i.e., transmitting fewer data-low processing requirements) and a complex decoder and is considered a better option for VSN applications. In this paper, a CS-based joint decoding (JD) framework using frame prediction (using keyframes) and residual reconstruction for single-view video is proposed. The idea is to exploit the redundancies present in the key and non-key frames to produce side information to refine the non-key frames’ quality. The proposed method consists of two main steps: frame prediction and residual reconstruction. The final reconstruction is performed by adding a residual frame with the predicted frame. The proposed scheme was validated on various arrangements. The association among correlated frames and compression performance is also analyzed. Various arrangements of the frames have been studied to select the one that produces better results. The comprehensive experimental analysis proves that the proposed JD method performs notably better than the independent block compressive sensing scheme at different subrates for various video sequences with low, moderate and high motion contents. Also, the proposed scheme outperforms the conventional CS video reconstruction schemes at lower subrates. Further, the proposed scheme was quantized and compared with conventional video codecs (DISCOVER, H-263, H264) at various bitrates to evaluate its efficiency (rate-distortion, encoding, decoding).

Read full abstract

MPEG-2, MPEG-4와 같은 기존의 비디오 코덱에서는 인터 예측을 수행할 때 고정된 해상도의 움직임 벡터를 사용한다. 그러나 KTA 참조 소프트웨어에서는 움직임 벡터의 해상도를 슬라이스 단위로 선택하여 사용할 수 있는 기능을 지원한다. 그러나 선택된 하나의 움직임 벡터 해상도를 슬라이스 전체에 일괄적으로 적용하기 때문에 영상의 국지적인 특성을 반영하는데 어려움이 있다. 본 논문에서는 탐색 구간에 따라 적응적으로 움직임 벡터의 해상도를 결정하는 방법을 제안한다. 움직임 벡터의 탐색 영역을 움직임 벡터가 예측 움직임 벡터로부터 떨어진 거리에 따라 다수개의 구간으로 분할하고, 각 구간에 대하여 하나의 움직임 벡터 해상도를 할당하여 움직임 예측에 적용한다. 따라서 제안하는 방법의 부호화 효율은 각 구간을 분할하는 Threshold와 움직임 벡터를 부호화하는 엔트로피 코딩 방법에 영향을 받는다. HEVC의 참조 소프트웨어인 HM3.0을 이용하여 실험한 결과, Random Access 부호화 구조에서는 평균적으로 약 0.9%의 성능 향상을 얻을 수 있었으며, Low Delay 부호화 구조에 B picture를 적용한 경우는 약 0.6%, P picture를 적용한 경우에서는 약 2.7%의 평균 발생 비트량 감소를 확인할 수 있었다. In most conventional video codecs, such as MPEG-2 and MPEG-4, inter coding is performed with the fixed motion vector resolution. When KTA software was developed, resolution for MVs can be selected in each slice. Although KTA codec uses a variety of resolutions for ME, the selected resolution is applied over the entire pixels in the slice and the statistical property of the local area is not considered. In this paper, we propose an adaptive decision scheme for motion vector resolution which depends on region, where MV search area is divided to multiple regions according to the distance from PMV. In each region, the assigned resolution is used to estimate MV. Each region supports different resolution for ME from other regions. The efficiency of the proposed scheme is affected from threshold values to divide the search area and the entropy coding method to encode the estimated MV. Simulation results with HM3.0 which is the reference software of HEVC show that the proposed scheme provides bit rate gains of 0.9%, 0.6%, and 2.9% in Random Access, Low Delay with B picture, and Low Delay with P picture structures, respectively.

Read full abstract

Conventional Video Codecs Research Articles

Related Topics

Articles published on Conventional Video Codecs

Learned scalable video coding for humans and machines

NN-VVC: A Hybrid Learned-Conventional Video Codec Targeting Humans and Machines

Neural compression for hologram images and videos.

End-to-End Rate-Distortion Optimized Learned Hierarchical Bi-Directional Video Compression.

Deep Predictive Video Compression Using Mode-Selective Uni- and Bi-Directional Predictions Based on Multi-Frame Hypothesis

Block Compressive Sensing Single-View Video Reconstruction Using Joint Decoding Framework for Low Power Real Time Applications

3D Motion Estimation and Compensation Method for Video-Based Point Cloud Compression

Contextual Homogeneity-Based Patch Decomposition Method for Higher Point Cloud Compression

Utilising Low Complexity CNNs to Lift Non-Local Redundancies in Video Coding.

Distributed video coding with adaptive two-step side information generation for smart and interactive media

A New Progressively Refined Wyner-Ziv Video Coding for Low-Power Human-Centered Telehealth

Frame differencing-based segmentation for low bit rate video codec using H.264

Improving low bitrate video coding via computation incorporating a priori information

ADAPTIVE GOP STRUCTURE TO H.264/AVC BASED ON SCENE CHANGE

HEVC 고성능 압축 도구들의 성능 분석을 통한 스크린 콘텐츠 응용 최적 부호화 모델

적응적인 움직임 벡터 해상도를 이용한 움직임 벡터 부호화 방법

Subsampled Block-Matching for Zoom Motion Compensated Prediction

Novel prediction schemes for error resilient video coding

3D motion estimation for depth image coding in 3D video coding

Distributed Video Coding Using LDPC Codes for Wireless Video

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Conventional Video Codecs Research Articles

Related Topics

Articles published on Conventional Video Codecs

Learned scalable video coding for humans and machines

NN-VVC: A Hybrid Learned-Conventional Video Codec Targeting Humans and Machines

Neural compression for hologram images and videos.

End-to-End Rate-Distortion Optimized Learned Hierarchical Bi-Directional Video Compression.

Deep Predictive Video Compression Using Mode-Selective Uni- and Bi-Directional Predictions Based on Multi-Frame Hypothesis

Block Compressive Sensing Single-View Video Reconstruction Using Joint Decoding Framework for Low Power Real Time Applications

3D Motion Estimation and Compensation Method for Video-Based Point Cloud Compression

Contextual Homogeneity-Based Patch Decomposition Method for Higher Point Cloud Compression

Utilising Low Complexity CNNs to Lift Non-Local Redundancies in Video Coding.

Distributed video coding with adaptive two-step side information generation for smart and interactive media

A New Progressively Refined Wyner-Ziv Video Coding for Low-Power Human-Centered Telehealth

Frame differencing-based segmentation for low bit rate video codec using H.264

Improving low bitrate video coding via computation incorporating a priori information

ADAPTIVE GOP STRUCTURE TO H.264/AVC BASED ON SCENE CHANGE

HEVC 고성능 압축 도구들의 성능 분석을 통한 스크린 콘텐츠 응용 최적 부호화 모델

적응적인 움직임 벡터 해상도를 이용한 움직임 벡터 부호화 방법

Subsampled Block-Matching for Zoom Motion Compensated Prediction

Novel prediction schemes for error resilient video coding

3D motion estimation for depth image coding in 3D video coding

Distributed Video Coding Using LDPC Codes for Wireless Video