Articles published on Variable Bitrate Coding
Authors
Select Authors
Journals
Select Journals
Duration
Select Duration
9 Search results
Sort by Recency
- Research Article
21
- 10.1109/tcsvt.2022.3230843
- Jun 1, 2023
- IEEE Transactions on Circuits and Systems for Video Technology
- Shurun Wang + 3 more
There has been an increasing consensus that the machine vision is gradually replacing human vision in numerous tasks, with the demonstrated success of artificial intelligence. In this paper, we propose a deep image compression scheme towards machine vision, with the principle of “begin with the end in mind”. In particular, a unified optimization scheme for end-to-end image compression towards machine vision is proposed, accompanied with the dedicated variable bitrate coding and generalized rate-accuracy optimization. The presented framework, which jointly optimizes the compression and the machine vision networks, exploits the utmost potential of robust machine vision for compressed images. The variable bitrate modules towards machine vision, which effectively shrink the storage space for model parameters, are further developed to accommodate to the real-world applications. Moreover, an iterative algorithm is presented to achieve the optimality in terms of the generalized rate-accuracy towards machine vision. Experimental results show that the proposed framework achieves the state-of-the-art object detection performance among the end-to-end image compression methods: in the exploration of Video Coding for Machines (VCM) in Moving Picture Experts Group (MPEG), and the proposed framework achieves 31.69% and 23.96% BD-rate gains compared with the VCM official test datasets, the Open Images dataset and the TVD dataset respectively, which are generated using the state-of-the-art standard Versatile Video Coding (VVC) standard. The generalization capability of the proposed framework is also verified with instance segmentation under various scenarios.
- Research Article
218
- 10.1109/tpami.2020.2988453
- Apr 20, 2020
- IEEE Transactions on Pattern Analysis and Machine Intelligence
- Guo Lu + 5 more
Traditional video compression approaches build upon the hybrid coding framework with motion-compensated prediction and residual transform coding. In this paper, we propose the first end-to-end deep video compression framework to take advantage of both the classical compression architecture and the powerful non-linear representation ability of neural networks. Our framework employs pixel-wise motion information, which is learned from an optical flow network and further compressed by an auto-encoder network to save bits. The other compression components are also implemented by the well-designed networks for high efficiency. All the modules are jointly optimized by using the rate-distortion trade-off and can collaborate with each other. More importantly, the proposed deep video compression framework is very flexible and can be easily extended by using lightweight or advanced networks for higher speed or better efficiency. We also propose to introduce the adaptive quantization layer to reduce the number of parameters for variable bitrate coding. Comprehensive experimental results demonstrate the effectiveness of the proposed framework on the benchmark datasets.
- Research Article
- 10.7776/ask.2011.30.8.461
- Nov 30, 2011
- The Journal of the Acoustical Society of Korea
- Chung-Soo Lim + 1 more
Accurate classification of input signals is the key prerequisite for variable bit-rate coding, which has been introduced in order to effectively utilize limited communication bandwidth. Especially, recent surge of multimedia services elevate the importance of speech/music classification. Among many speech/music classifier, the ones based on support vector machine (SVM) have a strong selling point, high classification accuracy, but their computational complexity and memory requirement hinder their way into actual implementations. Therefore, techniques that reduce the computational complexity and the memory requirement is inevitable, particularly for embedded systems. We first analyze implementation of an SVM-based classifier on embedded systems in terms of execution time and energy consumption, and then propose two techniques that alleviate the implementation requirements: One is a technique that removes support vectors that have insignificant contribution to the final classification, and the other is to skip processing some of input signals by virtue of strong correlations in speech/music frames. These are post-processing techniques that can work with any other optimization techniques applied during the training phase of SVM. With experiments, we validate the proposed algorithms from the perspectives of classification accuracy, execution time, and energy consumption.
- Research Article
11
- 10.1006/jvci.1994.1006
- Mar 1, 1994
- Journal of Visual Communication and Image Representation
- Christine Guillemot + 1 more
Layered Coding Schemes for Video Transmission on ATM Networks
- Research Article
19
- 10.1109/49.219556
- Apr 1, 1993
- IEEE Journal on Selected Areas in Communications
- M.A Saleh + 2 more
Variable bit rate (VBR) coding techniques have received great research interest as very promising tools for transmitting bursty multimedia traffic with low bandwidth requirements over a communication link. Statistically multiplexing the multimedia bursty traffic is a very efficient method of maximizing the utilization of the link capacity. The application of computer simulation techniques in analyzing a rate-based access control scheme for multimedia traffic such as voice traffic is discussed. The control scheme regulates the packetized bursty traffic at the user network interface of the link. Using a suitable congestion measure, namely, the multiplexer buffer length, the scheme dynamically controls the arrival rate by switching the coder to a different compression ratio (i.e., changing the coding rate). VBR coding methods can be adaptively adjusted to transmit at a lower rate with very little degradation in the voice quality. Reported results prove that the scheme greatly improves the link performance, in terms of reducing the probability of call blocking and enhancing the statistical multiplexing gain.< <ETX xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">></ETX>
- Research Article
2
- 10.1016/0923-5965(91)90037-3
- Sep 1, 1991
- Signal Processing: Image Communication
- Jiro Katto
Variable bit-rate coding based on human visual system
- Research Article
16
- 10.1016/0923-5965(91)90005-m
- Jun 1, 1991
- Signal Processing: Image Communication
- Stefano Tubaro
A two layers video coding scheme for ATM networks
- Research Article
134
- 10.1109/49.32339
- Jun 1, 1989
- IEEE Journal on Selected Areas in Communications
- W Verbiest + 1 more
The bandwidth flexibility offered by the asynchronous transfer mode (ATM) technique makes it possible to select picture quality and bandwidth over a wide range in a simple and straightforward manner. A prototype model of a video codec was developed that demonstrates the feasibility of both variable bit rate (VBR) coding and user-selectable picture quality. The VBR coding algorithm is discussed and it is shown how a stabilized quality is achieved and how this quality and associated bandwidth can be selected by the user. How error propagation is limited to reduce the visibility of cell losses is also discussed. Interfaces with the ATM network are analyzed, with emphasis on decoder synchronization and absorption of cell delay jitter. The VBR codec offers very good picture quality for videophony applications at an equivalent load of 5.9 Mb/s. Picture quality remains relatively constant, even for heavy motion. >
- Research Article
155
- 10.1109/49.32343
- Jun 1, 1989
- IEEE Journal on Selected Areas in Communications
- F Kishino + 3 more
Statistical characteristics of video signals for video packet coding, are clarified and a variable-bit-rate coding method for asynchronous transfer mode (ATM) networks is described that is capable of compensating for packet loss. ATM capabilities are shown to be greatly affected by delay, delay jitter, and packet loss probability. Packet loss has the greatest influence on picture quality. Packets may be lost either due to random bit error in a cell header or to network control when traffic is congested. A layered coding technique using discrete-cosine transform (DCT) coding is presented which is suitable for packet loss compensation. The influence of packet loss on picture quality is discussed, and decoded pictures with packet loss are shown. The proposed algorithm was verified by computer simulations.< <ETX xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">></ETX>