Mean Opinion Score Values Research Articles

This paper presents the result of a recent large-scale subjective study of image retargeting quality on a collection of images generated by several representative image retargeting methods. Owning to many approaches to image retargeting that have been developed, there is a need for a diverse independent public database of the retargeted images and the corresponding subjective scores to be freely available. We build an image retargeting quality database, in which 171 retargeted images (obtained from 57 natural source images of different contents) were created by several representative image retargeting methods. And the perceptual quality of each image is subjectively rated by at least 30 viewers, meanwhile the mean opinion scores (MOS) were obtained. It is revealed that the subject viewers have arrived at a reasonable agreement on the perceptual quality of the retargeted image. Therefore, the MOS values obtained can be regarded as the ground truth for evaluating the quality metric performances. The database is made publicly available (Image Retargeting Subjective Database, [Online]. Available: http://ivp.ee.cuhk.edu.hk/projects/demo/retargeting/index.html) to the research community in order to further research on the perceptual quality assessment of the retargeted images. Moreover, the built image retargeting database is analyzed from the perspectives of the retargeting scale, the retargeting method, and the source image content. We discuss how to retarget the images according to the scale requirement and the source image attribute information. Furthermore, several publicly available quality metrics for the retargeted images are evaluated on the built database. How to develop an effective quality metric for retargeted images is discussed through a specifically designed subjective testing process. It is demonstrated that the metric performance can be further improved, by fusing the descriptors of shape distortion and content information loss.

Read full abstract

Usage of Automatic Speech Recognition (ASR) systems is increasing day-by-day for voice centric applications in mobile handheld and Voice over Internet Protocol (VoIP) devices. The necessity is also increasing to find out the ASR performance under different network impediments. Among them, speech and audio coding standards is the one, which affects the ASR performance greatly, when, using them with different sampling and bit rates in the practical systems. Another common impediment which influences the ASR accuracy is the bit errors in the wireless networks and packet drop conditions in the VoIP networks. ASR performance with some of the speech coding standards under noise conditions for the wireless networks is reported in the literature. However, each study is reporting the ASR performance for few narrowband codecs with different speech databases and different ASR toolkits like RAPHEL, HTK, SPHINX, etc. In this paper, the analysis on ASR performance while using both narrowband and wideband speech and audio coding standards, which are currently accepted for GSM mobile and VoIP networks, using the common speech database-TIMIT, and using ASR toolkit-SPHINX, is presented. The Mean Opinion Score (MOS), which is the generally accepted speech quality measurement technique, is also analyzed for all the speech and audio coding standards, using the same speech database. The results of the studies carried out for the ASR word accuracies and MOS values for different narrowband and wideband speech and audio codecs under no-loss conditions are presented. Results for different rates of packet drop condition which is the common noise scenario in wired networks such as VoIP (which is also merging with wireless networks) are also presented. The observation is that though some of the codecs are showing poor MOS performance at lower bit rates, the corresponding ASR performance is comparable with other codecs at higher bit rates.

Read full abstract

Mean Opinion Score Values Research Articles

Related Topics

Articles published on Mean Opinion Score Values

Pengembangan Aplikasi Text-to-Speech Bahasa Indonesia Menggunakan Metode Finite State Automata Berbasis Android

MCL-3D: A Database for Stereoscopic Image Quality Assessment using 2D-Image-Plus-Depth Source

CID2013: a database for evaluating no-reference image quality assessment algorithms.

Speech Enhancement Using Modified Modulation Magnitude Estimation-Based Spectral Subtraction Algorithm

Image database TID2013: Peculiarities, results and perspectives

Towards Layer Adaptation for Audio Transmission

Coding ECG beats using multiscale compressed sensing based processing

QoE model for video delivered over an LTE network using HTTP adaptive streaming

A Study on Objective Quality Measure for Bandwidth-Extended Speech in Mobile Voice Communications

Image Retargeting Quality Assessment: A Study of Subjective Scores and Objective Metrics

Investigation of Automatic Speech Recognition Performance and Mean Opinion Scores for Different Standard Speech and Audio Codecs

Real-Time VoIP Quality Measurement for Mobile Devices

Objectification of perceptual image quality for mobile video

Two speaker speech separation by LP residual weighting and harmonics enhancement

디지털 영상의 인지적 무참조 화질 평가 방법

Effect of degradations' distribution in a corpus test on auditory ratings

Image Quality Analysis for Visible Spectral Imaging Systems

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Mean Opinion Score Values Research Articles

Related Topics

Articles published on Mean Opinion Score Values

Pengembangan Aplikasi Text-to-Speech Bahasa Indonesia Menggunakan Metode Finite State Automata Berbasis Android

MCL-3D: A Database for Stereoscopic Image Quality Assessment using 2D-Image-Plus-Depth Source

CID2013: a database for evaluating no-reference image quality assessment algorithms.

Speech Enhancement Using Modified Modulation Magnitude Estimation-Based Spectral Subtraction Algorithm

Image database TID2013: Peculiarities, results and perspectives

Towards Layer Adaptation for Audio Transmission

Coding ECG beats using multiscale compressed sensing based processing

QoE model for video delivered over an LTE network using HTTP adaptive streaming

A Study on Objective Quality Measure for Bandwidth-Extended Speech in Mobile Voice Communications

Image Retargeting Quality Assessment: A Study of Subjective Scores and Objective Metrics

Investigation of Automatic Speech Recognition Performance and Mean Opinion Scores for Different Standard Speech and Audio Codecs

Real-Time VoIP Quality Measurement for Mobile Devices

Objectification of perceptual image quality for mobile video

Two speaker speech separation by LP residual weighting and harmonics enhancement

디지털 영상의 인지적 무참조 화질 평가 방법

Effect of degradations' distribution in a corpus test on auditory ratings

Image Quality Analysis for Visible Spectral Imaging Systems