Non-intrusive assessment of perceptual speech quality using a self-organising map

Dorel Picovici, Abdulhussain E. Mahdi

Research output: Contribution to conferencePaperpeer-review

Abstract

A new output-based method for non-intrusive assessment of speech quality for voice communication system is proposed and its performance evaluated. The method is based on comparing the output speech to an appropriate reference representing the closest match from a pre-formulated codebook containing optimally clustered speech parameter vectors extracted from a large number of various undistorted clean speech records. The objective auditory distances between vectors of the distorted speech and their corresponding matching references are then measured and appropriately converted into an equivalent subjective score. The optimal clustering of the reference codebook is achieved by a dynamic k-means method. A self-organising map algorithm is used to match the distorted speech vectors to the references. Speech parameters derived from Bark spectrum analysis, Perceptual Linear Prediction (PLP), and Mel- Frequency Cepstral coefficients (MFCC) are used to provide speaker independent parametric representation of the speech signals as required by an output-based quality measure.

Original languageEnglish
Pages2077-2080
Number of pages4
Publication statusPublished - 2003
Event8th European Conference on Speech Communication and Technology, EUROSPEECH 2003 - Geneva, Switzerland
Duration: 1 Sep 20034 Sep 2003

Conference

Conference8th European Conference on Speech Communication and Technology, EUROSPEECH 2003
Country/TerritorySwitzerland
CityGeneva
Period1/09/034/09/03

Fingerprint

Dive into the research topics of 'Non-intrusive assessment of perceptual speech quality using a self-organising map'. Together they form a unique fingerprint.

Cite this