Abstract
This paper proposes a new output-based method for assessing speech quality and evaluates its performance. The measure is based on comparing the output speech to an artificial reference signal representing the closest match from an appropriately formulated codebook. The codebook holds a number of optimally clustered speech parameter vectors, extracted from an undistorted clean speech database, and provides a reference for computing objective auditory distance measures for distorted speech. The median minimum distance is used as a measure of the objective auditory distance. The required clustering and matching processes are achieved by using an efficient data mining technique known as the Self-Organising Map. Speech parameters derived from Perceptual Linear Prediction (PLP) and Bark Spectrum analysis are used to provide speaker independent information as required by an output-based objective approach for speech quality measure.
Original language | English |
---|---|
Pages (from-to) | 476-479 |
Number of pages | 4 |
Journal | ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings |
Volume | 1 |
Publication status | Published - 2003 |
Event | 2003 IEEE International Conference on Accoustics, Speech, and Signal Processing - Hong Kong, Hong Kong Duration: 6 Apr 2003 → 10 Apr 2003 |