Abstract
An enhanced version of a single-ended measure for non-intrusive assessment of speech quality for telephony applications is described and its performance evaluated. The new measure, which uses only the output of the system, is based on measuring perception-based objective auditory distances between voiced parts of the processed speech whose quality is to be evaluated to appropriately matching references extracted from one of four pre-formulated reference books, depending on their estimated pitch values. The reference books are formed by optimally clustering large number of parametric speech vectors extracted from a database of clean speech records. The measured auditory distances are then mapped into objective listening quality scores (MOS_LQO). The required clustering and matching process was achieved using a K Dimensional tree structure (KD-Tree). The short-time Bark Spectrum analysis is used in order to achieve perception-based, speaker-independent parametric representation of the speech. Reported evaluation results show that the proposed enhanced speech quality assessment method provides quality scores that are highly correlated with MOS_LQS obtained by formal subjective listening tests.
Original language | English |
---|---|
Pages (from-to) | 18-23 |
Number of pages | 6 |
Journal | IEE Conference Publication |
Issue number | CP 511 |
DOIs | |
Publication status | Published - 2005 |
Event | IEE Irish Signals and Systems Conference - Dublin, Ireland Duration: 1 Sep 2005 → 2 Sep 2005 |