Real-time, non-intrusive speech quality estimation: A signal-based model

Adil Raja, Colin Flanagan

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Speech quality estimation, as perceived by humans, is of vital importance to proper functioning of telecommunications networks. Speech quality can be degraded due to various network related problems. In this paper we present a model for speech quality estimation that is a function of various time and frequency domain features of human speech. We have employed a hybrid optimization approach, by using Genetic Programming (GP) to find a suitable structure for the desired model. In order to optimize the coefficients of the model we have employed a traditional GA and a numerical method known as linear scaling. The proposed model outperforms the ITU-T Recommendation P.563 in terms of prediction accuracy, which is the current non-intrusive speech quality estimation model. The proposed model also has a significantly reduced dimensionality. This may reduce the computational requirements of the model.

Original languageEnglish
Title of host publicationGenetic Programming - 11th European Conference, EuroGP 2008, Proceedings
Pages37-48
Number of pages12
DOIs
Publication statusPublished - 2008
Event11th European Conference on Genetic Programming, EuroGP 2008 - Naples, Italy
Duration: 26 Mar 200828 Mar 2008

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume4971 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference11th European Conference on Genetic Programming, EuroGP 2008
Country/TerritoryItaly
CityNaples
Period26/03/0828/03/08

Keywords

  • GP
  • MOS
  • Non-intrusive
  • Signal-based

Fingerprint

Dive into the research topics of 'Real-time, non-intrusive speech quality estimation: A signal-based model'. Together they form a unique fingerprint.

Cite this