VoIP speech quality estimation in a mixed context with genetic programming

Adil Raja, R. Muhammad Atif Azad, Conor Ryan, Colin Flanagan

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Voice over IP (VoIP) speech quality estimation is crucial to providing optimal Quality of Service (QoS). This paper seeks to provide improved speech quality estimation models with better prediction accuracy by considering a richer set of input features than the current International Telecommunications Union-Telecommunication (ITU-T) recommendations. It addresses a transitional phase, where wideband (WB) networks are becoming available. However, they have to co-exist with the existing narrowband (NB) setups for the time being. Quality estimation becomes a challenge in such a mixed context. The ITU-T recommendation (termed E-Model) has recently been extended to deal with the mixed context. However, it evaluates the speech degradation in the WB scenario based solely on codec related distortions (only a subset of factors affecting the speech quality on a VoIP network). The extension is derived out of speech signals evaluated by human subjects: an expensive and difficult to reproduce exercise. This paper innovates by considering a number of other network distortion types as well to produce generalised models that predict the quality degradation to a higher accuracy. To this end, an extensive set of speech samples is subjected to a wide variety of distortions. The degraded signals are evaluated by the currently best available algorithmic approximation of human evaluation of speech to produce quality scores. Using the distortions as the input features and targeting the quality scores, we employ Genetic Programming to produce parsimonious models that show considerable prediction gain compared to the E-Model. As against some existing approaches, where the models are tailored to various telephony codecs, the evolved models generalise across a variety of modern codecs.

Original languageEnglish
Title of host publicationGECCO'08
Subtitle of host publicationProceedings of the 10th Annual Conference on Genetic and Evolutionary Computation 2008
PublisherAssociation for Computing Machinery (ACM)
Pages1627-1634
Number of pages8
ISBN (Print)9781605581309
DOIs
Publication statusPublished - 2008
Event10th Annual Genetic and Evolutionary Computation Conference, GECCO 2008 - Atlanta, GA, United States
Duration: 12 Jul 200816 Jul 2008

Publication series

NameGECCO'08: Proceedings of the 10th Annual Conference on Genetic and Evolutionary Computation 2008

Conference

Conference10th Annual Genetic and Evolutionary Computation Conference, GECCO 2008
Country/TerritoryUnited States
CityAtlanta, GA
Period12/07/0816/07/08

Keywords

  • E-model
  • Genetic programming
  • I,W B,eff
  • PESQ-WB
  • Speech quality
  • Symbolic regression
  • VoIP

Fingerprint

Dive into the research topics of 'VoIP speech quality estimation in a mixed context with genetic programming'. Together they form a unique fingerprint.

Cite this