Perceived audio quality of sounds degraded by\ud non-linear distortions and single-ended\ud assessment using HASQI

Article English OPEN
Kendrick, P ; Jackson, IR ; Li, FF ; Fazenda, BM ; Cox, TJ

For field recordings and user generated content recorded on phones, tablets, and other mobile\ud devices nonlinear distortions caused by clipping and limiting at pre-amplification stages, and\ud dynamic range control (DRC) are common causes of poor audio quality. A single-ended\ud method to detect these distortions and predict perceived degradation in speech, music, and\ud soundscapes has been developed. This was done by training an ensemble of decision trees.\ud During training, both clean and distorted audio was available and so the perceived quality\ud could be gauged using HASQI (Hearing Aid Sound Quality Index). The new single-ended\ud method can correctly predict HASQI from distorted samples to an accuracy of ±0.19 (95%\ud confidence interval) using a quality range between 0.0 and 1.0. The method also has potential\ud for estimating HASQI when other types of degradations are present. Subsequent perceptual\ud tests validated the method for music and soundscapes. For the average mean opinion score\ud for perceived audio quality on a scale from 0 to 1, the single ended method could estimate it\ud within ±0.33.
  • References (47)
    47 references, page 1 of 5

    [1.] C. Wardle, S. Dubberley, and P. Brown, “Amateur Footage: A Global Study of UserGenerated Content in TV and Online-News Output” (2014). [Online]. Available: http://towcenter.org/wpcontent/uploads/2014/04/80458 Tow-Center-ReportWEB.pdf. [Accessed: 26-Nov-2014].

    [2.] I. Jackson, “What You Told Us about Recording Audio: An Overview of Our Web Survey,” The Good Recording Project Blog (2012). [Online]. Available: http://www.goodrecording.net/211/. [Accessed: 20- Nov-2012].

    [3.] I. R. Jackson, P. Kendrick, T. J. Cox, B. M. Fazenda, and F. F. Li, “Perception and Automatic Detection of WindInduced Microphone Noise,” J. Acous. Soc. Am., vol. 136, no. 3, p. 1176 (2014). http://dx.doi.org/10.1121/1.4892772

    [4.] “Sound System Equipment-Part 5 Loudspeakers,” BS EN 60268-5 (2009).

    [5.] “Measurement of Intermodulation Distortion in Audio Systems,” SMPTE Recommended Practice RP 120:2005 (2005).

    [6.] R. Small, “Total Difference-Frequency Distortion: Practical Measurements,” J. Audio Eng. Soc., vol. 34, no. 6, pp. 427-436 (1986 June).

    [7.] “Sound System Equipment-Part 3 Amplifiers,” BS EN 60268-3 (2001).

    [8.] L. Lee and E. Geddes, “Auditory Perception of Nonlinear Distortion,” presented at the 115th Convention of the Audio Engineering Society (2003 Oct.), convention paper 5891.

    [9.] E. Geddes and L. Lee, “Auditory Perception of Nonlinear Distortion-Theory,” presented at the 115th Convention of the Audio Engineering Society (2003 Oct.), convention paper 5890.

    [10.] A. W. Rix, M. P. Hollier, A. P. Hekstra and J. G. Beerends, “PESQ, the New ITU Standard for Objective Measurement of Perceived Speech Quality, Part I-Time Alignment,” J. Audio Eng. Soc., vol. 50, pp. 755-764 (2002 Oct.).

  • Metrics
    0
    views in OpenAIRE
    0
    views in local repository
    186
    downloads in local repository

    The information is available from the following content providers:

    From Number Of Views Number Of Downloads
    University of Salford Institutional Repository - IRUS-UK 0 186
Share - Bookmark