
- •Itu-t p-series recommendations telephone transmission quality, telephone installations, local line networks
- •Intellectual property rights
- •Contents
- •Itu-t Recommendation p.862 Perceptual evaluation of speech quality (pesq): An objective method for end-to-end speech quality assessment of narrow‑band telephone networks and speech codecs1
- •1 Introduction
- •2 Normative references
- •3 Abbreviations
- •4 Scope
- •Table 1/p.862 Factors for which pesq had demonstrated acceptable accuracy
- •Table 2/p.862 – pesq is known to provide inaccurate predictions when used in conjunction with these variables, or is otherwise not intended to be used with these variables
- •5 Conventions
- •6 Overview of pesq
- •Figure 1/p.862 – Overview of the basic philosophy used in pesq
- •7 Comparison between objective and subjective scores
- •7.1 Correlation coefficient
- •7.2 Residual errors
- •8 Preparation of processed speech material
- •8.1 Source material
- •8.1.1 Choice of source material
- •8.1.2 Itu-t Temporal structure and duration of source material
- •8.1.3 Filtering and level calibration
- •8.2 Addition of background noise
- •Figure 2/p.862 – Methods for testing quality with and without environmental noise
- •8.3 Processing through system under test
- •9 Selection of experimental parameters
- •10 Description of pesq algorithm
- •Figure 3/p.862 – Overview of the alignment routine used in pesq to determine the delay per time interval di
- •Irs filtering
- •10.1.3 Time alignment
- •10.1.3.1 Envelope-based alignment
- •10.1.3.2 Fine time alignment
- •10.1.3.3 Utterance splitting
- •10.1.3.4 Perceptual realignment
- •10.2 Perceptual model (Figures 4a and 4b)
- •10.2.1 Precomputation of constant settings
- •10.2.1.1 Fft window size depending on the sample frequency (8 or 16 kHz)
- •10.2.1.2 Absolute hearing threshold
- •10.2.1.3 The power scaling factor
- •10.2.1.4 The loudness scaling factor
- •10.2.2 Irs-receive filtering
- •10.2.3 Computation of the active speech time interval
- •10.2.4 Short-term Fast Fourier Transform
- •10.2.5 Calculation of the pitch power densities
- •10.2.6 Partial compensation of the original pitch power density for transfer function equalization
- •10.2.7 Partial compensation of the distorted pitch power density for time‑varying gain variations between distorted and original signal
- •10.2.8 Calculation of the loudness densities
- •10.2.9 Calculation of the disturbance density
- •10.2.10 Cell-wise multiplication with an asymmetry factor
- •10.2.11 Aggregation of the disturbance densities over frequency and emphasis on soft parts of the original
- •10.2.12 Zeroing of the frame disturbance for frames during which the delay decreased significantly
- •10.2.13 Realignment of bad intervals
- •10.2.14 Aggregation of the disturbance within split second intervals
- •10.2.15 Aggregation of the disturbance over the duration of the speech signal (around 10 s), including a recency factor
- •10.2.16 Computation of the pesq score
- •Bibliography
- •Reference implementation of pesq and conformance testing List of files provided for the ansi-c reference implementation
- •List of files provided for conformance validation
- •Conformance validation
- •Comparison with itu-t p‑series Supplement 23
- •Comparison with variable delay files
- •Additional comparisons
|
INTERNATIONAL TELECOMMUNICATION UNION | |||
|
| |||
|
ITU-T |
P.862 | ||
|
TELECOMMUNICATION STANDARDIZATION SECTOR OF ITU |
(02/2001) | ||
SERIES P: TELEPHONE TRANSMISSION QUALITY, TELEPHONE INSTALLATIONS, LOCAL LINE NETWORKS Methods for objective and subjective assessment of quality
| ||||
Perceptual evaluation of speech quality (PESQ): An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs | ||||
|
ITU‑T Recommendation P.862 (Formerly CCITT Recommendation) |
Itu-t p-series recommendations telephone transmission quality, telephone installations, local line networks
| |
Vocabulary and effects of transmission parameters on customer opinion of transmission quality |
Series P.10 |
Subscribers' lines and sets |
Series P.30 |
|
P.300 |
Transmission standards |
Series P.40 |
Objective measuring apparatus |
Series P.50 |
|
P.500 |
Objective electro-acoustical measurements |
Series P.60 |
Measurements related to speech loudness |
Series P.70 |
Methods for objective and subjective assessment of quality |
Series P.80 |
|
P.800 |
Audiovisual quality in multimedia services |
Series P.900 |
|
|
For further details, please refer to the list of ITU-T Recommendations.
ITU-T Recommendation P.862
Perceptual evaluation of speech quality (PESQ): An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs
|
Summary This Recommendation describes an objective method for predicting the subjective quality of 3.1 kHz (narrow-band) handset telephony and narrow-band speech codecs. This Recommendation presents a high-level description of the method, advice on how to use it, and part of the results from a Study Group 12 benchmark carried out in the period 1999-2000. An ANSI-C reference implementation, described in Annex A, is provided in separate files and form an integral part of this Recommendation. A conformance testing procedure is also specified in Annex A to allow a user to validate that an alternative implementation of the model is correct. This ANSI-C reference implementation shall take precedence in case of conflicts between the high-level description as given in this Recommendation and the ANSI-C reference implementaion. This Recommendation includes an electornic attachment containing an ANSI-C reference implementation of PESQ and conformance testing data. |
Source ITU‑T Recommendation P.862 was prepared by ITU‑T Study Group 12 (2001‑2004) and approved under the WTSA Resolution 1 procedure on 23 February 2001. |
|
FOREWORD
The International Telecommunication Union (ITU) is the United Nations specialized agency in the field of telecommunications. The ITU Telecommunication Standardization Sector (ITU-T) is a permanent organ of ITU. ITU-T is responsible for studying technical, operating and tariff questions and issuing Recommendations on them with a view to standardizing telecommunications on a worldwide basis.
The World Telecommunication Standardization Assembly (WTSA), which meets every four years, establishes the topics for study by the ITU‑T study groups which, in turn, produce Recommendations on these topics.
The approval of ITU-T Recommendations is covered by the procedure laid down in WTSA Resolution 1.
In some areas of information technology which fall within ITU-T's purview, the necessary standards are prepared on a collaborative basis with ISO and IEC.
NOTE
In this Recommendation, the expression "Administration" is used for conciseness to indicate both a telecommunication administration and a recognized operating agency.