Добавил:
Опубликованный материал нарушает ваши авторские права? Сообщите нам.
Вуз: Предмет: Файл:
курсач / материалы / es_201980v040102p.pdf
Скачиваний:
0
Добавлен:
13.05.2026
Размер:
1.25 Mб
Скачать

19 ETSI ES 201 980 V4.1.2 (2017-04)

The transmitted signal comprises a succession of OFDM symbols, each symbol being made of a guard interval followed by the so-called useful part of the symbol. Each symbol is the sum of K sine wave portions equally spaced in frequency. Each sine wave portion, called a "cell", is transmitted with a given amplitude and phase and corresponds to a carrier position. Each carrier is referenced by the index k, k belonging to the interval [kmin , kmax ] ( k = 0 corresponds to the reference frequency of the transmitted signal).

The time-related OFDM symbol parameters are expressed in multiples of the elementary time period T, which is equal to 83μs. These parameters are:

Tg : duration of the guard interval;

Ts : duration of an OFDM symbol;

Tu : duration of the useful (orthogonal) part of an OFDM symbol (i.e. excluding the guard interval). The OFDM symbols are grouped to form transmission frames of duration Tf.

As specified in clause 8, a certain number of cells in each OFDM symbol are transmitted with a predetermined amplitude and phase, in order to be used as references in the demodulation process. They are called "reference pilots" and represent a certain proportion of the total number of cells.

Table 2: OFDM symbol parameters

Parameters list

 

 

Robustness mode

 

 

 

A

B

 

C

 

D

E

T (μs)

83

83

 

83

 

83

83

Tu (ms)

24

21

 

14

 

9

(288 × T)

(256 × T)

 

(176 × T)

 

(112 × T)

(27 × T)

 

 

 

Tg (ms)

2

5

 

5

 

7

(32 × T)

(64 × T)

 

(64 × T)

 

(88 × T)

(3 × T)

 

 

 

Tg / Tu

1/9

1/4

 

4/11

 

11/14

1/9

Ts = Tu + Tg (ms)

26

26

 

20

 

16

Tf (ms)

400

400

 

400

 

400

100

5 Source coding modes

5.1Overview

5.1.0Introduction

The audio source coding options in the DRM system are shown in figure 2. As described in clause 4.3, the DRM system offers two audio codecs, xHE-AAC and AAC (in combination with the SBR and PS tools). Optionally, MPS can be used to enable multichannel coding. The encoded audio is composed into audio super frames of constant length.

ETSI

20

ETSI ES 201 980 V4.1.2 (2017-04)

Multiplexing and UEP of audio services is done by means of the multiplex and channel coding units. Audio specific configuration information is transmitted in the SDC (see clause 6.4.3.10).

 

 

 

 

 

 

 

 

 

 

 

 

DRM Audio Source Encoding

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

xHE-AAC Encoder

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

(stereo coding & SBR handled internally/automatically)

 

 

 

 

 

 

 

 

 

audio

 

 

MPEG Surround

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Audio

 

 

mux &

 

Encoder

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

channel

signal

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

super framing

 

(config. depnd.)

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

coding

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

MPEG PS

 

 

 

SBR Encoder

AAC

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Encoder

 

 

 

(configuration

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Encoder

 

 

 

 

 

 

 

 

 

 

 

 

 

 

(config. depnd.)

 

 

 

dependent)

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

DRM Audio Source Decoding

 

 

 

 

 

 

 

 

 

multichannel

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

audio output

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

xHE-AAC

 

 

MPEG Surround

 

 

 

bitstream

 

 

 

 

 

 

 

 

 

Decoder

 

 

 

 

 

 

Decoder

 

 

 

binaural stereo

 

Audio

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

from mux

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

super frame

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

output for

& channel

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

demux

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

headphones

decoding

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

AAC

 

 

 

SBR / PS

 

 

 

 

 

 

 

 

 

mono/stereo

 

 

 

 

 

 

 

 

 

 

 

 

Decoder

 

 

 

Decoder

 

 

 

 

 

 

 

 

 

audio output

Figure 2: Audio source coding overview

5.1.1Extended HE-AAC audio coding (xHE-AAC)

For generic coding of both audio and speech content at all bit rates, a subset of the MPEG xHE-AAC toolbox chosen to best suit the DRM system environment is used. For example a standard configuration for use in one short wave channel could be 16 kbit/s stereo.

Specific features of the xHE-AAC stream within the DRM system are:

Bit rate: xHE-AAC can be used at any bit rate. The granularity of the xHE-AAC bit rate is 20 bit/s for robustness modes A, B, C and D and 80 bit/s for robustness mode E.

Sampling rates: Permitted sampling rates for the use of xHE-AAC within DRM are selected such that the interface of the xHE-AAC codec towards the surrounding application can easily accept or provide 48 kHz audio signals, respectively. The actual core sampling rate is selected by the encoder upon initialization to ensure the best possible audio signal quality and is typically not visible to higher layers of processing.

Audio super framing: To ensure the best possible audio quality particularly at lower bit rates, the xHE-AAC encoder can flexibly assign the available bit rate within certain constraints to each audio frame. Audio super frames - as generated by the xHE-AAC audio encoder and inserted into the DRM logical frames - always have a constant size. However, the number of audio frames per audio super frame is not fixed, and audio frames may span audio super frames. This flexibility is achieved by a slight adjustment to the audio super frame header configuration used for the AAC codec in DRM. One audio super frame is always placed in one DRM logical frame in robustness modes A, B, C and D and in two logical frames in robustness mode E (see

clause 6). In this way no additional synchronization is needed for the audio coding. Retrieval of frame boundaries is also taken care of within the audio super frame.

UEP shall not be used with xHE-AAC.

ETSI

21

ETSI ES 201 980 V4.1.2 (2017-04)

5.1.2AAC audio coding

For generic audio coding, a subset of the MPEG-4 Advanced Audio Coding (AAC) toolbox chosen to best suit the DRM system environment is used. For example a standard configuration for use in one short wave channel could be 20 kbit/s mono.

Specific features of the AAC stream within the DRM system are:

Bit rate: AAC can be used at any bit rate. The granularity of the AAC bit rate is 20 bit/s for robustness modes A, B, C and D and 80 bit/s for robustness mode E.

Sampling rates: permitted sampling rates are 12 kHz and 24 kHz for robustness modes A, B, C and D and 24 kHz and 48 kHz for robustness mode E. 48 kHz is only permitted if the SBR tool is not used.

Transform length: the transform length is 960 to ensure that one audio frame corresponds to 80 ms or 40 ms (robustness modes A, B, C and D) or to 40 ms or 20 ms (robustness mode E) in time. This is required to allow the combination of an integer number of audio frames to build an audio super frame of 400 ms (robustness modes A, B, C and D) or 200 ms (robustness mode E) duration.

Error robustness: a subset of MPEG-4 tools is used to improve the AAC bit stream error robustness in error prone channels (the MPEG-4 EP tool is not used).

Audio super framing: 5 or 10 audio frames are composed into one audio super frame. For robustness modes A, B, C and D, the respective sampling rates are 12 kHz and 24 kHz producing an audio super frame of 400 ms duration; for robustness mode E, the respective sampling rates are 24 kHz and 48 kHz producing an audio super frame of 200 ms duration. The audio frames in the audio super frames are encoded together such that each audio super frame is of constant length, i.e. that bit exchange between audio frames is only possible within an audio super frame. One audio super frame is always placed in one logical frame in robustness modes A, B, C and D and in two logical frames in robustness mode E (see clause 6). In this way no additional synchronization is needed for the audio coding. Retrieval of frame boundaries and provisions for UEP are also taken care of within the audio super frame.

UEP: better graceful degradation and better operation at higher BERs is achieved by applying UEP to the AAC bit stream. Unequal error protection is realized via the multiplex/coding units. For robustness mode E, the length of the higher protected part of an audio super frame shall be a multiple of 2 bytes.

SBR coding

To maintain a reasonable perceived audio quality at low bit rates, classical audio or speech source coding algorithms need to limit the audio bandwidth and to operate at low sampling rates. It is desirable to be able to offer high audio bandwidth also in very low bit rate environments. This can be realized by the use of Spectral Band Replication (SBR).

The purpose of SBR is to recreate the missing high frequency band of the audio signal that could not be coded by the encoder. In order to do this in the best possible way, some side information needs to be transmitted in the audio bitstream, removing a small percentage of the available data rate from the audio coder. This side information is computed on the full bandwidth signal, prior to encoding and aids the reconstruction of the high frequencies after audio/speech decoding.

SBR exists in two versions. The version difference is only reflected in the decoder design. High Quality SBR uses a complex filterbank whereas Low Power SBR uses a real-valued filterbank plus anti-aliasing modules. The Low Power version of SBR offers a significant reduction in complexity as compared to the High Quality version without compromising too much on audio quality. AAC + SBR is defined in MPEG-4 Audio (High Efficiency AAC profile) [2].

PS coding

For improved performance at low bit rate stereo coding, a Parametric Stereo (PS) coder is available. The PS tool can be used when running the configuration AAC + SBR (MPEG High Efficiency AAC profile). The general idea with PS coding is to send stereo image describing data as side information along with a downmixed mono signal. This stereo side information is very concise and only requires a small fraction of the total bit rate allowing the mono signal to have maximum quality for the total bit rate given.

ETSI

Соседние файлы в папке материалы