Добавил:
Опубликованный материал нарушает ваши авторские права? Сообщите нам.
Вуз: Предмет: Файл:
литература / Digital_Video_and_HD_Second_Edition_Algorithms_and_Interfaces.pdf
Скачиваний:
0
Добавлен:
13.05.2026
Размер:
38.02 Mб
Скачать

Luma and colour differences can be computed from R’, G’, and B’ through a 3× 3 matrix multiplication.

Y’PBPR

Y’CBCR

Y’UV

Y’IQ

Colour difference coding (chroma)

In component video, the three components necessary to convey colour information are transmitted separately. Rather than conveying R’G’B’ directly, the relatively poor colour acuity of vision is exploited to reduce data capacity accorded to the colour information, while maintaining full luma detail. First, luma is formed according to Equation 12.1 (or for HD, Equation 12.2). Then, two colour difference signals based upon gammacorrected B’ minus luma and R’ minus luma, B’-Y’ and

R’-Y’, are formed by “matrixing.” Finally, subsampling (filtering) reduces detail in the colour difference (or chroma) components, as I will outline on page 127. Subsampling incurs no loss in sharpness at any reasonable viewing distance.

In component analog video, B’-Y’ and R’-Y’ were scaled to form colour difference signals denoted PB and PR, which were then analog lowpass filtered (horizontally) to about half the luma bandwidth. Y’, PB, and PR each have unity excursion (i.e., 0 to 1, ±0.5, and ±0.5). In computing, these components are commonly denoted U and V.

In component digital video (including M-JPEG, MPEG, and H.264), B’-Y’ and R’-Y’ are scaled to form CB and CR components, which can then be subsampled by digital filtering denoted 4:2:2 or 4:2:0; I will describe the subsampling in a moment.

In composite NTSC or PAL video, B’-Y’ and R’-Y’ were scaled to form U and V components. U and V were then lowpass filtered and combined into a modulated chroma component, C. Luma was then summed with modulated chroma to produce the composite NTSC or PAL signal. Scaling of U and V was arranged so that the excursion of the composite signal (Y’+C) was constrained to the range -13 to +43 of the unity excursion of luma. The historical U and V components are now obsolete, and today UV denotes Y’PBPR (see above).

Composite NTSC video was standardized in 1953 based upon I and Q components that were essentially U and V components rotated 33° and axis-exchanged. Excess detail was supposed to be removed from the Q component so as to improve colour quality. The scheme never achieved significant deployment; I and Q components have been completely obsolete for many decades.

CHAPTER 12

INTRODUCTION TO LUMA AND CHROMA

123

R’G’B’ 4:4:4

Y’CBCR 4:4:4

4:2:2

 

4:1:1

 

 

 

 

4:2:0 (JPEG/JFIF,

4:2:0

 

 

 

 

 

 

 

 

 

 

 

(BT.601)

(480i DV25; D-7)

H.261, MPEG-1)

(MPEG-2 fr)

 

R’0

 

R’1

 

 

Y’0

 

Y’1

 

 

Y’0

 

Y’1

 

 

Y’0

 

Y’1

 

Y’2

 

Y’3

 

 

 

Y’0

 

Y’1

 

 

 

Y’0

 

Y’1

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

R’2

R’3

Y’2

Y’3

 

Y’2

Y’3

 

Y’4

Y’5

Y’6

Y’7

Y’2

Y’3

 

 

Y’2

Y’3

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

G’0

G’1

 

CB0

 

CB1

 

CB0–1

 

 

CB0–3

 

 

 

CB0–3

 

 

 

CB0–3

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

G’2

 

G’3

 

 

CB2

 

CB3

 

CB2–3

 

 

CB4–7

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

B’0

B’1

CR0

CR1

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

CR0–1

 

 

 

 

 

 

CR0–3

 

 

CR0–3

 

 

 

CR0–3

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

B’2

 

B’3

 

 

CR2

 

CR3

 

CR2–3

 

 

 

 

 

 

CR4–7

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Figure 12.1 Chroma subsampling. A 2× 2 array of R’G’B’ pixels is matrixed into a luma component Y’ and two colour difference components CB and CR. Colour detail is reduced by subsampling CB and CR; providing full luma detail is maintained, no degradation is perceptible. In this sketch, samples are shaded to indicate their spatial position and extent. In 4:2:2, in 4:1:1, and in 4:2:0 used in MPEG-2, CB and CR are cosited (positioned horizontally coincident with a luma sample). In 4:2:0 used in JPEG/JFIF, H.261, and MPEG-1, CB and CR are sited interstitially (midway between luma samples). In the 4:2:0 variant used in consumer 576i DV (not sketched here), CB and CR are vertically sited in line-alternate fashion in each field (starting with a CR sample sited over the top left luma sample.)

 

Chroma subsampling

4:4:4

In Figure 12.1, the left-hand column sketches a 2× 2

 

array of R’G’B’ pixels. (Think of these 2× 2 arrays as

 

being overlaid on the display surface.) Prior to subsam-

 

pling, this sampling is denoted R’G’B’ 4:4:4. With 8 bits

 

per sample, this 2× 2 array of R’G’B’ would consume

 

12 bytes. Each R’G’B’ triplet (pixel) can be losslessly

 

transformed (“matrixed”) into Y’CBCR, as shown in the

 

second column; this is denoted Y’CBCR 4:4:4.

 

In component digital video, data capacity is reduced

 

by subsampling CB and CR using one of three schemes.

4:2:2

Y’CBCR studio digital video according to BT.601 uses

 

4:2:2 sampling: CB and CR components are each

 

subsampled by a factor of 2 horizontally. CB and CR are

 

sampled together, coincident (cosited) with even-

 

numbered luma samples. The 12 bytes of R’G’B’ are

 

reduced to 8, effecting 1.5:1 lossy compression.

4:1:1

Certain digital video systems, such as 480i29.97

DV25, use 4:1:1 sampling, whereby CB and CR components are each subsampled by a factor of 4 horizontally, and cosited with every fourth luma sample. The 12 bytes of R’G’B’ are reduced to 6, effecting 2:1 lossy compression.

124

DIGITAL VIDEO AND HD ALGORITHMS AND INTERFACES

4:2:0

ITU-T Rec. H.261 is a 1990s-vintage videoconferencing standard.

The use of 4 as the numerical basis for subsampling notation is a historical reference to sampling at roughly four times the NTSC colour subcarrier frequency. The 4fSC rate was already in use for composite digital video.

Figure 12.3 Chroma subsampling notation indicates, in the first digit, the luma horizontal sampling reference. The second digit specifies the horizontal subsampling of CB and CR with respect to luma. The third digit originally specified the horizontal subsampling of CR. The notation developed without anticipating vertical subsampling; a third digit of zero now denotes 2:1 vertical subsampling of both CB and CR.

This scheme is used in JPEG/JFIF, MPEG-2, and H.264. CB and CR are each subsampled by a factor of 2 horizontally and a factor of 2 vertically. The 12 bytes of R’G’B’ are reduced to 6. CB and CR are effectively centered vertically halfway between image rows. There are three variants of 4:2:0, having different vertical and horizontal siting. In MPEG-2, CB and CR are cosited horizontally. In JPEG/JFIF, H.261, and MPEG-1, CB and CR are sited interstitially, halfway between alternate luma samples. In 4:2:0 DV, CB and CR alternate line by line. Figure 12.2 overleaf summarizes the various schemes.

Subsampling effects 1.5:1 or 2:1 lossy compression. However, in studio terminology, subsampled video is referred to as uncompressed: The word compression is reserved for techniques such as JPEG, M-JPEG, MPEG, or H.264 that use transform coding (DCT or wavelets).

Chroma subsampling notation

At the outset of digital video, subsampling notation was logical; unfortunately, technology outgrew the notation. In Figure 12.3 below, I strive to clarify today’s nomenclature. Despite appearances, the notation doesn’t specify a ratio! The first digit originally specified luma sample rate relative to 338 MHz; the leading digit is now relative to the sample rate in use. The initial digit is typically 4, since all common chroma ratios are small powers of two – 4, 2, or 1. (3:1:1 subsampling was commercialized in an HD production system – Sony’s HDCAM – and 3 appeared as the leading digit. HDCAM has been superseded by HDCAM SR, which uses 4:2:2 or 4:4:4 subsampling.) Some people use 4:0:0 to denote greyscale (“monochrome”).

Luma horizontal sampling reference (originally, luma fS as multiple of 3 38 MHz)

CB and CR horizontal factor (relative to first digit)

Same as second digit;

or zero, indicating CB and CR are subsampled 2:1 vertically

4:2:2:4If present, same as luma digit; indicates

alpha (key) component

CHAPTER 12

INTRODUCTION TO LUMA AND CHROMA

125

CY

YCY

Y

CY

YCY

Y

4:2:2

CY

YCY

Y

CY

YCY

Y

 

progressive

CY

YCY

Y

CY

YCY

Y

 

 

CY

YCY

Y

CY

YCY

Y

 

 

CY

YCY

Y

CY

YCY

Y

 

 

CY

YCY

Y

CY

YCY

Y

 

 

CY

YCY

Y

CY

YCY

Y

 

 

CY

YCY

Y

CY

YCY

Y

 

 

 

 

 

 

 

 

 

 

 

4:2:0

 

 

 

 

 

 

 

 

 

Y

Y

Y

Y

Y

Y

Y

Y

 

C

 

C

 

C

 

C

 

 

MPEG-2

Y

Y

Y

Y

Y

Y

Y

Y

 

frame

Y

Y

Y

Y

Y

Y

Y

Y

 

picture

C

Y

C

Y

C

Y

C

Y

 

(progressive)

Y

Y

Y

Y

 

Y

Y

Y

Y

Y

Y

Y

Y

 

 

C

Y

C

Y

C

Y

C

Y

 

 

Y

Y

Y

Y

 

 

Y

Y

Y

Y

Y

Y

Y

Y

 

 

C

Y

C

Y

C

Y

C

Y

 

 

Y

Y

Y

Y

 

 

 

 

 

 

 

 

 

 

4:2:0

Y

Y

Y

Y

Y

Y

Y

Y

C

Y

C

Y

C

Y

C

Y

 

JPEG/JFIF

Y

Y

Y

Y

 

Y

Y

Y

Y

Y

Y

Y

Y

 

(progressive)

 

 

C

Y

C

Y

C

Y

C

Y

 

 

Y

Y

Y

Y

 

 

Y

Y

Y

Y

Y

Y

Y

Y

 

 

C

Y

C

Y

C

Y

C

Y

 

 

Y

Y

Y

Y

 

 

Y

Y

Y

Y

Y

Y

Y

Y

 

 

C

Y

C

Y

C

Y

C

Y

 

 

Y

Y

Y

Y

 

 

 

 

 

 

 

 

 

 

 

 

Figure 12.2 Subsampling schemes are summarized here. C indicates a [CB, CR] sample pair when located at the same site; otherwise (as in the DV schemes) individual CB and CR notations indicate the centers of the respective chroma samples. Y’ indicates the center of a luma sample. The schemes in the left column are progressive. The schemes in the right column are interlaced; there, solid letters indicate top field samples and shaded letters indicate bottom field samples.

CY

YCY

Y

CY

YCY

Y

4:2:2

 

 

 

Y

 

Y

 

CY

Y

 

Y

(BT.601)

CY

CY

CY

CY

YCY

Y

CY

YCY

Y

interlaced

 

 

 

 

Y

 

Y

 

 

 

Y

 

Y

 

 

CY

CY

CY

CY

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

CY

YCY

Y

CY

YCY

Y

 

 

CY

Y

 

Y

 

 

 

Y

 

Y

 

CY

CY

CY

 

CY

YCY

Y

CY

YCY

Y

 

 

 

 

Y

 

Y

 

 

 

Y

 

Y

 

 

CY

CY

CY

CY

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

4:2:0

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Y

Y

Y

Y

 

Y

Y

Y

Y

C

Y

C

Y

C

Y

C

Y

MPEG-2

 

Y

Y

 

Y

Y

 

Y

Y

Y

Y

 

Y

Y

Y

Y

interlaced

 

 

 

 

C

 

 

C

 

 

 

C

 

 

C

 

 

 

 

Y

Y

Y

Y

 

Y

Y

Y

Y

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Y

Y

Y

Y

 

Y

Y

Y

Y

 

C

Y

C

Y

C

Y

C

Y

 

 

Y

Y

 

Y

Y

 

 

Y

Y

Y

Y

 

Y

Y

Y

Y

 

 

C

 

 

C

 

 

 

C

 

 

C

 

 

 

 

Y

Y

Y

Y

 

Y

Y

Y

Y

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

4:2:0 DV

 

 

 

 

 

 

 

 

 

 

 

CY

YCY

Y

CY

YCY

Y

 

 

R

 

 

R

 

 

 

R

 

 

R

 

interlaced

 

CY

Y

CY

Y

 

CY

Y

CY

Y

 

 

 

 

 

R

 

 

R

 

 

 

R

 

 

R

 

 

CY

Y

CY

Y

CY

Y

CY

Y

 

 

 

B

 

 

B

 

 

 

B

 

 

B

 

 

 

CY

B

Y

CY

B

Y

 

CY

B

Y

CY

B

Y

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

CY

YCY

Y

CY

YCY

Y

 

 

 

R

 

 

R

 

 

 

R

 

 

R

 

 

 

CY

R

Y

CY

R

Y

 

CY

R

Y

CY

R

Y

 

 

 

 

 

CY

Y

CY

Y

CY

Y

CY

Y

 

 

 

B

 

 

B

 

 

 

B

 

 

B

 

 

 

CY

B

Y

CY

B

Y

 

CY

B

Y

CY

B

Y

4:1:1 DV

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

CY

Y

Y

Y

CY

Y

Y

Y

 

 

 

Y

Y

Y

 

 

 

Y

Y

Y

interlaced

 

CY

 

CY

 

CY

Y

Y

Y

CY

Y

Y

Y

 

 

CY

Y

Y

Y

 

CY

Y

Y

Y

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

CY

Y

Y

Y

CY

Y

Y

Y

 

 

CY

Y

Y

Y

 

CY

Y

Y

Y

 

CY

Y

Y

Y

CY

Y

Y

Y

 

 

CY

Y

Y

Y

 

CY

Y

Y

Y

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

126

DIGITAL VIDEO AND HD ALGORITHMS AND INTERFACES

Соседние файлы в папке литература