PS-2020a / part05
.pdfDICOM PS3.5 2020a - Data Structures and Encoding |
Page 71 |
compresseddatastream.ThePixelDatacharacteristicsincludedintheJPEG2000bitstreamshallbeusedtodecodethecompressed data stream.
The requirements when using a Standard Photometric Interpretation (i.e., a Defined Term from PS.3. C.7.6.3.1.2) are specified in Table 8.2.4-1. No other Standard Photometric Interpretation values shall be used.
Table 8.2.4-1. Valid Values of Pixel Data Related Attributes for JPEG 2000 Transfer Syntaxes using Standard Photometric Interpretations
Photometric |
Transfer |
Transfer Syntax UIDSamples |
Planar |
Pixel |
Bits |
Bits High |
|
Interpretation |
Syntax |
per PixelConfigurationRepresentationAllocatedStoredBit |
|||||
MONOCHROME1JPEG 2000 |
1.2.840.10008.1.2.4.90 |
1 |
absent |
0 or 1 |
8, 16, 24, 1-38 0-37 |
||
|
(LosslessOnly) |
|
|
|
32 or 40 |
|
|
MONOCHROME2 |
1.2.840.10008.1.2.4.91 |
|
|
|
|
|
|
|
JPEG 2000 |
|
|
|
|
|
|
PALETTECOLORJPEG 2000 |
1.2.840.10008.1.2.4.90 |
1 |
absent |
0 |
8 or 16 |
1-16 0-15 |
|
|
(LosslessOnly) |
|
|
|
|
|
|
YBR_RCT |
JPEG 2000 |
1.2.840.10008.1.2.4.90 |
3 |
0 |
0 |
8, 16, 24, 1-38 0-37 |
|
|
(LosslessOnly) |
|
|
|
32 or 40 |
|
|
|
JPEG 2000 |
1.2.840.10008.1.2.4.91 |
|
|
|
|
|
|
|
|
|
|
|
|
|
YBR_ICT |
JPEG 2000 |
1.2.840.10008.1.2.4.91 |
3 |
0 |
0 |
8, 16, 24, 1-38 0-37 |
|
|
|
|
|
|
|
32 or 40 |
|
RGB |
JPEG 2000 |
1.2.840.10008.1.2.4.90 |
3 |
0 |
0 |
8, 16, 24, 1-38 0-37 |
|
|
(LosslessOnly) |
|
|
|
32 or 40 |
|
|
|
JPEG 2000 |
1.2.840.10008.1.2.4.91 |
|
|
|
|
|
|
|
|
|
|
|
|
|
YBR_FULL |
JPEG 2000 |
1.2.840.10008.1.2.4.90 |
3 |
0 |
0 |
8, 16, 24, 1-38 0-37 |
|
|
(LosslessOnly) |
|
|
|
32 or 40 |
|
|
|
JPEG 2000 |
1.2.840.10008.1.2.4.91 |
|
|
|
|
|
|
|
|
|
|
|
|
Note
Theserequirementsarespecifiedintermsofconsistencywithwhatisencapsulated,ratherthanintermsoftheuncompressed pixel data from which the compressed data stream may have been derived.
Whendecompressing,shouldthecharacteristicsexplicitlyspecifiedinthecompresseddatastreambeinconsistentwiththosespecified in the DICOM Data Elements, those explicitly specified in the compressed data stream should be used to control the decompression. The DICOM data elements, if inconsistent, can be regarded as suggestions as to the form in which an uncompressed Data Set might be encoded, subject to the general and IOD-specific rules for uncompressed Photometric Interpretation and Planar Configuration, which may require that decompressed data be converted to one of the permitted forms.
The JPEG 2000 bit stream specifies whether or not a reversible or irreversible multi-component (color) transformation [ISO 15444-1 Annex G], if any, has been applied. If no multi-component transformation has been applied, then the components shall correspond tothosespecifiedbytheDICOMAttributePhotometricInterpretation(0028,0004).IftheJPEG2000Part1reversiblemulti-component transformation has been applied then the DICOM Attribute Photometric Interpretation (0028,0004) shall be YBR_RCT. If the JPEG 2000 Part 1 irreversible multi-component transformation has been applied then the DICOM Attribute Photometric Interpretation (0028,0004) shall be YBR_ICT.
Note
1.For example, single component may be present, and the Photometric Interpretation (0028,0004) may be MONO- CHROME2.
2.The application of a JPEG 2000 Part 1 reversible multi-component transformation is signaled in the JPEG 2000 bit stream by a value of 1 rather than 0 in the SGcod Multiple component transformation type of the COD marker segment [ISO 15444-1 Table A.17]. No other value of Photometric Interpretation than YBR_RCT or YBR_ICT is permitted when SGcod Multiple component transformation type is 1.
- Standard -
Page 72 |
DICOM PS3.5 2020a - Data Structures and Encoding |
3.Though it would be unusual, would not take advantage of correlation between the red, green and blue components, and would not achieve effective compression, a Photometric Interpretation of RGB could be specified as long as no multi- component transformation [ISO 15444-1 Annex G] was specified by the JPEG 2000 bit stream. For some applications the use of RGB is permitted, e.g., Whole Slide Microscopy Images, to allow conversion to DICOM from proprietary formats without loss due to color space transformation. Alternative methods of decorrelation of the color components than those specified in [ISO 15444-1 Annex G] are permitted as defined in PS3.3, such as a Photometric Interpretation of YBR_FULL; this may be useful when converting existing YBR_FULL Pixel Data (e.g., in a different Transfer Syntax) without further loss.
Ineithercase(PhotometricInterpretationofRGBorYBR_FULL),thevalueofSGcodMultiplecomponenttransformation type would be 0.
PS3.3 may constrain the values of Photometric Interpretation for specific IODs.
4.Despite the application of a multi-component color transformation and its reflection in the Photometric Interpretation at- tribute, the "color space" remains undefined. There is currently no means of conveying "standard color spaces" either by fixed values (such as sRGB) or by ICC profiles. Note in particular that the JP2 file header is not sent in the JPEG 2000 bit stream that is encapsulated in DICOM.
5.If JPEG 2000 Compressed Pixel Data is decompressed and re-encoded in Native (uncompressed) form, then the Data Elements that are related to the Pixel Data encoding are updated accordingly. If color components are converted from YBR_ICT or YBR_RCT to RGB during decompression and Native re-encoding, the Photometric Interpretation will be changed to RGB in the Data Set with the Native encoding.
6.The upper limit of 40 on Bits Allocated (0028,0100) and 38 on Bits Stored (0028,0101) reflects the maximum JPEG 2000 sample precision of 38 and the DICOM requirement to describe Bits Allocated (0028,0100) as multiples of bytes (octets).
The JPEG 2000 bit stream is capable of encoding both signed and unsigned pixel values, hence the value of Pixel Representation (0028,0103) may be either 0 or 1 for monochrome Photometric Interpretations depending on what has been encoded (as specified in the SIZ marker segment in the precision and sign of component parameter).
The value of Planar Configuration (0028,0006) is irrelevant since the manner of encoding components is specified in the JPEG 2000 standard, hence it shall be set to 0.
8.2.5 MPEG2 Main Profile / Main Level Video Compression
DICOMprovidesamechanismforsupportingtheuseofMPEG2MainProfile/MainLevelVideoCompressionthroughtheEncapsulated Format (see PS3.3). Annex A defines a Transfer Syntax that references the MPEG2 Main Profile / Main Level Standard.
Note
MPEG2 compression is inherently lossy. The context where the usage of lossy compression of medical images is clinically acceptableisbeyondthescopeoftheDICOMStandard.Thepoliciesassociatedwiththeselectionofappropriatecompression parameters (e.g., compression ratio) for MPEG2 Main Profile / Main Level are also beyond the scope of this Standard.
The use of the DICOM Encapsulated Format to support MPEG2 Main Profile / Main Level compressed pixel data requires that the Data Elements that are related to the Pixel Data encoding (e.g., Photometric Interpretation, Samples per Pixel, Planar Configuration, Bits Allocated, Bits Stored, High Bit, Pixel Representation, Rows, Columns, etc.) shall contain values that are consistent with the characteristics of the compressed data stream, with some specific exceptions noted here. The Pixel Data characteristics included in the MPEG2 Main Profile / Main Level bit stream shall be used to decode the compressed data stream.
Note
Theserequirementsarespecifiedintermsofconsistencywithwhatisencapsulated,ratherthanintermsoftheuncompressed pixel data from which the compressed data stream may have been derived.
Whendecompressing,shouldthecharacteristicsexplicitlyspecifiedinthecompresseddatastreambeinconsistentwiththosespecified in the DICOM Data Elements, those explicitly specified in the compressed data stream should be used to control the decompression. The DICOM data elements, if inconsistent, can be regarded as suggestions as to the form in which an uncompressed Data Set might be encoded, subject to the general and IOD-specific rules for uncompressed Photometric Interpretation and Planar Configuration, which may require that decompressed data be converted to one of the permitted forms.
- Standard -
DICOM PS3.5 2020a - Data Structures and Encoding |
Page 73 |
The MPEG2 Main Profile / Main Level bit stream specifies whether or not a reversible or irreversible multi-component (color) trans- formation, if any, has been applied. If no multi-component transformation has been applied, then the components shall correspond to those specified by the DICOM Attribute Photometric Interpretation (0028,0004). MPEG2 Main Profile / Main Level applies an irre- versible multi-component transformation, so DICOM Attribute Photometric Interpretation (0028,0004) shall be YBR_PARTIAL_420 in the case of multi-component data, and MONOCHROME2 in the case of single component data (even though the MPEG2 bit stream itself is always encoded as three components, one luminance and two chrominance).
Note
1.If MPEG2 Compressed Pixel Data is decompressed and re-encoded in Native (uncompressed) form, then the Data Elements that are related to the Pixel Data encoding are updated accordingly. If color components are converted from YBR_PARTIAL_420 to RGB during decompression and Native re-encoding, the Photometric Interpretation will be changed to RGB in the Data Set with the Native encoding.
2.MPEG2 proposes some video formats. Each of the standards specified is used in a different market, including: ITU-R BT.470-2 System M for SD NTSC and ITU-R BT.470-2 System B/G for SD PAL/SECAM. A PAL based system should therefore be based on ITU-BT.470 System B for each of Color Primaries, Transfer Characteristic (gamma) and matrix coefficients and should take a value of 5 as defined in [ISO/IEC 13818-2].
The value of Planar Configuration (0028,0006) is irrelevant since the manner of encoding components is specified in the MPEG2 Main Profile / Main Level standard, hence it shall be set to 0.
In summary:
•Samples per Pixel (0028,0002) shall be 3
•Photometric Interpretation (0028,0004) shall be YBR_PARTIAL_420
•Bits Allocated (0028,0100) shall be 8
•Bits Stored (0028,0101) shall be 8
•High Bit (0028,0102) shall be 7
•Pixel Representation (0028,0103) shall be 0
•Planar Configuration (0028,0006) shall be 0
•Rows(0028,0010),Columns(0028,0011),CineRate(0018,0040)andFrameTime(0018,1063)orFrameTimeVector(0018,1065) shall be consistent with the limitations of Main Profile / Main Level, as specified in Table 8-1.
Table 8-1. MPEG2 Main Profile / Main Level Image Transfer Syntax Rows and Columns Attributes
Video Type |
Spatial resolution |
Frame Rate |
Frame Time |
Maximum Rows |
Maximum |
|
|
(see Note 4) |
(see Note 5) |
|
Columns |
|
|
|
|
||
525-line NTSC |
Full |
30 |
33.33 ms |
480 |
720 |
625-line PAL |
Full |
25 |
40.0 ms |
576 |
720 |
Note
1.Although different combinations of values for Rows and Columns values are possible while respecting the maximum values listed above, it is recommended that the typical 4:3 ratio of image width to height be maintained in order to avoid image deformation by MPEG2 decoders. A common way to maintain the ratio of width to height is to pad the image with black areas on either side.
2."Half" definition of pictures (240x352 and 288x352 for NTSC and PAL, respectively) are always supported by decoders.
3.Main Profile / Main Level allows for various different display and pixel aspect ratios, including the use of square pixels, and the use of non-square pixels with display aspect ratios of 4:3 and 16:9. DICOM specifies no additional restrictions beyond what is provided for in Main Profile / Main Level. All permutations allowed by Main Profile / Main Level are valid and are require to be supported by all DICOM decoders.
- Standard -
Page 74 |
DICOM PS3.5 2020a - Data Structures and Encoding |
4.The actual frame rate for NTSC MPEG2 is approximately 29.97 frames/sec.
5.The nominal Frame Time is supplied for the purpose of inclusion on the DICOM Cine Module Attributes, and should be calculated from the actual frame rate.
One fragment shall contain the whole MPEG2 stream.
Note
1.If a video stream exceeds the maximum length of one fragment, it may be sent as multiple SOP Instances, but each SOP Instance will contain an independent and playable bit stream, and not depend on the encoded bit stream in other (previous) instances. The manner in which such separate instances are related is not specified in the Standard, but mechanisms such as grouping into the same Series, and references to earlier instances using Referenced Image Se- quence may be used.
2.This constraint limits the length of the compressed bit stream to no longer than 232-2 bytes.
The Basic Offset Table shall be empty (present but zero length).
Note
The Basic Offset Table is not used because MPEG2 contains its own mechanism for describing navigation of frames. To enable decoding of only a part of the sequence, MPEG2 manages a header in any group of pictures (GOP) containing a time_code - a 25-bit integer containing the following: drop_frame_flag, time_code_hours, time_code_minutes, marker_bit, time_code_seconds and time_code_pictures.
The container format for the video bit stream is not constrained. For example, it may MPEG-2 Transport Stream (MPEG-TS), MPEG- 2 Program Stream (MPEG-PS), MPEG-2 Elementary Stream (MPEG-ES), MPEG-2 Packetized Elementary Stream (MPEG-PES) (see [ISO/IEC 13818-1]) or MPEG-4 (MP4) container (see [ISO/IEC 14496-12] and [ISO/IEC 14496-14]).
Any audio components present within the MPEG bit stream shall comply with the following restrictions:
•CBR MPEG-1 LAYER III (MP3) Audio Standard
•up to 24 bits
•32 kHz, 44.1 kHz or 48 kHz for the main channel (the complementary channels can be sampled at the half rate, as defined in the Standard)
•one main mono or stereo channel, and optionally one or more complementary channel(s)
Note
1.MPEG-1 Layer III is standardized in Part 3 of the MPEG-1 standard (see [ISO/IEC 11172-3]).
2.Although MPEG describes each channel as including up to 5 signals (e.g., for surround effects), it is recommended to limit each of the two channels to 2 signals each one (stereo).
8.2.6 MPEG2 Main Profile / High Level Video Compression
MPEG2 Main Profile / High Level corresponds to what is commonly known as HDTV ('High Definition Television'). DICOM provides a mechanism for supporting the use of MPEG2 Main Profile / High Level Video Compression through the Encapsulated Format (see PS3.3). Annex A defines a Transfer Syntax that references the MPEG2 Main Profile / High Level Standard.
Note
MPEG2 compression is inherently lossy. The context where the usage of lossy compression of medical images is clinically acceptableisbeyondthescopeoftheDICOMStandard.Thepoliciesassociatedwiththeselectionofappropriatecompression parameters (e.g., compression ratio) for MPEG2 Main Profile / High Level are also beyond the scope of this Standard.
The use of the DICOM Encapsulated Format to support MPEG2 Main Profile / High Level compressed pixel data requires that the Data Elements that are related to the Pixel Data encoding (e.g., Photometric Interpretation, Samples per Pixel, Planar Configuration, Bits Allocated, Bits Stored, High Bit, Pixel Representation, Rows, Columns, etc.) shall contain values that are consistent with the
- Standard -
DICOM PS3.5 2020a - Data Structures and Encoding |
Page 75 |
characteristics of the compressed data stream, with some specific exceptions noted here. The Pixel Data characteristics included in the MPEG2 Main Profile / High Level bit stream shall be used to decode the compressed data stream.
Note
Theserequirementsarespecifiedintermsofconsistencywithwhatisencapsulated,ratherthanintermsoftheuncompressed pixel data from which the compressed data stream may have been derived.
Whendecompressing,shouldthecharacteristicsexplicitlyspecifiedinthecompresseddatastreambeinconsistentwiththosespecified in the DICOM Data Elements, those explicitly specified in the compressed data stream should be used to control the decompression. The DICOM data elements, if inconsistent, can be regarded as suggestions as to the form in which an uncompressed Data Set might be encoded, subject to the general and IOD-specific rules for uncompressed Photometric Interpretation and Planar Configuration, which may require that decompressed data be converted to one of the permitted forms.
Note
IfMPEG2CompressedPixelDataisdecompressedandre-encodedinNative(uncompressed)form,thentheDataElements that are related to the Pixel Data encoding are updated accordingly. If color components are converted from YBR_PAR- TIAL_420 to RGB during decompression and Native re-encoding, the Photometric Interpretation will be changed to RGB in the Data Set with the Native encoding.
The requirements are:
•Planar Configuration (0028,0006) shall be 0
Note
The value of Planar Configuration (0028,0006) is irrelevant since the manner of encoding components is specified in the MPEG2 standard, hence it is set to 0.
•Samples per Pixel (0028,0002) shall be 3
•Photometric Interpretation (0028,0004) shall be YBR_PARTIAL_420 or MONOCHROME2
•Bits Allocated (0028,0100) shall be 8
•Bits Stored (0028,0101) shall be 8
•High Bit (0028,0102) shall be 7
•Pixel Representation (0028,0103) shall be 0
•Rows (0028,0010) shall be either 720 or 1080
•Columns (0028,0011) shall be 1280 if Rows is 720, or shall be 1920 if Rows is 1080.
•The value of MPEG2 aspect_ratio_information shall be 0011 in the encapsulated MPEG2 data stream corresponding to a 'Display Aspect Ratio' (DAR) of 16:9.
•The DICOM attribute Pixel Aspect Ratio (0028,0034) shall be absent. This corresponds to a 'Sampling Aspect Ratio' (SAR) of 1:1.
•Cine Rate (0018,0040) and Frame Time (0018,1063) or Frame Time Vector (0018,1065) shall be consistent with the limitations of Main Profile / High Level, as specified in Table 8-2.
Table 8-2. MPEG2 Main Profile / High Level Image Transfer Syntax Frame Rate Attributes
Video Type |
Spatial resolution layer |
Frame Rate (see Note |
Frame Time (see Note 3) |
|
|
2) |
|
30 Hz HD |
Single level, Enhancement |
30 |
33.33 ms |
25 Hz HD |
Single level, Enhancement |
25 |
40.0 ms |
60 Hz HD |
Single level, Enhancement |
60 |
16.67 ms |
- Standard -
Page 76 |
DICOM PS3.5 2020a - Data Structures and Encoding |
|
|
Video Type |
Spatial resolution layer |
Frame Rate (see Note |
Frame Time (see Note 3) |
|
|
2) |
|
50 Hz HD |
Single level, Enhancement |
50 |
20.00 ms |
Note
1.Therequirementsonrowsandcolumnsaretomaximizeinteroperabilitybetweensoftwareenvironmentsandcommonly available hardware MPEG2 encoder/decoder implementations. Should the source picture have a lower value, it should be re-formatted accordingly by scaling and/or pixel padding prior to MPEG2 encoding.
2.The frame rate of the acquiring camera for '30 Hz HD' MPEG2 may be either 30 or 30/1.001 (approximately 29.97) frames/sec.Similarly,theframerateinthecaseof60Hzmaybeeither60or60/1.001(approximately59.94)frames/sec This may lead to small inconsistencies between the video timebase and real time.
3.The Frame Time (0018,1063) may be calculated from the frame rate of the acquiring camera. A frame time of 33.367 ms corresponds to 29.97 frames per second.
4.The value of chroma_format for this profile and level is defined by MPEG as 4:2:0.
5.Examples of screen resolutions supported by MPEG2 Main Profile / High Level are shown in Table 8-y. Frame rates of 50 Hz and 60 Hz (progressive) at the maximum resolution of 1080 by 1920 are not supported by Main Profile / High Level. Interlace at the maximum resolution is supported at a field rate of 50 Hz or 60 Hz, which corresponds to a frame rate of 25 Hz or 30 Hz respectively as described in Table 8-y.
6.An MPEG2 Main Profile / High Level decoder is able to decode bit streams conforming to lower levels. These include the 1080 by 1440 bit streams of MP@H-14, and the Main Level bit streams used in the existing MPEG2 Main Profile / Main Level Transfer Syntax in the Visible Light IOD.
7.MP@H-14 is not supported by this Transfer Syntax.
8.TherestrictionofDARto16:9isrequiredtoensureinteroperabilitybecauseoflimitationsincommonlyavailablehardware chip set implementations for MPEG2 Main Profile / High Level.
Table 8-3. Examples of MPEG2 Main Profile / High Level Screen Resolution
Rows |
Columns |
Frame rate |
Video Type |
Progressive or |
|
|
|
|
Interlace |
1080 |
1920 |
25 |
25 Hz HD |
P |
1080 |
1920 |
29.97, 30 |
30 Hz HD |
P |
1080 |
1920 |
25 |
25 Hz HD |
I |
1080 |
1920 |
29.97, 30 |
30 Hz HD |
I |
720 |
1280 |
25 |
25 Hz HD |
P |
720 |
1280 |
29.97, 30, |
30 Hz HD |
P |
720 |
1280 |
50 |
50 Hz HD |
P |
720 |
1280 |
59.94, 60 |
60 Hz HD |
P |
One fragment shall contain the whole MPEG2 bit stream.
Note
1.If a video stream exceeds the maximum length of one fragment (approximately 4 GB), it may be sent as multiple SOP Instances, but each SOP Instance will contain an independent and playable bit stream, and not depend on the encoded bit stream in other (previous) instances. The manner in which such separate instances are related is not specified in the Standard, but mechanisms such as grouping into the same Series, and references to earlier instances using Refer- enced Image Sequence may be used.
2.This constraint limits the length of the compressed bit stream to no longer than 232-2 bytes.
- Standard -
DICOM PS3.5 2020a - Data Structures and Encoding |
Page 77 |
The Basic Offset Table in the Pixel Data (7FE0,0010) shall be empty (present but zero length).
Note
The Basic Offset Table is not used because MPEG2 contains its own mechanism for describing navigation of frames. To enable decoding of only a part of the sequence, MPEG2 manages a header in any group of pictures (GOP) containing a time_code - a 25-bit integer containing the following: drop_frame_flag, time_code_hours, time_code_minutes, marker_bit, time_code_seconds and time_code_pictures.
The container format for the video bit stream is not constrained. For example, it may MPEG-2 Transport Stream (MPEG-TS), MPEG- 2 Program Stream (MPEG-PS), MPEG-2 Elementary Stream (MPEG-ES), MPEG-2 Packetized Elementary Stream (MPEG-PES) (see [ISO/IEC 13818-1]) or MPEG-4 (MP4) container (see [ISO/IEC 14496-12] and [ISO/IEC 14496-14]).
Any audio components present within the MPEG2 Main Profile / High Level bit stream shall comply with the restrictions as for MPEG2 Main Profile / Main Level as stated in Section 8.2.5.
8.2.7 MPEG-4 AVC/H.264 High Profile / Level 4.1 Video Compression
MPEG-4 AVC/H.264 High Profile / Level 4.1 corresponds to what is commonly known as HDTV ('High Definition Television'). DICOM provides a mechanism for supporting the use of MPEG-4 AVC/H.264 Image Compression through the Encapsulated Format (see PS3.3). Annex A defines a Transfer Syntax that references the MPEG-4 AVC/H.264 Standard.
Note
MPEG-4 AVC/H.264 compression / High Profile compression is inherently lossy. The context where the usage of lossy compressionofmedicalimagesisclinicallyacceptableisbeyondthescopeoftheDICOMStandard.Thepoliciesassociated with the selection of appropriate compression parameters (e.g., compression ratio) for MPEG-4 AVC/H.264 High Profile / Level 4.1 are also beyond the scope of this Standard.
The use of the DICOM Encapsulated Format to support MPEG-4 AVC/H.264 compressed pixel data requires that the Data Elements that are related to the Pixel Data encoding (e.g., Photometric Interpretation, Samples per Pixel, Planar Configuration, Bits Allocated, Bits Stored, High Bit, Pixel Representation, Rows, Columns, etc.) shall contain values that are consistent with the characteristics of the compressed data stream, with some specific exceptions noted here. The Pixel Data characteristics included in the MPEG-4 AVC/H.264 bit stream shall be used to decode the compressed data stream.
Note
Theserequirementsarespecifiedintermsofconsistencywithwhatisencapsulated,ratherthanintermsoftheuncompressed pixel data from which the compressed data stream may have been derived.
Whendecompressing,shouldthecharacteristicsexplicitlyspecifiedinthecompresseddatastreambeinconsistentwiththosespecified in the DICOM Data Elements, those explicitly specified in the compressed data stream should be used to control the decompression. The DICOM data elements, if inconsistent, can be regarded as suggestions as to the form in which an uncompressed Data Set might be encoded, subject to the general and IOD-specific rules for uncompressed Photometric Interpretation and Planar Configuration, which may require that decompressed data be converted to one of the permitted forms.
Note
IfMPEG-4CompressedPixelDataisdecompressedandre-encodedinNative(uncompressed)form,thentheDataElements that are related to the Pixel Data encoding are updated accordingly. If color components are converted from YBR_PAR- TIAL_420 to RGB during decompression and Native re-encoding, the Photometric Interpretation will be changed to RGB in the Data Set with the Native encoding.
The requirements are:
•Planar Configuration (0028,0006) shall be 0
•Samples per Pixel (0028,0002) shall be 3
•Photometric Interpretation (0028,0004) shall be YBR_PARTIAL_420
•Bits Allocated (0028,0100) shall be 8
- Standard -
Page 78 |
DICOM PS3.5 2020a - Data Structures and Encoding |
•Bits Stored (0028,0101) shall be 8
•High Bit (0028,0102) shall be 7
•Pixel Representation (0028,0103) shall be 0
•The value of MPEG-4 AVC/H.264 sample aspect_ratio_idc shall be 1 in the encapsulated MPEG-4 AVC/H.264 bit stream if as- pect_ratio_info_present_flag is 1.
•Pixel Aspect Ratio (0028,0034) shall be absent. This corresponds to a 'Sampling Aspect Ratio' (SAR) of 1:1.
•The possible values for Rows (0028,0010), Columns (0028,0011), Cine Rate (0018,0040), and Frame Time (0018,1063) or Frame Time Vector (0018,1065) depend on the used Transfer Syntax.
•For MPEG-4 AVC/H.264 High Profile / Level 4.1 Transfer Syntax, the values for these data elements shall be compliant with the High Profile / Level 4.1 of the MPEG-4 AVC/H.264 standard ([ISO/IEC 14496-10]) and restricted to a square pixel aspect ratio.
•For MPEG-4 AVC/H.264 BD-compatible High Profile / Level 4.1 Transfer Syntax, the values for these data elements shall be as specified in Table 8-4.
Table 8-4. Values Permitted for MPEG-4 AVC/H.264 BD-compatible High Profile / Level 4.1
Rows |
Columns |
Frame rate |
Video Type |
Progressive or |
|
|
|
|
Interlace |
1080 |
1920 |
25 |
25 Hz HD |
I |
1080 |
1920 |
29.97 |
30 Hz HD |
I |
1080 |
1920 |
24 |
24 Hz HD |
P |
1080 |
1920 |
23.976 |
24 Hz HD |
P |
720 |
1280 |
50 |
50 Hz HD |
P |
720 |
1280 |
59.94 |
60 Hz HD |
P |
720 |
1280 |
24 |
24 Hz HD |
P |
720 |
1280 |
23.976 |
24 Hz HD |
P |
Note
1.The value of Planar Configuration (0028,0006) is irrelevant since the manner of encoding components is specified in the MPEG-4 AVC/H.264 standard, hence it is set to 0.
2.The limitation on rows and columns are to maximize interoperability between software environments and commonly available hardware MPEG-4 AVC/H.264 encoder/decoder implementations. Source pictures that have a lower value should be re-formatted by scaling and/or pixel padding prior to MPEG-4 AVC/H.264 encoding.
3.The frame rate of the acquiring camera for '30 Hz HD' MPEG-4 AVC/H.264 may be either 30 or 30/1.001 (approximately 29.97) frames/sec. Similarly, the frame rate in the case of 60 Hz may be either 60 or 60/1.001 (approximately 59.94) frames/sec.Thismayleadtosmallinconsistenciesbetweenthevideotimebaseandrealtime.Therelationshipbetween frame rate and frame time is shown in Table 8-5.
4.The Frame Time (0018,1063) may be calculated from the frame rate of the acquiring camera. A frame rate of 29.97 frames per second corresponds to a frame time of 33.367 ms.
5.The value of chroma_format for this profile and level is defined by MPEG as 4:2:0.
6.Example screen resolutions supported by MPEG-4 AVC/H.264 High Profile / Level 4.1 can be taken from Table 8-4. Frame rates of 50 Hz and 60 Hz (progressive) at the maximum resolution of 1080 by 1920 are not supported by MPEG- 4 AVC/H.264 High Profile / Level 4.1. Interlace at the maximum resolution is supported at a field rate of 50 Hz or 60 Hz, which corresponds to a frame rate of 25 Hz or 30 Hz respectively. Smaller resolutions may be used as long as they comply with the square pixel aspect ratio. An example is XGA resolution with an image resolution of 768 by 1024 pixels. For smaller resolutions there are higher frame rates possible. For example it may be up to 80 Hz for XGA.
- Standard -
DICOM PS3.5 2020a - Data Structures and Encoding |
Page 79 |
7.The display aspect ratio is defined implicitly by the pixel resolution of the video picture. Only square pixel aspect ratio is allowed. MPEG-4 AVC/H.264 BD-compatible High Profile / Level 4.1 will only support resolutions that result in a 16:9 display aspect ratio
8.The permitted screen resolutions for the MPEG-4 AVC/H.264 BD-compatible High Profile / Level 4.1 are listed in Table 8-4. Only HD resolutions and no progressive frame rates for 25 or 29.97 frames per seconds are supported. Frame rates of 50 Hz and 60 Hz (progressive) at the maximum resolution of 1080 by 1920 are not supported.
Table 8-5. MPEG-4 AVC/H.264 High Profile / Level 4.1 Image Transfer Syntax Frame Rate Attributes
Video Type |
Spatial resolution layer |
Frame Rate (see Note |
Frame Time (see Note 3) |
|
|
2) |
|
30 Hz HD |
Single level, Enhancement |
30 |
33.33 ms |
25 Hz HD |
Single level, Enhancement |
25 |
40.0 ms |
60 Hz HD |
Single level, Enhancement |
60 |
16.67 ms |
50 Hz HD |
Single level, Enhancement |
50 |
20.00 ms |
One fragment shall contain the whole MPEG-4 AVC/H.264 bit stream.
Note
If a video stream exceeds the maximum length of one fragment (approximately 4 GB), it may be sent as multiple SOP In- stances, but each SOP Instance will contain an independent and playable bit stream, and not depend on the encoded bit stream in other (previous) instances. The manner in which such separate instances are related is not specified in the Standard, but mechanisms such as grouping into the same Series, and references to earlier instances using Referenced Image Sequence may be used.
ThecontainerformatforthevideobitstreamshallbeMPEG-2TransportStream,a.k.a.MPEG-TS(see[ISO/IEC13818-1])orMPEG- 4, a.k.a. MP4 container (see [ISO/IEC 14496-12] and [ISO/IEC 14496-14]). The PTS/DTS of the transport stream shall be used in the MPEG coding.
Any audio components included in the data container shall follow the constraints detailed in Section 8.2.12 Constraints for Audio Data
Integration in AVC and HEVC Compressed Bit Streams.
8.2.8 MPEG-4 AVC/H.264 High Profile / Level 4.2 Video Compression
DICOM provides a mechanism for supporting the use of MPEG-4 AVC/H.264 Image Compression through the Encapsulated Format (see PS3.3). Annex A defines Transfer Syntaxes that reference the MPEG-4 AVC/H.264 Standard.
Note
MPEG-4 AVC/H.264 compression / High Profile compression is inherently lossy. The context where the usage of lossy compressionofmedicalimagesisclinicallyacceptableisbeyondthescopeoftheDICOMStandard.Thepoliciesassociated with the selection of appropriate compression parameters (e.g., compression ratio) for MPEG-4 AVC/H.264 High Profile / Level 4.2 are also beyond the scope of this Standard.
The use of the DICOM Encapsulated Format to support MPEG-4 AVC/H.264 compressed pixel data requires that the Data Elements that are related to the Pixel Data encoding (e.g., Photometric Interpretation, Samples per Pixel, Planar Configuration, Bits Allocated, Bits Stored, High Bit, Pixel Representation, Rows, Columns, etc.) shall contain values that are consistent with the characteristics of the compressed data stream, with some specific exceptions noted here. The Pixel Data characteristics included in the MPEG-4 AVC/H.264 bit stream shall be used to decode the compressed data stream.
Note
Theserequirementsarespecifiedintermsofconsistencywithwhatisencapsulated,ratherthanintermsoftheuncompressed pixel data from which the compressed data stream may have been derived.
Whendecompressing,shouldthecharacteristicsexplicitlyspecifiedinthecompresseddatastreambeinconsistentwiththosespecified in the DICOM Data Elements, those explicitly specified in the compressed data stream should be used to control the decompression. The DICOM data elements, if inconsistent, can be regarded as suggestions as to the form in which an uncompressed Data Set might
- Standard -
Page 80 |
DICOM PS3.5 2020a - Data Structures and Encoding |
be encoded, subject to the general and IOD-specific rules for uncompressed Photometric Interpretation and Planar Configuration, which may require that decompressed data be converted to one of the permitted forms.
Note
IfMPEG-4CompressedPixelDataisdecompressedandre-encodedinNative(uncompressed)form,thentheDataElements that are related to the Pixel Data encoding are updated accordingly. If color components are converted from YBR_PAR- TIAL_420 to RGB during decompression and Native re-encoding, the Photometric Interpretation will be changed to RGB in the Data Set with the Native encoding.
The requirements are:
•Planar Configuration (0028,0006) shall be 0
•Samples per Pixel (0028,0002) shall be 3
•Photometric Interpretation (0028,0004) shall be YBR_PARTIAL_420
•Bits Allocated (0028,0100) shall be 8
•Bits Stored (0028,0101) shall be 8
•High Bit (0028,0102) shall be 7
•Pixel Representation (0028,0103) shall be 0
•The value of MPEG-4 AVC/H.264 sample aspect_ratio_idc shall be 1 in the encapsulated MPEG-4 AVC/H.264 bit stream if as- pect_ratio_info_present_flag is 1.
•Pixel Aspect Ratio (0028,0034) shall be absent. This corresponds to a 'Sampling Aspect Ratio' (SAR) of 1:1.
•The values for Rows (0028,0010), Columns (0028,0011), Cine Rate (0018,0040), and Frame Time (0018,1063) or Frame Time Vector (0018,1065) shall be compliant with the High Profile / Level 4.2 of the MPEG-4 AVC/H.264 standard ([ISO/IEC 14496-10]) and restricted to a square pixel aspect ratio.
Note
1.The value of Planar Configuration (0028,0006) is irrelevant since the manner of encoding components is specified in the MPEG-4 AVC/H.264 standard, hence it is set to 0.
2.The frame rate of the acquiring camera for '30 Hz HD' MPEG-4 AVC/H.264 may be either 30 or 30/1.001 (approximately 29.97) frames/sec. Similarly, the frame rate in the case of 60 Hz may be either 60 or 60/1.001 (approximately 59.94) frames/sec.Thismayleadtosmallinconsistenciesbetweenthevideotimebaseandrealtime.Therelationshipbetween frame rate and frame time is shown in Table 8-7.
3.The Frame Time (0018,1063) may be calculated from the frame rate of the acquiring camera. A frame rate of 29.97 frames per second corresponds to a frame time of 33.367 ms.
4.The value of chroma_format for this profile and level is defined by MPEG as 4:2:0.
Table 8-7. MPEG-4 AVC/H.264 High Profile / Level 4.2 Image Transfer Syntax Frame Rate Attributes
Video Type |
Frame Rate (see Note 2) |
Frame Time (see Note 3) |
30 Hz HD |
30 |
33.33 ms |
25 Hz HD |
25 |
40.0 ms |
60 Hz HD |
60 |
16.67 ms |
50 Hz HD |
50 |
20.00 ms |
Stereo Pairs Present (0022,0028) shall be YES if stereoscopic pairs are present, otherwise shall be NO or absent.
- Standard -