Image encoding method and apparatus, and image decoding method and apparatus

文档序号：1472532 发布日期：2020-02-21 浏览：30次中文

阅读说明：本技术 图像编码方法和设备以及图像解码方法和设备 (Image encoding method and apparatus, and image decoding method and apparatus ) 是由崔棋镐朴缗茱艾琳娜·阿尔辛娜于 2018-07-04 设计创作，主要内容包括：一个实施例可提供一种图像解码方法和能够执行所述图像解码方法的图像解码设备，所述图像解码方法包括如下步骤：确定是否获得第二变换集信息；基于确定是否获得第二变换集信息，确定第二变换集信息；通过使用确定的第二变换集信息来从与多个预测模式对应的多个第二变换集候选中选择应用于第二变换块的任意一个第二变换集；通过基于包括在选择的第二变换集中的多个变换矩阵对第二变换块进行逆变换来生成第一变换块；通过基于第一变换矩阵对第一变换块进行逆变换来生成与第一变换块对应的残差块。(An embodiment may provide an image decoding method and an image decoding apparatus capable of performing the image decoding method, the image decoding method including the steps of: determining whether second transformation set information is obtained; determining second transformation set information based on determining whether second transformation set information is obtained; selecting any one second transform set applied to the second transform block from a plurality of second transform set candidates corresponding to the plurality of prediction modes by using the determined second transform set information; generating a first transform block by inverse-transforming the second transform block based on a plurality of transform matrices included in the selected second transform set; a residual block corresponding to the first transform block is generated by inverse transforming the first transform block based on the first transform matrix.)

1. An image decoding method, comprising:

determining whether second transformation set information is obtained;

determining second transformation set information based on determining whether second transformation set information is obtained;

selecting any one of second transform sets applied to the second transform block from a plurality of second transform set candidates corresponding to the plurality of prediction modes by using the determined second transform set information;

generating a first transform block by inverse-transforming the second transform block based on a plurality of transform matrices included in the selected second transform set; and is

A residual block corresponding to the first transform block is generated by inverse transforming the first transform block based on the first transform matrix.

2. The image decoding method according to claim 1,

the second transform set corresponds to a predetermined angle for generating the first transform block by rotational-transforming the second transform block, and

the plurality of transformation matrices included in the second transformation set include a vertical transformation matrix and a horizontal transformation matrix for a rotational transformation.

3. The image decoding method of claim 1, wherein the second transform set information includes at least one of: information indicating whether to perform a second transform on the first transform block, and second transform set selection information indicating a second transform set applied to the first transform block.

4. The image decoding method of claim 1, wherein the determining of the second transform set information comprises:

when it is determined that the second transform set information is not obtained, generating the first transform block by not inverse-transforming the second transform block or by inverse-transforming the second transform block based on any one of second transform sets included in a predetermined base set transform set group; and

when it is determined that the second transform set information is obtained, the second transform set information is obtained from the bitstream.

5. The image decoding method of claim 1, wherein the plurality of prediction modes comprises at least two intra prediction modes of a plurality of intra prediction modes for the first transform block.

6. The image decoding method of claim 1, wherein the determining whether the second transform set information is obtained comprises: whether to obtain the second transform set information is determined based on at least one of a size, a shape, a prediction mode, (encoding tool), a number of non-zero coefficients, a sum of squares of the non-zero coefficients, a depth, and a quantization parameter of the second transform block.

7. The image decoding method of claim 1, wherein the determining of the second transform set information comprises: when the information indicating whether to perform the second transform indicates that the second transform is not to be performed, second transform set selection information indicating a second transform set applied to the first transform block is not obtained.

8. The image decoding method of claim 3, wherein the information indicating the second transform set applied to the first transform block is decoded based on any one of: a context-based adaptive binary arithmetic coding (CABAC) method, a fixed length method and a unary method,

wherein the CABAC method performs context modeling by using information on a second transform set of peripheral blocks of the first transform block.

9. The image decoding method of claim 1, wherein the generating of the first transform block by inverse-transforming the second transform block comprises: the coefficients of the first transform block are inversely rearranged by inversely transforming the second transform block using a transform matrix selected from a second transform set corresponding to any one of a plurality of predetermined rearrangement angles.

10. The image decoding method according to claim 1,

the generating of the first transform block by inverse-transforming the second transform block includes: generating a first transform block by inverse-transforming at least a part of the second transform block, and

the second transform block is a coefficient block having a size of 1: N or N:1 ratio.

11. An image encoding method comprising:

generating a first transform block by transforming the residual block based on a first transform matrix;

selecting any one of second transform sets applied to the first transform block from among a plurality of second transform set candidates corresponding to a plurality of prediction modes; and is

The second transform block is generated by transforming the first transform block based on a plurality of transform matrices included in the selected second transform set.

12. The image encoding method of claim 11, further comprising: generating second transformation set information comprising at least one of: information indicating whether to perform a second transform on the first transform block, and second transform set selection information indicating a second transform set applied to the first transform block.

13. The image encoding method of claim 11, wherein the generating of the second transform block by transforming the first transform block comprises: coefficients of the second transform block are rearranged by transforming the first transform block using a transform matrix selected from a second transform set corresponding to any one of a plurality of predetermined rearrangement angles.

14. The image encoding method as claimed in claim 11,

the second transform set corresponds to a predetermined angle for generating a second transform block by rotationally transforming the first transform block, and

the plurality of transformation matrices included in the second transformation set include a vertical transformation matrix and a horizontal transformation matrix for a rotational transformation.

15. An image decoding apparatus comprising at least one processor, wherein the at least one processor is configured to:

determining whether second transformation set information is obtained;

determining second transformation set information based on whether the second transformation set information is obtained;

generating a first transform block by inverse-transforming the second transform block based on a plurality of transform matrices included in the selected second transform set; and is

A residual block corresponding to the first transform block is generated by inverse transforming the first transform block based on the first transform matrix.

Technical Field

The present disclosure relates to an image encoding method and apparatus and an image decoding method and apparatus. More particularly, the present disclosure relates to methods and apparatuses for encoding and decoding coefficients of a frequency domain.

Background

According to most of the image encoding method and apparatus and the image decoding method and apparatus, an image of a pixel domain is transformed into an image of a frequency domain and encoded for image compression. Discrete Cosine Transform (DCT) is a well-known technique for image or speech compression. In recent years, many attempts have been made to find more efficient coding methods. In audio coding, parametric coding shows better results than DCT, and for two-dimensional (2D) data, the coefficients of Karhunen Loeve Transform (KLT) have the smallest bit size, but the amount of overhead information increases significantly.

Disclosure of Invention

Technical problem

An image encoding method and apparatus and an image decoding method and apparatus are provided that provide efficient compression and have minimal overhead information.

Solution to the problem

One or more embodiments provide an image decoding method including: determining whether second transformation set information is obtained; determining second transformation set information based on determining whether second transformation set information is obtained; selecting any one of second transform sets applied to the second transform block from a plurality of second transform set candidates corresponding to the plurality of prediction modes by using the determined second transform set information; generating a first transform block by inverse-transforming the second transform block based on a plurality of transform matrices included in the selected second transform set; and generating a residual block corresponding to the first transform block by inverse-transforming the first transform block based on the first transform matrix.

One or more embodiments provide an image encoding method including: generating a first transform block by transforming the residual block based on a first transform matrix; selecting any one of second transform sets applied to the first transform block from among a plurality of second transform set candidates corresponding to a plurality of prediction modes; and generating a second transform block by transforming the first transform block based on a plurality of transform matrices included in the selected second transform set.

One or more embodiments include an image decoding apparatus, wherein the image decoding apparatus includes at least one processor configured to: determining whether second transformation set information is obtained; determining second transformation set information based on determining whether second transformation set information is obtained; selecting any one of second transform sets applied to the second transform block from a plurality of second transform set candidates corresponding to the plurality of prediction modes by using the determined second transform set information; generating a first transform block by inverse-transforming the second transform block based on a plurality of transform matrices included in the selected second transform set; and generating a residual block corresponding to the first transform block by inverse-transforming the first transform block based on the first transform matrix.

Advantageous effects

Based on the image encoding method and apparatus and the image decoding method and apparatus according to the present disclosure, efficient compression can be provided and minimum overhead information can be generated. Thus, overall, the amount of data to be stored or to be transmitted to the decoder can be reduced.

Drawings

Fig. 1 is a block diagram of an image decoding apparatus according to an embodiment.

Fig. 2 is a block diagram of an image encoding apparatus according to an embodiment.

Fig. 3 is a flowchart of a process of performing an inverse second transform performed by the image decoding apparatus according to the embodiment.

Fig. 4 is a flowchart of a process of performing the second transform performed by the image encoding apparatus according to the embodiment.

Fig. 5 illustrates a method of performing a second transformation by using a horizontal transformation matrix and a vertical transformation matrix included in a second transformation set according to an embodiment.

Fig. 6 illustrates a process of applying an inverse second transform performed by the image decoding apparatus according to the embodiment.

Fig. 7 illustrates an example of transform angles corresponding to a plurality of Intra Prediction Modes (IPMs) according to an embodiment.

Fig. 8 illustrates a process of determining at least one coding unit by dividing a current coding unit according to an embodiment.

Fig. 9 illustrates a process of determining at least one coding unit by dividing a non-square coding unit according to an embodiment.

Fig. 10 illustrates a process of dividing a coding unit based on at least one of block shape information and division shape mode information according to an embodiment.

Fig. 11 illustrates a method of determining a predetermined coding unit from an odd number of coding units according to an embodiment.

Fig. 12 illustrates an order of processing a plurality of coding units when the plurality of coding units are determined by dividing a current coding unit according to an embodiment.

Fig. 13 illustrates a process of determining that a current coding unit is to be divided into odd-numbered coding units when the coding units cannot be processed in a predetermined order according to an embodiment.

Fig. 14 illustrates a process of determining at least one coding unit by dividing a first coding unit according to an embodiment.

Fig. 15 illustrates that when a second coding unit having a non-square shape determined by dividing a first coding unit satisfies a predetermined condition, shapes into which the second coding unit can be divided are limited, according to an embodiment.

Fig. 16 illustrates a process of dividing a square coding unit when the division shape mode information indicates that the square coding unit is not to be divided into four square coding units according to an embodiment.

Fig. 17 illustrates that a processing order between a plurality of coding units may be changed according to a process of dividing the coding units according to an embodiment.

Fig. 18 illustrates a process of determining a depth of a coding unit as a shape and a size of the coding unit change when the coding unit is recursively divided to determine a plurality of coding units according to an embodiment.

Fig. 19 illustrates a depth that can be determined based on the shape and size of a coding unit and a Partial Index (PID) for distinguishing the coding units according to an embodiment.

Fig. 20 illustrates determining a plurality of coding units based on a plurality of predetermined data units included in a picture according to an embodiment.

Fig. 21 illustrates a processing block used as a unit for determining a determination order of reference coding units included in a picture according to an embodiment.

Detailed Description

Best mode for carrying out the invention

One or more embodiments provide an image encoding method including: generating a first transform block by transforming the residual block based on a first transform matrix; selecting any one of second transform sets applied to the first transform block from among a plurality of second transform set candidates corresponding to a plurality of prediction modes; the second transform block is generated by transforming the first transform block based on a plurality of transform matrices included in the selected second transform set.

One or more embodiments include an image decoding apparatus, wherein the image decoding apparatus includes at least one processor configured to: determining whether second transformation set information is obtained; determining second transformation set information based on determining whether second transformation set information is obtained; selecting any one of second transform sets applied to the second transform block from a plurality of second transform set candidates corresponding to the plurality of prediction modes by using the determined second transform set information; generating a first transform block by inverse-transforming the second transform block based on a plurality of transform matrices included in the selected second transform set; a residual block corresponding to the first transform block is generated by inverse transforming the first transform block based on the first transform matrix.

Disclosure of the invention

Advantages and features of the present disclosure and methods of accomplishing the same will become more apparent by reference to the embodiments that are described below with reference to the accompanying drawings. This disclosure may, however, be embodied in many different forms and should not be construed as limited to the embodiments set forth herein; rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the concept of the disclosure to those skilled in the art.

Terms used in the present specification will be described briefly and the present disclosure will be described in detail.

Terms used in the present disclosure are selected as terms that are widely used at present as general terms as possible in consideration of their functions in the present disclosure. However, the terms may be changed according to the intention of a person skilled in the relevant art, a case, or the emergence of new technology. Further, some terms used herein may be arbitrarily selected by the applicant. In this case, these terms are defined in detail in the following detailed description. Accordingly, the terms used herein should be understood based on their unique meaning and the entire context of the present disclosure.

As used herein, the singular forms "a", "an" and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise.

Throughout the specification, it will be further understood that: when a component "comprises" or "comprising" an element, the component may also comprise, but not exclude, other elements, unless otherwise defined. In addition, the term "unit" used in the specification may represent a software or hardware (such as FPGA or ASIC) component, and may perform a specific function. However, the "unit" is not limited to software or hardware. The "unit" may be configured to be included in a storage medium that can perform addressing, or may be configured to reproduce one or more processors. Thus, for example, a "unit" may include software components, object-oriented software components, class components and task components, processors, functions, attributes, programs, subroutines, segments of program code, drivers, firmware, microcode, circuitry, data, databases, data structures, tables, arrays, and variables. The functionality provided for by the components and "units" may be integrated into a fewer number of components and "units" or may be further separated into additional components and "units".

Hereinafter, "image" may indicate a still image such as a still image of a video, or may indicate a moving image, that is, an animated moving image such as a video.

Hereinafter, "sampling point" is data assigned to a sampling point position of an image and may represent data to be processed. For example, the pixel values in the image in the spatial domain and the transform coefficients on the transform region may be samples. A unit including at least one of the samples may be defined as a block.

Hereinafter, embodiments of the present disclosure will be described in detail with reference to the accompanying drawings so that those of ordinary skill in the art can easily perform the embodiments. In addition, portions irrelevant to the description will be omitted to clearly describe the present disclosure.

Hereinafter, the image encoding apparatus and the image decoding apparatus, and the image encoding method and the image decoding method will be described in detail with reference to fig. 1 to 21. A method of determining a data unit of an image will be described according to an embodiment with reference to fig. 8 to 21 and an image encoding method and apparatus and an image decoding method and apparatus that perform a second transform by using a second transform set including a transform matrix will be described with reference to fig. 1 to 7.

In this disclosure, the second transformation represents: for a first transform block, a transform is performed to a second transform block.

In more detail, in an image encoding process using an image encoding apparatus, a residual block may be transformed into a first transform block via a first transform, the first transform block may be transformed into a second transform block via a second transform, and the second transform block may be transformed into a quantized second transform block via quantization.

In contrast, in the image decoding process using the image decoding apparatus, the quantized second transform block may be transformed into the second transform block via inverse quantization, the second transform block may be transformed into the first transform block via inverse second transform, and the first transform block may be transformed into the residual block via inverse first transform.

Fig. 1 is a block diagram of an image decoding apparatus 10 according to an embodiment.

The image decoding apparatus 10 may include a second transform set information obtainer 101, an inverse second transformer 102, and a residual block generator 103. Alternatively, the second transformation set information obtainer 101, the inverse second transformer 102, and the residual block generator 103 may correspond to one processor or a plurality of processors interoperating with each other.

Image decoding apparatus 10 may determine whether to obtain the second transform set information.

The image decoding apparatus 10 may determine whether to obtain the second transform set information based on at least one of a size, a shape (that is, a ratio of a width to a height of the second transform block), a prediction mode, whether to use a specific coding tool, the number of non-zero coefficients, a sum of squares of the non-zero coefficients, a depth, and a quantization parameter of the second transform block.

Here, the number of non-zero coefficients and the sum of squares of the non-zero coefficients of the second transform block used to determine whether to obtain the second transform set information may represent the number of non-zero coefficients and the sum of squares of the non-zero coefficients included in the second transform block, wherein quantization of the second transform block is completed after the transform.

In addition, whether to obtain the second transformation set information may be determined based on whether a particular encoding tool is used. Here, the encoding tool for determining whether to obtain the second transform set information may include at least one of a multi-type tree partition (MTT), an Adaptive Motion Vector Resolution (AMVR), an extreme motion vector expression (UMVE), affine motion prediction, inter prediction modification (IPR), decoding side motion vector modification (DMVR), bidirectional optical flow (BIO), multi-core transform (MTR), Spatial Variation Transform (SVT), scan region-based coefficient coding (SRCC), transform domain residual symbol prediction (TD-RSP), multi-hypothesis probability update (bacmca), multi-parameter intra (MPI), and position-dependent intra prediction combination (PDPC). However, the encoding tool need not be so limited.

When it is determined that the second transform set information is not obtained, image decoding apparatus 10 may not inverse-transform the second transform block, or may generate the first transform block by inverse-transforming the second transform block based on the second transform set.

For example, when the multi-core transform tool is used for encoding of the first transform block, the image decoding apparatus 10 may not obtain the second transform set information and may not inverse-transform the second transform block, or may generate the first transform block by inverse-transforming the second transform block based on the second transform set.

As another example, when the sum of squares of non-zero coefficients of the quantized second transform block is greater than a certain threshold, the image decoding apparatus 10 may not obtain the second transform set information and may not inverse-transform the second transform block, or may generate the first transform block by inverse-transforming the second transform block based on the second transform set. Here, the specific threshold may be set differently based on at least one of the number of non-zero coefficients of the quantized second transform block, a sum of squares of the non-zero coefficients, a depth, and a quantization parameter.

As another example, for a 128 × 128 sized second transform block, image decoding apparatus 10 may not obtain second transform set information and may not inverse transform the second transform block, or may generate the first transform block by inverse transforming the second transform block based on the second transform set.

When it is determined that the second transform set information is obtained, the image decoding apparatus 10 may obtain the second transform set information from the bitstream.

Image decoding apparatus 10 may determine the second transform set information based on whether the second transform set information is obtained.

That is, the image decoding apparatus 10 may obtain information indicating whether to obtain the second transform set information for an upper layer unit of the current block. When the information indicating whether to obtain the second transform set information indicates that the second transform set information is obtained, the image decoding apparatus 10 may obtain the second transform set information.

The second transform set information may include at least one of information indicating whether to perform a second transform and second transform set selection information indicating a second transform set applied to the first transform block.

The second transformation set information may include an index having a length of 1 bit indicating whether to perform the second transformation and an additional index having a length of n bits indicating the second transformation set.

For example, the second transform set information may include an additional index having a length of 2 bits indicating a second transform set among the plurality of second transform set candidates. The index having a length of 2 bits may be used to specify an arbitrary second transform set from among the 4 second transform set candidates.

In another embodiment, the second transform set information may include an index having a length of n bits to indicate whether to perform both the second transform and the second transform set.

For example, the second transformation set information may include an index having a length of 2 bits to indicate whether to perform the second transformation and the second transformation set. The index having a length of 2 bits may be used to indicate whether to perform the second transform (00) or to indicate any one of 3 second transform sets (01, 10, 11).

The configuration method of the second transform set information may be determined based on at least one of a size, a shape, a prediction mode, a coding tool, a number of non-zero coefficients, a sum of squares of non-zero coefficients, a depth, and a quantization parameter of the second transform block.

For example, the second transform set information on the second transform block having the size of 64 × 64 may include an index having a length of 1 bit indicating whether to perform the second transform and an additional index having a length of 1 bit indicating the second transform set.

On the other hand, the second transform set information on the second transform block having the size of 64 × 16 may include only an index having a length of 2 bits to indicate whether to perform the second transform and both the second transform set.

The second transform set selection information indicating the second transform set may be entropy-encoded and entropy-decoded according to any one of a context-based adaptive binary arithmetic coding (CABAC) method, a fixed length method, and a unary method.

In particular, according to the CABAC method, context modeling may be performed by using information on a second transform set of peripheral blocks of the first transform block.

For example, when the unary binarization method is used, the image decoding apparatus 10 may assign each second transform set to 0, 10, 110, or 111.

As another example, when the CABAC method is used, the image decoding apparatus 10 may assign each second transform set candidate as 0, 10, 110, or 111 via the unary binarization method, and may then perform context modeling of estimating the probability of a bin required for binary arithmetic coding.

As another example, when the fixed length coding method is used, the image decoding apparatus 10 may allocate each second transform set to 00, 01, 10, or 11. When the CABAC method is used, the image decoding apparatus 10 may perform context modeling by using information on the second transform set of the peripheral block of the first transform block (that is, an index indicating whether to perform the second transform on the peripheral block of the first transform block and indicating the second transform set applied to the peripheral block of the first transform block).

An entropy encoding method of the information indicating the second transform set may be determined based on at least one of a size, a shape, a prediction mode, an encoding tool, a number of non-zero coefficients, a sum of squares of non-zero coefficients, a depth, and a quantization parameter of the second transform block.

When the information indicating whether to perform the second transform included in the second transform set information indicates that the second transform is not performed, the image decoding apparatus 10 may not obtain the information indicating the second transform set.

For example, when the index having a length of 1 bit indicating whether to perform the second transform for the first transform block indicates not to perform the second transform, the image decoding apparatus 10 may not obtain the index having a length of 2 bits indicating the second transform set, thereby reducing the total number of parameters for the second transform to reduce overhead.

The image decoding apparatus 10 may select any one of the second transform set candidates from among a plurality of second transform set candidates corresponding to a plurality of prediction modes by using the obtained second transform set information.

Here, the plurality of prediction modes may include at least two intra prediction modes.

The second transformation set may correspond to a predetermined angle for generating the first transformation block by performing a rotational transformation on the second transformation block, wherein the plurality of transformation matrices included in the second transformation set may include a horizontal transformation matrix and a vertical transformation matrix for the rotational transformation.

The image decoding apparatus 10 may generate the first transform block by inverse-transforming the second transform block based on a plurality of transform matrices included in the selected second transform set.

According to an embodiment, the inverse transform for the second transform block may be a rotational transform. The second transform set for the rotational transform may correspond to a predetermined angle for generating the first transform block by rotationally transforming the second transform block. The second transform set for the rotational transform may include a horizontal transform matrix and a vertical transform matrix for performing the rotational transform on the second transform block.

A detailed method of performing the rotation transformation by using the horizontal transformation matrix and the vertical transformation matrix included in the second transformation set will be described below with reference to fig. 5.

Image decoding apparatus 10 may generate the first transform block by inverse transforming at least a portion of a second transform block, where the second transform block may correspond to a coefficient block having a size in a ratio of 1: N or N: 1.

Here, coefficient blocks having a size of a scale of 1: N may include coefficient blocks having a size of 4 × 8, 8 × 16, 16 × 32, 32 × 64, 4 × 16, 8 × 32, 16 × 64, 4 × 32, 8 × 64, and 4 × 64 and coefficient blocks having a size of a scale of N:1 may include coefficient blocks having a size of 8 × 4, 16 × 8, 32 × 16, 64 × 32, 16 × 4, 32 × 8, 64 × 16, 32 × 4, 64 × 8, and 64 × 4.

The image decoding apparatus 10 may inversely rearrange the coefficients of the first transform block by inversely transforming the second transform block using a transform matrix selected from a second transform set corresponding to any one of a plurality of predetermined rearrangement angles.

For example, the image decoding apparatus 10 may inversely rearrange the coefficients of the first transform block by inversely transforming the second transform block using the transform matrix included in the second transform set corresponding to any of predetermined rearrangement angles of 90 degrees, 180 degrees, and 270 degrees.

A detailed method performed by image decoding apparatus 10 to generate the first transform block by inverse-transforming the second transform block based on the plurality of transform matrices included in the second transform set will be described below with reference to fig. 6.

Fig. 2 is a block diagram of the image encoding apparatus 20 according to the embodiment.

The image encoding apparatus 20 may include a first transform block generator 201, a second transformer 202, and a second transform set information generator 203. Alternatively, the first transformation block generator 201, the second transformer 202, and the second transformation set information generator 203 may correspond to one processor or a plurality of processors interoperating with each other.

The image encoding apparatus 20 may generate the first transform block by transforming the residual block based on the first transform matrix.

The image encoding apparatus 20 may select any one of the second transform sets applied to the first transform block from among a plurality of second transform set candidates corresponding to a plurality of prediction modes.

The image encoding apparatus 20 may generate the second transform block by transforming the first transform block based on a plurality of transform matrices included in the selected second transform set.

According to an embodiment, the transform for the first transform block may be a rotational transform, and the second transform set for the rotational transform may correspond to a predetermined angle for the rotational transform of the first transform block. The second transform set for the rotational transform may include a horizontal transform matrix and a vertical transform matrix for performing the rotational transform on the first transform block.

The second transform set selection information indicating the second transform set may be entropy-encoded based on any one of a CABAC method, a fixed length method, and a unary method.

In particular, according to the CABAC method, context modeling may be performed by using information on a second transform set of peripheral blocks of the first transform block.

For example, when the unary binarization method is used, the image encoding apparatus 20 may transform information indicating the second transform set applied to the first transform block into binary bit strings such as 0, 10, 110, and 111.

As another example, when the CABABC method is used, the image encoding apparatus 20 may transform information indicating the second transform set applied to the first transform block into a binary bit string such as 0, 10, 110, and 111 via a unary binarization encoding method, and may then perform context modeling of estimating the probability of binary bits required for binary arithmetic encoding.

As another example, when a fixed length coding method is used, the image encoding apparatus 20 may transform information indicating the second transform set applied to the first transform block into a binary bit string having a fixed length of 2 bits, such as 00, 01, 10, and 11.

When the CABAC method is used, the image encoding apparatus 20 may perform context modeling by using information on the second transform set of the peripheral block of the first transform block (that is, an index indicating whether to perform the second transform on the peripheral block of the first transform block and indicating a second transform set applied to the peripheral block of the first transform block among the plurality of second transform set candidates).

An entropy encoding method indicating information of the second transform set applied to the first transform block may be determined based on at least one of a size, a shape, a prediction mode, an encoding tool, a number of non-zero coefficients, a sum of squares of the non-zero coefficients, a depth, and a quantization parameter of the second transform block.

The image encoding apparatus 20 may rearrange the coefficients of the second transform block by transforming the first transform block using a transform matrix selected from the second transform set corresponding to any one of a plurality of predetermined rearrangement angles.

The image encoding apparatus 20 may generate second transform set information including at least one of information on whether to perform a second transform with respect to the first transform block and information indicating a second transform set applied to the first transform block.

Fig. 3 is a flowchart of a process of performing the inverse second transform performed by the image decoding apparatus 10 according to the embodiment.

In operation S301, the image decoding apparatus 10 may determine whether second transform set information is obtained.

On the other hand, when determining to obtain the second transform set information, the image decoding apparatus 10 may obtain the second transform set information from the bitstream.

In operation S302, the image decoding apparatus 10 may determine second transform set information based on whether the second transform set information is obtained.

The second transform set information may include at least one of information on whether to perform a second transform and second transform set selection information indicating a second transform set applied to the first transform block.

When the information on whether to perform the second transform included in the second transform set information indicates that the second transform is not to be performed, the image decoding apparatus 10 may not obtain the second transform set selection information indicating the second transform set.

In operation S303, the image decoding apparatus 10 may select any one second transform set from a plurality of second transform set candidates corresponding to a plurality of prediction modes by using the second transform set information.

The second transformation set may correspond to a predetermined angle for generating the second transformation block by performing a rotational transformation on the first transformation block, wherein the plurality of transformation matrices included in the second transformation set may include a horizontal transformation matrix and a vertical transformation matrix for the rotational transformation.

In operation S304, the image decoding apparatus 10 may generate the first transform block by inverse-transforming the second transform block based on a plurality of transform matrices included in the selected second transform set.

The inverse transform for the second transform block may be a rotational transform, and the second transform set for the rotational transform may correspond to a predetermined angle for generating the first transform block by performing the rotational transform on the second transform block. Here, the second transform set for the rotational transform may include a horizontal transform matrix and a vertical transform matrix for generating the first transform block by performing the rotational transform on the second transform block.

In operation S305, the image decoding apparatus 10 may generate a residual block corresponding to the first transform block by inverse-transforming the first transform block based on the first transform matrix.

Fig. 4 is a flowchart of a process of performing the second transform performed by the image encoding apparatus 20 according to the embodiment.

In operation S401, the image encoding apparatus 20 may generate a first transform block by transforming the residual block based on the first transform matrix.

In operation S402, the image encoding apparatus 20 may select any one of second transform sets applied to the first transform block from among a plurality of second transform set candidates corresponding to a plurality of prediction modes.

In operation S403, the image encoding apparatus 20 may generate the second transform block by transforming the first transform block based on a plurality of transform matrices included in the selected second transform set.

Referring to fig. 5, the plurality of transform matrices included in the second transform set may include a horizontal transform matrix 51 and a vertical transform matrix 55 for transforming the first transform block 53.

In more detail, the second transform for the first transform block 53 may be a rotational transform. Here, the second transformation set may correspond to a predetermined angle for generating the second transformation block 57 by performing a rotational transformation on the first transformation block 53, wherein the plurality of transformation matrices included in the second transformation set may include a horizontal transformation matrix 51 and a vertical transformation matrix 55 for the rotational transformation.

In general, horizontal transform matrix 51 may include angle parameters for performing a partial transform between rows of coefficients included in first transform block 53 and vertical transform matrix 55 may include angle parameters for performing a partial transform between columns of coefficients included in first transform block 53.

That is, the horizontal transformation matrix 51 and the vertical transformation matrix 55 may include a combination of angle parameters for performing a rotation transformation of the first transformation block 53 by a specific angle.

Accordingly, a plurality of parameters included in a specific second transformation set may be differently set for each specific second transformation set to correspond to a specific angle for generating the second transformation block 57.

The horizontal transformation matrix 51 and the vertical transformation matrix 55 included in the second transformation set may be used to perform the second transformation only in a specific direction based on the size and shape of the first transformation block or additionally perform the second transformation in the specific direction.

For example, for a first transform block having any particular size and shape, the second transform may be performed based on only the horizontal transform matrix 51, or after the second transform is performed based on the horizontal transform matrix 51 and the vertical transform matrix 55, the second transform may be additionally performed based on only the horizontal transform matrix 51.

The relationship between the angle for performing the rotational transform on the first transform block 53 and the second transform set will be described below with reference to fig. 7.

Fig. 6 illustrates a process of applying the inverse second transform performed by the image decoding apparatus 10 according to the embodiment.

Referring to fig. 6, the image decoding apparatus 10 may perform inverse quantization (Inv Q)601 on the second transform block 61 that is 4 × 4 to 64 × 64 in size and quantized, to generate a second transform block.

Further, the image decoding device 10 may generate the first transform block by performing inverse transform (Inv STR)603 on the second transform block generated via the Inv Q601.

Finally, image decoding apparatus 10 may perform inverse transform (Inv Core transform) 605 on the first transform block based on the first transform matrix in order to transform the first transform block into a residual block.

The image decoding apparatus 10 may determine a method of performing the InvSTR 603 on the second transform block based on at least one of a size, a shape, a prediction mode, a coding tool, the number of non-zero coefficients, a sum of squares of the non-zero coefficients, a depth, and a quantization parameter of the second transform block.

In general, even when the compression rate is improved via the inverse transform for the second transform block, the total amount of data is not reduced when the number of parameters for the inverse transform is increased, thereby increasing overhead.

Therefore, in order to fully obtain the effect of the inverse transform for the second transform block having a large size, the coefficients included in the selected sample block must be coefficients that affect the compression of the image data. Therefore, in order to improve the compression rate, the image decoding apparatus 10 may perform inverse transformation by selecting only coefficients for low-frequency components having a high probability of having a value other than "0".

That is, the frequency coefficient for the low frequency component may be included in the upper left portion of the general frequency coefficient block generated through DCT, and thus the image decoding apparatus 10 may inverse-transform the second transform block having a size equal to or greater than a predetermined size using the coefficient sample.

The image decoding apparatus 10 may generate the first transform block by inverse-transforming at least a portion of the second transform block having the size of N × N. Here, the image decoding apparatus 10 may perform inverse transformation by selecting only coefficients located at the upper left portion of the second transform block as a sample block.

For example, as shown in fig. 6, the image decoding apparatus 10 may select an 8 × 8-sized sample block including only some coefficients of a second transform block having a size of 16 × 16 or having a size larger than the size of 16 × 16, and may then perform an inverse transform 603 on the selected sample block. Here, the image decoding apparatus 10 may not perform the inverse transform 603 on the remaining portion of the second transform block.

The image decoding apparatus 10 may generate the first transform block by inverse-transforming at least a portion of the second transform block having a size of a ratio of 1: N or N: 1.

For example, the image decoding apparatus 10 may perform inverse transformation by selecting a sample block having any one of 4 × 8, 4 × 16, and 4 × 32 sizes for a second transform block having a size of a ratio of 1: N.

In more detail, the inverse transform may be performed only on the left-side sample block having a size of 4 × 4 for the second transform blocks having sizes of 4 × 8, 4 × 16, 4 × 32, and 4 × 64, and the inverse transform may be performed only on the left up-sample block having a size of 4 × 4 or the left-side sample block having a size of 8 × 8 for the second transform blocks having sizes of 8 × 16, 8 × 32, and 8 × 64. Alternatively, for second transform blocks having sizes of 8 × 16, 8 × 32, and 8 × 64, the inverse transform may be performed only on left upsampled blocks having sizes of 4 × 8, 4 × 16, and 4 × 32, respectively.

For a second transform block having a size of a ratio of N:1, the inverse transform may be performed by selecting a block of samples having any one of 8 × 4, 16 × 4, and 32 × 4 sizes.

In more detail, the inverse transform may be performed only on an upper sample block having a size of 4 × 4 for second transform blocks having sizes of 8 × 4, 16 × 4, 32 × 4, and 64 × 4, and may be performed only on a left upper sample block having a size of 4 × 4 or an upper sample block having a size of 8 × 8 for second transform blocks having sizes of 16 × 8, 32 × 8, and 64 × 8. Alternatively, for second transform blocks having sizes of 16 × 8, 32 × 8, and 64 × 8, the inverse transform may be performed only on left upsampled blocks having sizes of 8 × 4, 16 × 4, and 32 × 4, respectively.

Here, the image decoding apparatus 10 may not perform the inverse transform 603 on the remaining portion of the second transform block.

When image decoding apparatus 10 inverse-transforms a second transform block having a predetermined size or a size greater than the predetermined size, image decoding apparatus 10 may divide the second transform block into a plurality of sub-blocks and inverse-transform all of the sub-blocks to generate a first transform block.

For example, the image decoding apparatus 10 may divide the second transform block having a size of 16 × 16 or more than 16 × 16 into sub blocks having a size of 8 × 8 and may perform inverse transform on all the sub blocks.

Image decoding apparatus 10 may determine a portion of the quantized second transform block to which an inverse transform is to be applied based on the number of non-zero coefficients of the quantized second transform block.

For example, when a non-zero coefficient included in the quantized second transform block is equal to or greater than a predetermined threshold, the image decoding apparatus 10 may select a sample block having a size of 4 × 4 or 8 × 8 and perform inverse transform on the selected sample block.

Fig. 7 illustrates an example of angles corresponding to a plurality of Intra Prediction Modes (IPMs) according to an embodiment.

Fig. 7 illustrates angles corresponding to a plurality of IPMs, and the angles are divided into four angle groups for convenience of explanation. Here, the first to sixteenth angles may be referred to as a first horizontal angle 71, the seventeenth to thirty-second angles may be referred to as a second horizontal angle 73, the thirty-third to forty-eighth angles may be referred to as a first vertical angle 75 and the forty-ninth to sixty-fourteenth angles may be referred to as a second vertical angle 77.

When the image encoding apparatus 20 performs the rotational transform on the first transform block based on the angles corresponding to the plurality of IPMs, a specific angle group including at least one angle among the angles corresponding to the plurality of IPMs may correspond to any one of the second transform sets.

For example, first to thirty-second angles included in first to second horizontal angles 71 and 73 among first to sixty-fourth angles corresponding to sixty-four IPMs (except 0: planar mode and 1: DC mode in sixty-six IPMs) may be set to correspond to any one second transform set included in the plurality of second transform set candidates.

Here, when the image encoding apparatus 20 performs the rotation transform on the first transform block based on any one of the first to thirty-second angles included in the first and second horizontal angles 71 and 73, the first transform block may be rotation-transformed by using the horizontal transform matrix and the vertical transform matrix included in the specific second transform set corresponding to the first and second horizontal angles 71 and 73.

The first through sixty-fourth angles may be used to construct 2n second transform set candidates. Here, the second transform set information may include information indicating a second transform set applied to the first transform block among the 2n second transform set candidates. For example, the information indicating the second transform set applied to the first transform block among the 2n second transform set candidates may be an index having a length of n bits.

The plurality of second transform set candidates may respectively correspond to a plurality of IPMs. Here, the prediction mode may include a plurality of IPMs for the first transform block.

For example, the plurality of second transform set candidates may be differently set based on IPM of the first transform block. When four second transform sets (i.e., set 1, set 2, set 3, and set 4) are set as a plurality of second transform set candidates for IPM0, the other four second transform sets (i.e., set 5, set 6, set 7, and set 8) may be set as a plurality of second transform set candidates for IPM 1.

As another example, the plurality of IPMs corresponding to the plurality of second transform set candidates may include at least two IPMs of the plurality of IPMs for the first transform block. When four second transform sets (i.e., set 1, set 2, set 3, and set 4) are set as a plurality of second transform set candidates for IPM0, the same four second transform sets (i.e., set 1, set 2, set 3, and set 4) may be set as a plurality of second transform set candidates for IPM1, IPM2, and IPM 3.

That is, when the second transform set is set based on IPM of the first transform block, the same plurality of second transform set candidates may be used for the plurality of IPM, thereby reducing the total number of predetermined second transform sets, and overhead may be reduced since the total number of parameters used for the second transform is reduced.

As another example, the plurality of second transform set candidates may be differently set based on an inter prediction mode of the first transform block. When four second transform sets, i.e., set 1, set 2, set 3, and set 4, are set as a plurality of second transform set candidates for an affine motion prediction mode in the inter prediction mode, the other four second transform sets, i.e., set 5, set 6, set 7, and set 8, may be set as a plurality of second transform set candidates for an Advanced Motion Vector Prediction (AMVP) mode in the inter prediction mode.

As another example, the plurality of second transform sets may be differently set based on the size and shape of the first transform block (that is, the ratio of the width to the height of the first transform block). When four second transform sets (i.e., set 1, set 2, set 3, and set 4) are set as a plurality of second transform set candidates for a square first transform block having a size of a scale of 1:1, the other four second transform sets (i.e., set 5, set 6, set 7, and set 8) may be set as a plurality of second transform set candidates for a rectangular first transform block having a size of a scale of 1: 2.

The second transform set may be set to a default mode for inverse transforming the second transform block based on any one of the second transform sets included in the basic second transform set determined in advance when it is determined that the second transform set information is not obtained. Here, a second transformation set in a default mode predetermined via syntax elements parsed in the video slice level may be used.

Hereinafter, the division of the coding unit will be described in detail according to an embodiment of the present disclosure.

An image may be divided into maximum coding units. The size of the maximum coding unit may be determined based on information obtained from the bitstream. The shape of the largest coding unit may be a square of the same size. However, it is not limited thereto. In addition, the maximum coding unit may be hierarchically divided into coding units based on the division shape mode information obtained from the bitstream. The division shape mode information may include at least one of information indicating whether to perform division, division direction information, and division type information. The information indicating whether to perform the division may indicate whether to divide the coding unit. The division direction information may indicate that the division is performed in a horizontal direction or a vertical direction. The partition type information may indicate that the coding unit is divided based on any one of the two-partition, the three-partition, and the four-partition.

For example, the partition shape mode information (SPLIT _ mode) may indicate that the current coding unit is not partitioned (NO _ SPLIT). In addition, the partition shape mode information may indicate a QUAD partition (QUAD _ partition). In addition, the partition shape mode information may indicate a vertical binary partition (BI _ VER _ SPLIT). In addition, the partition shape mode information may indicate a vertical binary partition (BI _ VER _ SPLIT). In addition, the partition shape mode information may indicate a horizontal binary partition (BI _ HOR _ SPLIT). In addition, the partition shape mode information may indicate a vertical triple partition (TRI _ VER _ partition). In addition, the partition shape mode information may indicate a horizontal triple partition (TRI _ horslpit).

The image decoding apparatus may obtain division shape mode information from a bitstream from one binary bit string. The form of the bitstream received by the image decoding apparatus may include a fixed-length binary code, a unary code, a truncated unary code, a predetermined binary code, and the like. The binary bit string may indicate information in an array of binary numbers. The binary string may comprise at least one bit. The image decoding apparatus may obtain division shape mode information corresponding to the binary bit string based on the division rule. The image decoding apparatus may determine whether to divide the coding unit, the division direction, and the division type based on the at least one binary bit string.

The coding unit may be less than or equal to the maximum coding unit. For example, when the partition shape mode information indicates that the coding unit is not partitioned, the coding unit may have the same size as the maximum coding unit. When the division shape mode information indicates that the coding unit is divided, the maximum coding unit may be divided into the coding units. In addition, when the partition shape mode information for the coding unit indicates that the coding unit is partitioned, the coding unit may be partitioned into smaller coding units. However, the division of the image is not limited thereto, and the maximum coding unit and the coding unit may not be different from each other. The division of the coding unit will be described in more detail with reference to fig. 8 to 21.

In addition, the coding unit may be divided into prediction units for image prediction. The prediction unit may be the same as or smaller than the coding unit. In addition, the coding unit may be divided into transform units for image transformation. The transform unit may be the same as or smaller than the coding unit. The shapes and sizes of the transform unit and the prediction unit may be unrelated to each other. The coding unit may be different from the prediction unit and the transform unit, or the coding unit, the prediction unit, and the transform unit may be identical to each other. The division of the prediction unit and the transform unit may be performed by the same method as the division of the coding unit. The division of the coding unit will be described in more detail with reference to fig. 8 to 21. The current block and the peripheral block of the present disclosure may indicate any one unit of a maximum coding unit, a prediction unit, and a transform unit. In addition, the current block or the current encoding unit may be a block currently performing decoding or encoding or a block currently performing division. The peripheral block may be a block reconstructed before the current block. The peripheral block may be spatially or temporally adjacent to the current block. The peripheral block may be located at any one of a lower left portion, a left portion, an upper right portion, a right portion, and a lower right portion of the current block.

Fig. 8 illustrates a process of determining at least one coding unit by dividing a current coding unit, performed by an image decoding apparatus according to an embodiment.

The block shape may include 4 nx 4N, 4 nx 2N, 2 nx 4N, 4 nx N, or nx 4N. Here, N may be a positive integer. The block shape information may be information indicating at least one of a shape, a direction, a width to height ratio, and a size of the coding unit.

The shape of the coding unit may include a square shape and a non-square shape. When the width and height of the coding unit are the same (i.e., when the block shape of the coding unit is 4N × 4N), the image decoding apparatus may determine the block shape information of the coding unit as a square. The image decoding apparatus may determine the shape of the coding unit to be non-square.

When the width and the height of the coding unit are different from each other (that is, when the block shape of the coding unit is 4N × 2N, 2N × 4N, 4N × N, or N × 4N), the image decoding apparatus may determine the block shape information of the coding unit to be non-square. When the shape of the coding unit is non-square, the image decoding apparatus may determine the width to height ratio as at least one of 1:2, 2:1, 1:4, 4:1, 1:8, and 8:1 from the block shape information of the coding unit. In addition, the image decoding apparatus may determine whether the coding unit is in the horizontal direction or the vertical direction based on the width and the height of the coding unit. In addition, the image decoding apparatus may determine the size of the coding unit based on at least one of the width, the height, and the area of the coding unit.

According to the embodiment, the image decoding apparatus may determine the shape of the coding unit by using the block shape information, and may determine the division method of the coding unit by using the division shape mode information. That is, the coding unit division method indicated by the division shape mode information may be determined based on a block shape indicated by block shape information used by the image decoding apparatus.

The image decoding apparatus may obtain the partition shape mode information from the bitstream. However, it is not limited thereto. The image decoding apparatus and the image encoding apparatus 20 may determine the predetermined division shape mode information based on the block shape information. The image decoding apparatus may determine predetermined partition shape mode information for a maximum coding unit or a minimum coding unit. For example, the image decoding apparatus may determine the partition shape mode information for the maximum coding unit as the quad partition. In addition, the image decoding apparatus may determine the division shape mode information for the minimum coding unit as "not divided". In detail, the image decoding apparatus may determine the size of the maximum coding unit to be 256 × 256. The image decoding apparatus may determine the predetermined division shape mode information as the four divisions. The quad-division is a division shape pattern in which the width and height of the coding unit are divided into two. The image decoding apparatus may obtain a coding unit of 128 × 128 size from a maximum coding unit of 256 × 256 size based on the partition shape mode information. In addition, the image decoding apparatus may determine the size of the minimum coding unit to be 4 × 4. The image decoding apparatus may obtain partition shape mode information indicating "not to perform partitioning" for the minimum coding unit.

According to an embodiment, the image decoding apparatus may use block shape information indicating that the current coding unit has a square shape. For example, the image decoding apparatus may determine whether to divide a square coding unit, whether to divide the square coding unit vertically, whether to divide the square coding unit horizontally, or whether to divide the square coding unit into four coding units based on the division shape mode information. Referring to fig. 8, when the block shape information of the current coding unit 800 indicates a square shape, the decoder may determine that a coding unit 810a having the same size as the current coding unit 800 is not divided based on the division shape mode information indicating that division is not performed, or may determine a coding unit 810b, 810c, or 810d divided based on the division shape mode information indicating a predetermined division method.

Referring to fig. 8, according to an embodiment, the image decoding apparatus may determine two coding units 810b obtained by dividing the current coding unit 800 in the vertical direction based on the division shape mode information indicating that the division is performed in the vertical direction. The image decoding apparatus may determine two coding units 810c obtained by dividing the current coding unit 800 in the horizontal direction based on the division shape mode information indicating that the division is performed in the horizontal direction. The image decoding apparatus may determine four coding units obtained by dividing the current coding unit 800 in the vertical direction and the horizontal direction based on the division shape mode information indicating the division in the vertical direction and the horizontal direction. However, the dividing method of the square coding unit is not limited to the above-described method, and the division shape mode information may indicate various methods. The predetermined division method of dividing the square encoding unit will be described in detail with respect to various embodiments.

Fig. 9 illustrates a process of determining at least one coding unit by dividing a non-square coding unit performed by the image decoding apparatus according to the embodiment.

According to an embodiment, the image decoding apparatus may use block shape information indicating that the current coding unit has a non-square shape. The image decoding apparatus may determine whether to divide the current coding unit of the non-square shape or whether to divide the current coding unit of the non-square shape by using a predetermined division method, based on the division shape mode information. Referring to fig. 9, when block shape information of a current coding unit 900 or 950 indicates a non-square shape, the image decoding apparatus may determine a coding unit 910 or 960 having the same size as the current coding unit 900 or 950, or determine coding units 920a and 920b, coding units 930a to 930c, coding units 970a and 970b, or coding units 980a to 980c, which are divided based on partition shape mode information indicating a predetermined partition method, based on partition shape mode information indicating that division is not performed. The predetermined division method of dividing the non-square coding units will be described in detail with respect to various embodiments below.

According to the embodiment, the image decoding apparatus may determine the division method of the coding unit by using the division shape mode information, and in this case, the division shape mode information may indicate the number of one or more coding units generated by dividing the coding unit. Referring to fig. 9, when the division shape mode information indicates that the current coding unit 900 or 950 is divided into two coding units, the image decoding apparatus may determine two coding units 920a and 920b or coding units 970a and 970b included in the current coding unit 900 or 950 by dividing the current coding unit 900 or 950 based on the division shape mode information.

According to an embodiment, when the image decoding apparatus divides the non-square current coding unit 900 or 950 based on the division shape mode information, the position of the long side of the non-square current coding unit 900 or 950 may be considered. For example, the image decoding apparatus may determine a plurality of coding units by dividing a long side of the current coding unit 900 or 950 in consideration of the shape of the current coding unit 900 or 950.

According to an embodiment, the division shape mode information indicates that the coding unit is divided into odd blocks, and the image decoding apparatus may determine an odd number of coding units included in the current coding unit 900 or 950. For example, when the division shape mode information indicates that the current coding unit 900 or 950 is divided into three coding units, the image decoding apparatus may divide the current coding unit 900 or 950 into three coding units 930a, 930b, and 930c or 980a, 980b, and 980 c.

The width to height ratio of the current coding unit 900 or 950 may be 4:1 or 1:4, depending on the embodiment. When the ratio of the width to the height is 4:1, the width is greater than the height, and thus, the block shape information may be in the horizontal direction. When the ratio of the width to the height is 1:4, the width is smaller than the height, and thus, the block shape information may be a vertical direction. The image decoding apparatus may determine to divide the current coding unit into odd blocks based on the division shape mode information. In addition, the image decoding apparatus may determine the division direction of the current coding unit 900 or 950 based on the block shape information of the current coding unit 900 or 950. For example, when the current coding unit 900 is in the vertical direction, the image decoding apparatus may divide the current coding unit 900 in the horizontal direction to determine coding units 930a, 930b, and 930 c. In addition, when the current encoding unit 950 is in the horizontal direction, the image decoding apparatus may divide the current encoding unit 950 in the vertical direction to determine encoding units 980a, 980b, and 980 c.

According to an embodiment, the image decoding apparatus may determine an odd number of coding units included in the current coding unit 900 or 950, and not all of the determined coding units may have the same size. For example, the determined odd number of coding units 930a, 930b, and 930c or a predetermined coding unit 930b or a predetermined coding unit 980b of the coding units 980a, 980b, and 980c may have a size different from the size of the other coding units 930a and 930c or the coding units 980a and 980 c. That is, the coding units determinable by dividing the current coding unit 900 or 950 may have various sizes, and in some cases, an odd number of coding units 930a, 930b, and 930c or all of the coding units 980a, 980b, and 980c may have different sizes.

According to an embodiment, when the division shape mode information indicates that the encoding unit is divided into odd blocks, the image decoding apparatus may determine an odd number of encoding units included in the current encoding unit 900 or 950, and may apply a predetermined restriction on at least one of the odd number of encoding units generated by dividing the current encoding unit 900 or 950. Referring to fig. 9, the image decoding apparatus may allow a decoding method of a coding unit 930b or a coding unit 980b to be different from that of other coding units 930a and 930c or coding units 980a and 980c, wherein the coding unit 930b or the coding unit 980b is located at a central position among three coding units 930a, 930b, and 930c or coding units 980a, 980b, and 980c generated by dividing the current coding unit 900 or 950. For example, the image decoding apparatus may restrict the coding unit 930b or 980b at the central position from being divided no longer or only a predetermined number of times, unlike the other coding units 930a and 930c or 980a and 980 c.

Fig. 10 illustrates a process of dividing an encoding unit based on at least one of block shape information and divided shape mode information, performed by an image decoding apparatus according to an embodiment.

According to an embodiment, the image decoding apparatus may determine whether to divide the first coding unit 1000 of the square into coding units or not to divide the first coding unit 1000 of the square based on at least one of the block shape information and the division shape mode information. According to an embodiment, when the division shape mode information indicates that the first encoding unit 1000 is divided in the horizontal direction, the image decoding apparatus may determine the second encoding unit 1010 by dividing the first encoding unit 1000 in the horizontal direction. The first coding unit, the second coding unit, and the third coding unit used according to the embodiment are terms used to understand a relationship before and after dividing the coding unit. For example, the second coding unit may be determined by dividing the first coding unit, and the third coding unit may be determined by dividing the second coding unit. It will be understood that the structure of the first, second and third encoding units follows the above description.

According to an embodiment, the image decoding apparatus may determine to divide the determined second coding unit 1010 into coding units or not to divide the determined second coding unit 1010 based on at least one of block shape information and division shape mode information. Referring to fig. 10, the image decoding apparatus may or may not divide the non-square second encoding unit 1010 determined by dividing the first encoding unit 1000 into one or more third encoding units 1020a or 1020b, 1020c, and 1020d based on at least one of the block shape information and the division shape mode information. The image decoding apparatus may obtain at least one of block shape information and division shape mode information, and determine a plurality of second coding units (e.g., 1010) of various shapes by dividing the first coding unit 1000 based on the obtained at least one of block shape information and division shape mode information, and may divide the second coding unit 1010 by using a division method of the first coding unit 1000 based on the at least one of block shape information and division shape mode information. According to an embodiment, when the first encoding unit 1000 is divided into the second encoding units 1010 based on at least one of the block shape information and the partition shape mode information of the first encoding unit 1000, the second encoding units 1010 may also be divided into the third encoding units 1020a or the third encoding units 1020b, 1020c, and 1020d based on at least one of the block shape information and the partition shape mode information of the second encoding units 1010. That is, the coding units may be recursively divided based on at least one of block shape information and divided shape mode information of each coding unit. Accordingly, the square coding unit may be determined by dividing the non-square coding unit, and the non-square coding unit may be determined by recursively dividing the square coding unit.

Referring to fig. 10, a predetermined coding unit (e.g., a coding unit at a center position or a square coding unit) among an odd number of third coding units 1020b, 1020c, and 1020d determined by dividing the non-square second coding unit 1010 may be recursively divided. According to an embodiment, the square third encoding unit 1020b among the odd number of third encoding units 1020b, 1020c, and 1020d may be divided into a plurality of fourth encoding units in the horizontal direction. The non-square fourth encoding unit 1030b or 1030d among the plurality of fourth encoding units 1030a, 1030b, 1030c and 1030d may be divided into a plurality of encoding units. For example, the non-square fourth coding unit 1030b or 1030d may be divided into an odd number of coding units. Methods that may be used to recursively divide the coding units are described below with respect to various embodiments.

According to an embodiment, the image decoding apparatus may determine to divide the third encoding unit 1020a or each of the third encoding units 1020b, 1020c, and 1020d into encoding units based on at least one of the block shape information and the division shape mode information. According to an embodiment, the image decoding apparatus may determine not to divide the second encoding unit 1010 based on at least one of the block shape information and the divided shape mode information. According to an embodiment, the image decoding apparatus may divide the non-square second encoding unit 1010 into an odd number of third encoding units 1020b, 1020c, and 1020 d. The image decoding apparatus may apply a predetermined restriction to a predetermined third encoding unit among the odd-numbered third encoding units 1020b, 1020c, and 1020 d. For example, the image decoding apparatus may limit the third encoding unit 1020c located at the center position among the odd number of third encoding units 1020b, 1020c, and 1020d to no longer be divided or to be divided a settable number of times.

Referring to fig. 10, the image decoding apparatus may limit the third encoding unit 1020c at the center position among the odd number of third encoding units 1020b, 1020c, and 1020d included in the non-square second encoding unit 1010 to no longer be divided, to be divided by using a predetermined dividing method (e.g., to be divided only into four encoding units or to be divided by using the dividing method of the second encoding unit 1010), or to be divided only a predetermined number of times (e.g., to be divided only n times (where n > 0)). However, the restriction on the third encoding unit 1020c at the center position is not limited to the above example, and may include various restrictions for decoding the third encoding unit 1020c at the center position different from the other third encoding units 1020b and 1020 d.

According to the embodiment, the image decoding apparatus may obtain at least one of block shape information and division shape mode information for dividing the current coding unit from a predetermined position in the current coding unit.

Fig. 11 illustrates a method of determining a predetermined coding unit from an odd number of coding units performed by an image decoding apparatus according to an embodiment.

Referring to fig. 11, at least one of block shape information and division shape mode information of the current coding unit 1100 or 1150 may be obtained from a sample point (e.g., a sample point 1140 or 1190 of a center position) of a predetermined position among a plurality of sample points included in the current coding unit 1100 or 1150. However, the predetermined position in the current coding unit 1100 from which at least one of the block shape information and the divided shape mode information can be obtained is not limited to the center position in fig. 11, and may include various positions (e.g., upper, lower, left, right, upper-left, lower-left, upper-right, and lower-right positions) included in the current coding unit 1100. The image decoding apparatus may obtain at least one of block shape information and division shape mode information from the predetermined position, and determine to divide or not to divide the current coding unit into coding units of various shapes and various sizes.

According to an embodiment, the image decoding apparatus may select one of the coding units when the current coding unit is divided into a predetermined number of coding units. As will be described below with respect to various embodiments, various methods may be used to select one of a plurality of coding units.

According to an embodiment, the image decoding apparatus may divide a current coding unit into a plurality of coding units, and may determine a coding unit at a predetermined position.

According to an embodiment, the image decoding apparatus may use information indicating each of positions of an odd number of coding units to determine a coding unit at a center position among the odd number of coding units. Referring to fig. 11, the image decoding apparatus may determine an odd number of encoding units 1120a, 1120b, and 1120c or encoding units 1160a, 1160b, and 1160c by dividing the current encoding unit 1100 or 1150. The image decoding apparatus may determine the encoding unit 1120b or the encoding unit 1160b at the center position by using information on the positions of the odd-numbered encoding units 1120a, 1120b, and 1120c or the encoding units 1160a, 1160b, and 1160 c. For example, the image decoding apparatus may determine the coding unit 1120b of the central position by determining the positions of the coding units 1120a, 1120b, and 1120c based on information indicating the positions of predetermined sampling points included in the coding units 1120a, 1120b, and 1120 c. In detail, the image decoding apparatus may determine the encoding unit 1120b at the center position by determining the positions of the encoding units 1120a, 1120b, and 1120c based on information indicating the positions of the left upper samples 1130a, 1130b, and 1130c of the encoding units 1120a, 1120b, and 1120 c.

According to an embodiment, the information indicating the positions of the left upper samples 1130a, 1130b, and 1130c respectively included in the coding units 1120a, 1120b, and 1120c may include information about the positions or coordinates of the coding units 1120a, 1120b, and 1120c in the picture. According to an embodiment, the information indicating the positions of the left upper samples 1130a, 1130b, and 1130c respectively included in the coding units 1120a, 1120b, and 1120c may include information indicating the width or height of the coding units 1120a, 1120b, and 1120c included in the current coding unit 1100, and the width or height may correspond to information indicating the difference between the coordinates of the coding units 1120a, 1120b, and 1120c in the picture. That is, the image decoding apparatus may determine the encoding unit 1120b at the center position by directly using information on the positions or coordinates of the encoding units 1120a, 1120b, and 1120c in the picture, or by using information on the widths or heights of the encoding units corresponding to the differences between the coordinates.

According to an embodiment, information indicating the position of the left upper spline 1130a of the upper coding unit 1120a may indicate coordinates (xa, ya), information indicating the position of the left upper spline 1130b of the middle coding unit 1120b may indicate coordinates (xb, yb), and information 1120c indicating the position of the left upper spline 1130c of the lower coding unit may indicate coordinates (xc, yc). The image decoding apparatus may determine the middle encoding unit 1120b by using the coordinates of the left upper samples 1130a, 1130b, and 1130c included in the encoding units 1120a, 1120b, and 1120c, respectively. For example, when the coordinates of the left upper samples 1130a, 1130b, and 1130c are sorted in an ascending order or a descending order, the coding unit 1120b including the coordinates (xb, yb) of the sample 1130b at the center position may be determined as the coding unit at the center position among the coding units 1120a, 1120b, and 1120c determined by dividing the current coding unit 1100. However, the coordinates indicating the positions of the left upper samples 1130a, 1130b, and 1130c may include coordinates indicating absolute positions in a screen, or coordinates (dxb, dyb) indicating the relative position of the left upper sample 1130b of the middle coding unit 1120b with respect to the position of the left upper sample 1130a of the upper coding unit 1120a and coordinates (dxc, dyc) indicating the relative position of the left upper sample 1130c of the lower coding unit 1120c with respect to the position of the left upper sample 1130a of the upper coding unit 1120a may be used. The method of determining the encoding unit at the predetermined position by using the coordinates of the samples included in the encoding unit as the information indicating the positions of the samples is not limited to the above-described method, and may include various arithmetic methods capable of using the coordinates of the samples.

According to an embodiment, the image decoding apparatus may divide the current encoding unit 1100 into a plurality of encoding units 1120a, 1120b, and 1120c, and may select one of the encoding units 1120a, 1120b, and 1120c based on a predetermined criterion. For example, the image decoding apparatus may select the coding unit 1120b having a size different from the sizes of the other coding units from the coding units 1120a, 1120b, and 1120 c.

According to the embodiment, the image decoding apparatus may determine the widths or heights of the coding units 1120a, 1120b, and 1120c by using coordinates (xa, ya) indicating the position of the left upper sample 1130a of the upper coding unit 1120a, coordinates (xb, yb) indicating the position of the left upper sample 1130b of the middle coding unit 1120b, and coordinates (xc, yc) indicating the position of the left upper sample 1130c of the lower coding unit 1120 c. The image decoding apparatus may determine the respective sizes of the encoding units 1120a, 1120b, and 1120c by using coordinates (xa, ya), (xb, yb), and (xc, yc) indicating the positions of the encoding units 1120a, 1120b, and 1120 c. According to an embodiment, the image decoding apparatus may determine the width of the upper encoding unit 1120a as the width of the current encoding unit 1100. According to an embodiment, the image decoding apparatus may determine the height of the upper encoding unit 1120a as yb-ya. According to an embodiment, the image decoding apparatus may determine the width of the intermediate encoding unit 1120b as the width of the current encoding unit 1100. According to an embodiment, the image decoding apparatus may determine the height of the intermediate encoding unit 1120b as yc-yb. According to an embodiment, the image decoding apparatus may determine the width or height of the lower encoding unit by using the width or height of the current encoding unit and the widths or heights of the upper encoding unit 1120a and the middle encoding unit 1120 b. The image decoding apparatus may determine the coding unit having a size different from the sizes of the other coding units based on the determined widths and heights of the coding units 1120a, 1120b, and 1120 c. Referring to fig. 11, the image decoding apparatus may determine an intermediate encoding unit 1120b having a size different from that of the upper encoding unit 1120a and the lower encoding units 1120c as an encoding unit of a predetermined position. However, the above-described method of determining a coding unit having a size different from the sizes of other coding units, which is performed by the image decoding apparatus, corresponds only to an example of determining a coding unit at a predetermined position by using the size of a coding unit determined based on the coordinates of the sampling points, and thus various methods of determining a coding unit at a predetermined position by comparing the sizes of coding units determined based on the coordinates of predetermined sampling points may be used.

The image decoding apparatus may determine the widths or heights of the coding units 1160a, 1160b, and 1160c by using the coordinates (xd, yd) indicating the position of the left sample 1170a of the left coding unit 1160a, the coordinates (xe, ye) indicating the position of the left sample 1170b of the middle coding unit 1160b, and the coordinates (xf, yf) indicating the position of the left sample 1170c of the right coding unit 1160 c. The image decoding apparatus may determine the respective sizes of the encoding units 1160a, 1160b, and 1160c by using coordinates (xd, yd), (xe, ye), and (xf, yf) indicating the positions of the encoding units 1160a, 1160b, and 1160 c.

According to an embodiment, the image decoding apparatus may determine the width of the left encoding unit 1160a as xe-xd. According to an embodiment, the image decoding apparatus may determine the height of the left encoding unit 1160a as the height of the current encoding unit 1150. According to an embodiment, the image decoding apparatus may determine the width of the intermediate encoding unit 1160b as xf-xe. According to an embodiment, the image decoding apparatus may determine the height of the intermediate encoding unit 1160b as the height of the current encoding unit 1150. According to an embodiment, the image decoding apparatus may determine the width or height of the right encoding unit 1160c by using the width or height of the current encoding unit 1150 and the widths or heights of the left encoding unit 1160a and the middle encoding unit 1160 b. The image decoding apparatus may determine the coding unit having a size different from the sizes of the other coding units based on the determined width and height of the coding units 1160a, 1160b, and 1160 c. Referring to fig. 11, the image decoding apparatus may determine an intermediate encoding unit 1160b having a size different from that of the left and right encoding units 1160a and 1160c as an encoding unit of a predetermined position. However, the above-described method of determining a coding unit having a size different from the sizes of other coding units, which is performed by the image decoding apparatus, corresponds only to an example of determining a coding unit at a predetermined position by using the size of a coding unit determined based on the coordinates of the sampling points, and thus various methods of determining a coding unit at a predetermined position by comparing the sizes of coding units determined based on the coordinates of the predetermined sampling points may be used.

However, the position of the sampling point considering the position for determining the coding unit is not limited to the above-described upper-left position, and information on an arbitrary position of the sampling point included in the coding unit may be used.

According to the embodiment, the image decoding apparatus may select a coding unit at a predetermined position from an odd number of coding units determined by dividing the current coding unit in consideration of the shape of the current coding unit. For example, when the current coding unit has a non-square shape having a width greater than a height, the image decoding apparatus may determine a coding unit at a predetermined position in the horizontal direction. That is, the image decoding apparatus may determine one of the coding units at different positions in the horizontal direction and impose a restriction on the coding unit. When the current coding unit has a non-square shape with a height greater than a width, the image decoding apparatus may determine a coding unit at a predetermined position in the vertical direction. That is, the image decoding apparatus may determine one of the coding units at different positions in the vertical direction and may apply a restriction to the coding unit.

According to the embodiment, the image decoding apparatus may use the information indicating the respective positions of the even number of coding units to determine the coding unit at a predetermined position among the even number of coding units. The image decoding apparatus may determine an even number of coding units by bi-dividing a current coding unit, and may determine a coding unit at a predetermined position by using information on positions of the even number of coding units. The operation related thereto may correspond to the operation of determining the coding unit at a predetermined position (e.g., a center position) among the odd number of coding units, which has been described in detail above with respect to fig. 11, and thus a detailed description thereof will not be provided herein.

According to an embodiment, when a current coding unit that is not square is divided into a plurality of coding units, predetermined information about a coding unit at a predetermined position may be used in a dividing operation to determine a coding unit at a predetermined position among the plurality of coding units. For example, the image decoding apparatus may use at least one of block shape information and divided shape mode information stored in samples included in a coding unit at a center position in a dividing operation to determine a coding unit at the center position among a plurality of coding units determined by dividing the current coding unit.

Referring to fig. 11, the image decoding apparatus may divide a current coding unit 1100 into a plurality of coding units 1120a, 1120b, and 1120c based on at least one of block shape information and divided shape mode information, and may determine a coding unit 1120b at a center position among the plurality of coding units 1120a, 1120b, and 1120 c. Further, the image decoding apparatus may determine the encoding unit 1120b at the center position in consideration of the position from which at least one of the block shape information and the division shape mode information is obtained. That is, at least one of block shape information and division shape mode information of the current coding unit 1100 may be obtained from the sampling point 1140 at the center position of the current coding unit 1100, and when the current coding unit 1100 is divided into a plurality of coding units 1120a, 1120b, and 1120c based on the at least one of the block shape information and the division shape mode information, the coding unit 1120b including the sampling point 1140 may be determined as the coding unit at the center position. However, the information for determining the coding unit at the central position is not limited to at least one of the block shape information and the partition shape mode information, and various types of information may be used to determine the coding unit at the central position.

According to the embodiment, predetermined information for identifying a coding unit at a predetermined position may be obtained from predetermined samples included in the coding unit to be determined. Referring to fig. 11, the image decoding apparatus may determine a coding unit at a predetermined position (e.g., a coding unit at a center position among a plurality of divided coding units) among a plurality of coding units 1120a, 1120b, and 1120c determined by dividing the current coding unit 1100, using at least one of block shape information and divided shape pattern information obtained from a sample point at a predetermined position (e.g., a sample point at a center position of the current coding unit 1100) among the current coding unit 1100. That is, the image decoding apparatus may determine samples at predetermined positions by considering the block shape of the current encoding unit 1100, determine an encoding unit 1120b including samples from which predetermined information (e.g., at least one of block shape information and divided shape mode information) may be obtained among the plurality of encoding units 1120a, 1120b, and 1120c determined by dividing the current encoding unit 1100, and may apply a predetermined restriction to the encoding unit 1120 b. Referring to fig. 11, according to the embodiment, the image decoding apparatus may determine a sample 1140 at a center position of a current encoding unit 1100 as a sample from which predetermined information may be obtained, and may apply a predetermined limit to an encoding unit 1120b including the sample 1140 in a decoding operation. However, the positions of the samples from which the predetermined information can be obtained are not limited to the above-described positions, and may include any positions of the samples included in the encoding unit 1120b to be determined for limitation.

According to an embodiment, the position of a sample point from which predetermined information can be obtained may be determined based on the shape of the current encoding unit 1100. According to an embodiment, the block shape information may indicate whether the current coding unit has a square shape or a non-square shape, and a position of a sampling point from which the predetermined information may be obtained may be determined based on the shape. For example, the image decoding apparatus may determine a sample point located on a boundary dividing at least one of the width and the height of the current coding unit into halves as a sample point from which predetermined information can be obtained, by using at least one of information on the width of the current coding unit and information on the height of the current coding unit. As another example, when the block shape information of the current coding unit indicates a non-square shape, the image decoding apparatus may determine one of the samples adjacent to a boundary for dividing the long side of the current coding unit in half as the sample from which the predetermined information can be obtained.

According to an embodiment, when a current coding unit is divided into a plurality of coding units, the image decoding apparatus may determine a coding unit at a predetermined position among the plurality of coding units using at least one of block shape information and division shape mode information. According to the embodiment, the image decoding apparatus may obtain at least one of block shape information and division shape mode information from samples at predetermined positions in the coding units, and divide the plurality of coding units generated by dividing the current coding unit by using at least one of the division shape mode information and the block shape information obtained from samples at predetermined positions in each of the plurality of coding units. That is, the coding units may be recursively divided based on at least one of block shape information and divided shape pattern information obtained from samples at predetermined positions in each coding unit. The operation of recursively dividing the coding units has been described above with respect to fig. 10, and thus a detailed description thereof will not be provided here.

According to the embodiment, the image decoding apparatus may determine one or more coding units by dividing a current coding unit, and may determine an order of decoding the one or more coding units based on a predetermined block (e.g., the current coding unit).

Fig. 12 illustrates an order in which an image decoding apparatus processes a plurality of coding units when the plurality of coding units are determined by dividing a current coding unit according to an embodiment.

According to the embodiment, the image decoding apparatus may determine the second encoding units 1210a and 1210b by dividing the first encoding unit 1200 in the vertical direction, determine the second encoding units 1230a and 1230b by dividing the first encoding unit 1200 in the horizontal direction, or determine the second encoding units 1250a to 1250d by dividing the first encoding unit 1200 in the vertical direction and the horizontal direction, based on the block shape information and the division shape mode information.

Referring to fig. 12, the image decoding apparatus may determine that the second encoding units 1210a and 1210b determined by dividing the first encoding unit 1200 in the vertical direction are processed in the horizontal direction order 1210 c. The image decoding apparatus may determine that the second encoding units 1230a and 1230b determined by dividing the first encoding unit 1200 in the horizontal direction are processed in the vertical direction order 1230 c. The image decoding apparatus may determine that the second encoding units 1250a to 1250d determined by dividing the first encoding unit 1200 in the vertical direction and the horizontal direction are processed in a predetermined order (e.g., in a raster scan order or a zigzag scan order 1250e) for processing the encoding units in one row and then processing the encoding units in the next row.

According to an embodiment, the image decoding apparatus may recursively divide the encoding units. Referring to fig. 12, the image decoding apparatus may determine the plurality of coding units 1210a, 1210b, 1230a, 1230b, 1250a, 1250b, 1250c, and 1250d by dividing the first coding unit 1200, and may recursively divide each of the determined plurality of coding units 1210a, 1210b, 1230a, 1230b, 1250a, 1250b, 1250c, and 1250 d. The division method of the plurality of coding units 1210a, 1210b, 1230a, 1230b, 1250a, 1250b, 1250c, and 1250d may correspond to the division method of the first coding unit 1200. In this way, each of the plurality of coding units 1210a, 1210b, 1230a, 1230b, 1250a, 1250b, 1250c, and 1250d may be independently divided into a plurality of coding units. Referring to fig. 12, the image decoding apparatus may determine the second encoding units 1210a and 1210b by dividing the first encoding unit 1200 in the vertical direction, and may determine whether each of the second encoding units 1210a and 1210b is divided or not divided independently.

According to the embodiment, the image decoding apparatus may determine the third encoding units 1220a and 1220b by dividing the left second encoding unit 1210a in the horizontal direction, and may not divide the right second encoding unit 1210 b.

According to an embodiment, the processing order of the coding units may be determined based on the operation of dividing the coding units. In other words, the processing order of the divided coding units may be determined based on the processing order of the coding units immediately before being divided. The image decoding apparatus may determine the processing order of the third encoding units 1220a and 1220b determined by dividing the left-side second encoding unit 1210a, independently of the right-side second encoding unit 1210 b. Since the third encoding units 1220a and 1220b are determined by dividing the left second encoding unit 1210a in the horizontal direction, the third encoding units 1220a and 1220b may be processed in the vertical direction order 1220 c. Since the left second encoding unit 1210a and the right second encoding unit 1210b are processed in the horizontal direction order 1210c, the right second encoding unit 1210b may be processed after the third encoding units 1220a and 1220b included in the left second encoding unit 1210a are processed in the vertical direction order 1220 c. The operation of determining the processing order of the coding units based on the coding units before being divided is not limited to the above-described example, and the coding units divided and determined to be various shapes may be independently processed in a predetermined order using various methods.

Fig. 13 illustrates a process of determining that a current coding unit is to be divided into an odd number of coding units, which is performed by the image decoding apparatus when the coding units cannot be processed in a predetermined order, according to the embodiment.

According to an embodiment, the image decoding apparatus may determine whether the current coding unit is divided into an odd number of coding units based on the obtained block shape information and the divided shape mode information. Referring to fig. 13, a square first coding unit 1300 may be divided into non-square second coding units 1310a and 1310b, and the second coding units 1310a and 1310b may be independently divided into third coding units 1320a and 1320b and third coding units 1320c to 1320 e. According to an embodiment, the image decoding apparatus may determine the plurality of third encoding units 1320a and 1320b by dividing the left second encoding unit 1310a in the horizontal direction, and may divide the right second encoding unit 1310b into an odd number of third encoding units 1320c to 1320 e.

According to the embodiment, the image decoding apparatus may determine whether to divide an arbitrary coding unit into an odd number of coding units by determining whether the third coding units 1320a and 1320b and the third coding units 1320c to 1320e can be processed in a predetermined order. Referring to fig. 13, the image decoding apparatus may determine third coding units 1320a and 1320b and third coding units 1320c to 1320e by recursively dividing the first coding unit 1300. The image decoding apparatus may determine whether any one of the first encoding unit 1300, the second encoding units 1310a and 1310b, and the third encoding units 1320a and 1320b, and the third encoding units 1320c, 1320d, and 1320e is divided into an odd number of encoding units based on at least one of the block shape information and the division shape mode information. For example, the second encoding unit 1310b located at the right side among the second encoding units 1310a and 1310b may be divided into odd number of third encoding units 1320c, 1320d, and 1320 e. The processing order of the plurality of coding units included in the first coding unit 1300 may be a predetermined order (e.g., a zigzag scanning order 1330), and the image decoding apparatus may determine whether the third coding units 1320c, 1320d, and 1320e determined by dividing the right-side second coding unit 1310b into odd-numbered coding units satisfy a condition sufficient for processing in the predetermined order.

According to an embodiment, the image decoding apparatus may determine whether the third encoding units 1320a and 1320b and the third encoding units 1320c, 1320d and 1320e included in the first encoding unit 1300 satisfy a condition for processing in a predetermined order, and the condition relates to whether at least one of the width and height of the second encoding units 1310a and 1310b is divided in half along the boundary of the third encoding units 1320a and 1320b and the third encoding units 1320c, 1320d and 1320 e. For example, the third encoding units 1320a and 1320b determined by dividing the height of the non-square left second encoding unit 1310a by half satisfy the condition. Since the boundary of the third coding units 1320c, 1320d, and 1320e determined by dividing the right second coding unit 1310b into three coding units does not divide the width or height of the right second coding unit 1310b in half, it may be determined that the third coding units 1320c, 1320d, and 1320e do not satisfy the condition. When the condition is not satisfied as described above, the image decoding apparatus may determine that the scanning order is discontinuous, and determine that the right second encoding unit 1310b is divided into odd-numbered encoding units based on the determination result. According to the embodiment, when the coding unit is divided into an odd number of coding units, the image decoding apparatus may apply a predetermined restriction to the coding unit at a predetermined position in the divided coding units. The limits or the predetermined locations have been described above with respect to various embodiments, and thus a detailed description thereof will not be provided herein.

Fig. 14 illustrates a process of determining at least one coding unit by dividing the first coding unit 1400, which is performed by the image decoding apparatus according to the embodiment.

According to an embodiment, the image decoding apparatus may divide the first encoding unit 1400 based on at least one of block shape information and division shape mode information obtained by the receiver 110. The square first coding unit 1400 may be divided into four square coding units or may be divided into a plurality of non-square coding units. For example, referring to fig. 14, when the block shape information indicates that the first encoding unit 1400 has a square shape and the division shape mode information indicates that the first encoding unit 1400 is divided into non-square encoding units, the image decoding apparatus may divide the first encoding unit 1400 into a plurality of non-square encoding units. In detail, when the division shape mode information indicates that an odd number of coding units are determined by dividing the first coding unit 1400 in the horizontal direction or the vertical direction, the image decoding apparatus may divide the square-shaped first coding unit 1400 into the odd number of coding units (e.g., the second coding units 1410a, 1410b, and 1410c determined by dividing the square-shaped first coding unit 1400 in the vertical direction, or the second coding units 1420a, 1420b, and 1420c determined by dividing the square-shaped first coding unit 1400 in the horizontal direction).

According to an embodiment, the image decoding apparatus may determine whether the second coding units 1410a, 1410b, 1410c, 1420a, 1420b, and 1420c included in the first coding unit 1400 satisfy a condition of processing in a predetermined order, and the condition relates to whether at least one of the width and height of the first coding unit 1400 is divided in half along the boundary of the second coding units 1410a, 1410b, 1410c, 1420a, 1420b, and 1420 c. Referring to fig. 14, since the boundaries of the second encoding units 1410a, 1410b, and 1410c determined by dividing the square-shaped first encoding unit 1400 in the vertical direction do not divide the width of the first encoding unit 1400 in half, it may be determined that the first encoding unit 1400 does not satisfy the condition for processing in a predetermined order. In addition, since the boundaries of the second encoding units 1420a, 1420b, and 1420c determined by dividing the square-shaped first encoding unit 1400 in the horizontal direction do not divide the height of the first encoding unit 1400 in half, it may be determined that the first encoding unit 1400 does not satisfy the condition for performing the processing in the predetermined order. When the condition is not satisfied as described above, the image decoding apparatus may determine that the scan order is discontinuous, and may determine that the first encoding unit 1400 is divided into an odd number of encoding units based on a result of the determination. According to the embodiment, when the coding unit is divided into an odd number of coding units, the image decoding apparatus may apply a predetermined restriction to the coding unit at a predetermined position in the divided coding units. The limits or the predetermined locations have been described above with respect to various embodiments, and thus a detailed description thereof will not be provided herein.

According to the embodiment, the image decoding apparatus may determine the coding units of various shapes by dividing the first coding unit.

Referring to fig. 14, the image decoding apparatus may divide a square first encoding unit 1400 or a non-square first encoding unit 1430 or 1450 into various shaped encoding units.

Fig. 15 illustrates that when the second encoding unit having a non-square shape determined by dividing the first encoding unit 1500 satisfies a predetermined condition, shapes into which the second encoding unit can be divided by the image decoding apparatus are limited according to the embodiment.

According to an embodiment, the image decoding apparatus may determine to divide the square first coding unit 1500 into the non-square second coding units 1510a, 1510b, 1520a, and 1520b based on at least one of block shape information and division shape mode information obtained by the receiver 110. The second encoding units 1510a, 1510b, 1520a, and 1520b may be independently divided. As such, the video decoding apparatus may determine whether to divide or not to divide the first coding unit 1500 into the plurality of coding units based on at least one of the block shape information and the division shape mode information of each of the second coding units 1510a, 1510b, 1520a, and 1520 b. According to the embodiment, the image decoding apparatus may determine the third encoding units 1512a and 1512b by dividing the non-square left-side second encoding unit 1510a determined by dividing the first encoding unit 1500 in the vertical direction in the horizontal direction. However, when the left second encoding unit 1510a is divided in the horizontal direction, the image decoding apparatus may restrict the right second encoding unit 1510b from being divided in the horizontal direction in which the left second encoding unit 1510a is divided. When the third encoding units 1514a and 1514b are determined by dividing the right-side second encoding unit 1510b in the same direction, the third encoding units 1512a, 1512b, 1514a, and 1514b may be determined because the left-side second encoding unit 1510a and the right-side second encoding unit 1510b are independently divided in the horizontal direction. However, this case is equivalent to the case where the image decoding apparatus divides the first coding unit 1500 into the four square second coding units 1530a, 1530b, 1530c, and 1530d based on at least one of the block shape information and the division shape mode information, and may be inefficient in terms of image decoding.

According to an embodiment, the image decoding apparatus may determine the third coding units 1522a, 1522b, 1524a, and 1524b by dividing the non-square second coding unit 1520a or 1520b determined by dividing the first coding unit 1500 in the horizontal direction in the vertical direction. However, when the second encoding unit (e.g., the upper second encoding unit 1520a) is divided in the vertical direction, the image decoding apparatus may restrict another second encoding unit (e.g., the lower second encoding unit 1520b) from being divided in the vertical direction in which the upper second encoding unit 1520a is divided, for the above-described reason.

Fig. 16 illustrates a process of dividing a square encoding unit performed by the image decoding apparatus when the division shape mode information indicates that the square encoding unit is not to be divided into four square encoding units according to the embodiment.

According to an embodiment, the image decoding apparatus may determine the second encoding units 1616a, 1616b, 1620a, 1620b, etc. by dividing the first encoding unit 1600 based on at least one of block shape information and division shape mode information. The division shape mode information may include information on various methods of dividing the coding unit, but the information on various division methods may not include information for dividing the coding unit into four square coding units. According to such division shape mode information, the image decoding apparatus may not divide the square first encoding unit 1600 into four square second encoding units 1630a, 1630b, 1630c, and 1630 d. The image decoding apparatus may determine the non-square second coding units 1610a, 1610b, 1620a, 1620b, and the like based on the division shape mode information.

According to an embodiment, the image decoding apparatus may independently divide the non-square second coding units 1610a, 1610b, 1620a, 1620b, and the like. Each of the second coding units 1610a, 1610b, 1620a, 1620b, etc. may be recursively divided in a predetermined order based on at least one of block shape information and division shape mode information, and the dividing method may correspond to a method of dividing the first coding unit 1600.

For example, the image decoding apparatus may determine the square third coding units 1612a and 1612b by dividing the left second coding unit 1610a in the horizontal direction, and may determine the square third coding units 1614a and 1614b by dividing the right second coding unit 1610b in the horizontal direction. Also, the image decoding apparatus may determine the square third encoding units 1616a, 1616b, 1616c, and 1616d by dividing both the left second encoding unit 1610a and the right second encoding unit 1610b in the horizontal direction. In this case, coding units having the same shape as the second coding units 1630a, 1630b, 1630c, and 1630d of four squares divided from the first coding unit 1600 may be determined.

As another example, the image decoding apparatus may determine the third encoding units 1622a and 1622b of a square shape by dividing the upper second encoding unit 1620a in the vertical direction, and may determine the third encoding units 1624a and 1624b of a square shape by dividing the lower second encoding unit 1620b in the vertical direction. Further, the image decoding apparatus may determine the third encoding units 1626a, 1626b, 1626c, and 1626d of a square shape by dividing both the upper second encoding unit 1620a and the lower second encoding unit 1620b in the vertical direction. In this case, coding units having the same shape as the second coding units 1630a, 1630b, 1630c, and 1630d of four squares divided from the first coding unit 1600 may be determined.

Fig. 17 illustrates that a processing order between a plurality of coding units may be changed according to a process of dividing the coding units according to an embodiment.

According to an embodiment, the image decoding apparatus may divide the first encoding unit 1700 based on the block shape information and the divided shape mode information. When the block shape information indicates a square shape and the division shape mode information indicates that the first encoding unit 1700 is divided in at least one of the horizontal direction and the vertical direction, the image decoding apparatus may determine the second encoding unit (e.g., 1710a, 1710b, 1720a, 1720b, etc.) by dividing the first encoding unit 1700. Referring to fig. 17, non-square second coding units 1710a, 1710b, 1720a, and 1720b determined by dividing the first coding unit 1700 only in the horizontal direction or the vertical direction may be independently divided based on block shape information and divided shape mode information of each coding unit. For example, the image decoding apparatus may determine the third encoding units 1716a, 1716b, 1716c, and 1716d by dividing the second encoding units 1710a and 1710b generated by dividing the first encoding unit 1700 in the vertical direction in the horizontal direction, and may determine the third encoding units 1726a, 1726b, 1726c, and 1726d by dividing the second encoding units 1720a and 1720b generated by dividing the first encoding unit 1700 in the horizontal direction in the vertical direction. The operation of dividing the second encoding units 1710a, 1710b, 1720a, and 1720b has been described above with respect to fig. 16, and thus a detailed description thereof will not be provided herein.

According to an embodiment, the image decoding apparatus may process the encoding units in a predetermined order. The operation of processing the coding units in a predetermined order has been described above with respect to fig. 17, and thus a detailed description thereof will not be provided here. Referring to fig. 17, the image decoding apparatus may determine four square third encoding units 1716a, 1716b, 1716c, and 1716d and third encoding units 1726a, 1726b, 1726c, and 1726d by dividing the square first encoding unit 1700. According to an embodiment, the image decoding apparatus may determine the processing order of the third encoding units 1716a, 1716b, 1716c, and 1716d and the third encoding units 1726a, 1726b, 1726c, and 1726d based on the dividing method of the first encoding unit 1700.

According to the embodiment, the image decoding apparatus may determine the third encoding units 1716a, 1716b, 1716c, and 1716d by dividing the second encoding units 1710a and 1710b generated by the first encoding unit 1700 in the vertical direction in the horizontal direction, and may process the third encoding units 1716a, 1716b, 1716c, and 1716d in the processing order 1717, wherein the processing order 1717 first processes the third encoding units 1716a and 1716c included in the left-side second encoding unit 1710a in the vertical direction and then processes the third encoding units 1716b and 1716d included in the right-side second encoding unit 1710b in the vertical direction.

According to the embodiment, the image decoding apparatus may determine the third encoding units 1726a, 1726b, 1726c, and 1726d by vertically dividing the second encoding units 1720a and 1720b generated by horizontally dividing the first encoding unit 1700, and may process the third encoding units 1726a, 1726b, 1726c, and 1726d in a processing order 1727, wherein the processing order 1727 first processes the third encoding units 1726a and 1726b included in the upper second encoding unit 1720a in the horizontal direction and then processes the third encoding units 1726c and 1726d included in the lower second encoding unit 1720b in the horizontal direction.

Referring to fig. 17, third encoding units 1716a, 1716b, 1716c, and 1716d and third encoding units 1726a, 1726b, 1726c, and 1726d of squares may be determined by dividing the second encoding units 1710a, 1710b, 1720a, and 1720b, respectively. Although second coding units 1710a and 1710b determined by dividing the first coding unit 1700 in the vertical direction are different from second coding units 1720a and 1720b determined by dividing the first coding unit 1700 in the horizontal direction, third coding units 1716a, 1716b, 1716c, and 1716d and third coding units 1726a, 1726b, 1726c, and 1726d divided from the second coding units finally show coding units of the same shape divided from the first coding unit 1700. In this way, by recursively dividing the coding unit in different manners based on at least one of the block shape information and the divided shape mode information, the image decoding apparatus can process the plurality of coding units in different orders even if the coding units are finally determined to be the same shape.

According to an embodiment, the image decoding apparatus may determine the depth of the coding unit based on a predetermined criterion. For example, the predetermined criterion may be the length of a long side of the coding unit. When the length of the long side of the coding unit before being divided is 2n times (n >0) the length of the long side of the current coding unit after being divided, the image decoding apparatus may determine that the depth of the current coding unit is increased by n from the depth of the coding unit before being divided. In the following description, a coding unit having an increased depth is represented as a coding unit having a deeper depth.

Referring to fig. 18, according to an embodiment, an image decoding apparatus may determine second and third encoding units 1802 and 1804 deeper by dividing a first encoding unit 1800 of a SQUARE based on block shape information indicating a SQUARE shape (e.g., the block shape information may be expressed as "0: SQUARE"). Assuming that the size of the square first coding unit 1800 is 2N × 2N, the second coding unit 1802 determined by dividing the width and height of the first coding unit 1800 into 1/2 may have a size of N × N. Further, the third coding unit 1804, which is determined by dividing the width and height of the second coding unit 1802 to 1/2, may have a size of N/2 XN/2. In this case, the width and height of the third coding unit 1804 is 1/4 the width and height of the first coding unit 1800. When the depth of the first coding unit 1800 is D, the depth of the second coding unit 1802, whose width and height are 1/2 of the width and height of the first coding unit 1800, may be D +1, and the depth of the third coding unit 1804, whose width and height are 1/4 of the first coding unit 1800, may be D + 2.

According to an embodiment, the image decoding apparatus may determine the deeper-depth second encoding unit 1812 or 1822 and the third encoding unit 1814 or 1824 by dividing the non-square first encoding unit 1810 or 1820 based on block shape information indicating a non-square shape (e.g., block shape information may be represented as "1: NS _ VER" indicating a non-square shape having a height greater than a width, or as "2: NS _ HOR" indicating a non-square shape having a width greater than a height).

The image decoding apparatus may determine the second encoding unit 1802, 1812, or 1822 by dividing at least one of the width and the height of the first encoding unit 1810 having a size of N × 2N. That is, the image decoding apparatus may determine the second encoding unit 1802 having the size of N × N or the second encoding unit 1822 having the size of N × N/2 by dividing the first encoding unit 1810 in the horizontal direction, or may determine the second encoding unit 1812 having the size of N/2 × N by dividing the first encoding unit 1810 in the horizontal direction and the vertical direction.

According to an embodiment, the image decoding apparatus may determine the second encoding unit 1802, 1812, or 1822 by dividing at least one of the width and the height of the first encoding unit 1820 having a size of 2N × N. That is, the image decoding apparatus may determine the second encoding unit 1802 having a size of N × N or the second encoding unit 1812 having a size of N/2 × N by dividing the first encoding unit 1820 in the vertical direction, or may determine the second encoding unit 1822 having a size of N × N/2 by dividing the first encoding unit 1820 in the horizontal direction and the vertical direction.

According to an embodiment, the image decoding apparatus may determine the third encoding unit 1804, 1814, or 1824 by dividing at least one of the width and the height of the second encoding unit 1802 having a size of N × N. That is, the image decoding apparatus may determine the third encoding unit 1804 having a size of N/2 × N/2, the third encoding unit 1814 having a size of N/4 × N/2, or the third encoding unit 1824 having a size of N/2 × N/4 by dividing the second encoding unit 1802 in the vertical direction and the horizontal direction.

According to an embodiment, the image decoding apparatus may determine the third encoding unit 1804, 1814, or 1824 by dividing at least one of the width and the height of the second encoding unit 1812 having a size of N/2 × N. That is, the image decoding apparatus may determine the third encoding unit 1804 having a size of N/2 × N/2 or the third encoding unit 1824 having a size of N/2 × N/4 by dividing the second encoding unit 1812 in the horizontal direction, or may determine the third encoding unit 1814 having a size of N/4 × N/2 by dividing the second encoding unit 1812 in the vertical direction and the horizontal direction.

According to an embodiment, the image decoding apparatus may determine the third encoding unit 1804, 1814, or 1824 by dividing at least one of the width and the height of the second encoding unit 1822 having a size of N × N/2. That is, the image decoding apparatus may determine the third encoding unit 1804 having a size of N/2 × N/2 or the third encoding unit 1814 having a size of N/4 × N/2 by dividing the second encoding unit 1822 in the vertical direction, or may determine the third encoding unit 1824 having a size of N/2 × N/4 by dividing the second encoding unit 1822 in the vertical direction and the horizontal direction.

According to an embodiment, the image decoding apparatus may divide the square encoding unit 1800, 1802, or 1804 in a horizontal direction or a vertical direction. For example, the image decoding apparatus may determine the first coding unit 1810 having the size of N × 2N by dividing the first coding unit 1800 having the size of 2N × 2N in the vertical direction, or may determine the first coding unit 1820 having the size of 2N × N by dividing the first coding unit 1800 in the horizontal direction. According to an embodiment, when determining a depth based on the length of the longest side of a coding unit, the depth of the coding unit determined by dividing the first coding unit 1800 having a size of 2N × 2N in the horizontal direction or the vertical direction may be the same as the depth of the first coding unit 1800.

According to an embodiment, the width and height of the third encoding unit 1814 or 1824 may be 1/4 of the width and height of the first encoding unit 1810 or 1820. When the depth of the first coding unit 1810 or 1820 is D, the depth of the second coding unit 1812 or 1822 having the width and height 1/2 of the first coding unit 1810 or 1820 may be D +1, and the depth of the third coding unit 1814 or 1824 having the width and height 1/4 of the first coding unit 1810 or 1820 may be D + 2.

Fig. 19 illustrates a depth that can be determined based on the shape and size of a coding unit and a Partial Index (PID) for distinguishing the coding units according to an embodiment.

According to an embodiment, the image decoding apparatus may determine the second encoding units of various shapes by dividing the first encoding unit 1900 of a square. Referring to fig. 19, the image decoding apparatus may determine second coding units 1902a and 1902b, second coding units 1904a and 1904b, and second coding units 1906a, 1906b, 1906c, and 1906d by dividing the first coding unit 1900 in at least one of the vertical direction and the horizontal direction based on the division shape mode information. That is, the image decoding apparatus may determine the second encoding units 1902a and 1902b, the second encoding units 1904a and 1904b, and the second encoding units 1906a, 1906b, 1906c, and 1906d based on the partition shape mode information of the first encoding unit 1900.

According to an embodiment, the depths of the second coding units 1902a and 1902b, 1904a and 1904b, and 1906a, 1906b, 1906c, and 1906d, which are determined based on the partition shape mode information of the square-shaped first coding unit 1900, may be determined based on the lengths of the long sides thereof. For example, because the length of the side of the square first coding unit 1900 is equal to the length of the long side of the non-square second coding units 1902a and 1902b and the second coding units 1904a and 1904b, the first coding unit 1900 and the non-square second coding units 1902a and 1902b and the second coding units 1904a and 1904b may have the same depth, e.g., D. However, when the image decoding apparatus divides the first coding unit 1900 into the four square second coding units 1906a, 1906b, 1906c, and 1906D based on the division shape mode information, since the length of the side of the square second coding units 1906a, 1906b, 1906c, and 1906D is 1/2 that is the length of the side of the first coding unit 1900, the depth of the second coding units 1906a, 1906b, 1906c, and 1906D may be D +1 that is 1 deeper than the depth D of the first coding unit 1900.

According to an embodiment, the image decoding apparatus may determine the plurality of second encoding units 1912a and 1912b and the second encoding units 1914a, 1914b, and 1914c by dividing the first encoding unit 1910 having a height greater than a width in the horizontal direction based on the division shape mode information. According to the embodiment, the image decoding apparatus may determine the plurality of second encoding units 1922a and 1922b and the second encoding units 1924a, 1924b, and 1924c by dividing the first encoding unit 1920 having the width greater than the height in the vertical direction based on the division shape mode information.

According to an embodiment, the depth of the second coding units 1912a, 1912b, 1914a, 1914b, 1914c, 1922a, 1922b, 1924a, 1924b and 1924c determined based on the partition shape mode information of the non-square first coding unit 1910 or 1920 may be determined based on the length of the long side thereof. For example, because the length of the sides of square second coding elements 1912a and 1912b is 1/2 with a height greater than the length of the long side of non-square shaped first coding element 1910 by width, the depth of square second coding elements 1912a and 1912b is D +1 that is 1 deeper than depth D of non-square first coding element 1910.

In addition, the image decoding apparatus may divide the non-square first coding unit 1910 into odd number of second coding units 1914a, 1914b, and 1914c based on the division shape mode information. The odd number of second coding units 1914a, 1914b, and 1914c may include non-square second coding units 1914a and 1914c and square second coding unit 1914 b. In this case, since the length of the long sides of the non-square second coding units 1914a and 1914c and the length of the side of the square second coding unit 1914b are 1/2 of the length of the long side of the first coding unit 1910, the depths of the second coding units 1914a, 1914b, and 1914c may be D +1 that is 1 deeper than the depth D of the non-square first coding unit 1910. The image decoding apparatus may determine the depth of the coding units divided from the first coding unit 1920 having the non-square shape having the width greater than the height by using the above-described method of determining the depth of the coding units divided from the first coding unit 1910.

According to an embodiment, when the divided odd-numbered coding units do not have the equal size, the image decoding apparatus may determine the PIDs for identifying the divided coding units based on a size ratio between the coding units. Referring to fig. 19, the divided coding unit 1914b of the center position among the odd-numbered coding units 1914a, 1914b, and 1914c may have a width equal to that of the other coding units 1914a and 1914c and a height twice that of the other coding units 1914a and 1914 c. That is, in this case, the coding unit 1914b at the center position may include two other coding units 1914a or 1914 c. Thus, when the PID of the coding unit 1914b at the center position based on the scanning order is 1, the PID of the coding unit 1914c located adjacent to the coding unit 1914b may be increased by 2 and thus may be 3. That is, there may be discontinuities in the PID values. According to an embodiment, the image decoding apparatus may determine whether odd-numbered divided coding units do not have an equal size based on whether there is a discontinuity in PIDs for identifying the divided coding units.

According to an embodiment, the image decoding apparatus may determine whether to use a specific division method based on PID values for identifying a plurality of coding units determined by dividing a current coding unit. Referring to fig. 19, the image decoding apparatus may determine even-numbered coding units 1912a and 1912b or odd-numbered coding units 1914a, 1914b, and 1914c by dividing a first coding unit 1910 having a rectangular shape with a height greater than a width. The image decoding apparatus may identify the respective coding units using the PIDs. According to an embodiment, the PID may be obtained from a sample point (e.g., upper left sample point) of a predetermined position of each coding unit.

According to the embodiment, the image decoding apparatus may determine the coding unit at a predetermined position among the divided coding units by using the PID for distinguishing the coding units. According to an embodiment, when the divided shape mode information of the first coding unit 1910 having a rectangular shape with a height greater than a width indicates that a coding unit is divided into three coding units, the image decoding apparatus may divide the first coding unit 1910 into three coding units 1914a, 1914b, and 1914 c. The image decoding apparatus may allocate a PID to each of the three coding units 1914a, 1914b, and 1914 c. The image decoding apparatus may compare PIDs of odd-numbered divided coding units to determine a coding unit at a center position among the coding units. The image decoding apparatus may determine the coding unit 1914b having a PID corresponding to an intermediate value among PIDs of coding units as a coding unit at a center position among coding units determined by dividing the first coding unit 1910. According to an embodiment, when the divided coding units do not have an equal size, the image decoding apparatus may determine the PIDs for distinguishing the divided coding units based on a size ratio between the coding units. Referring to fig. 19, a width of a coding unit 1914b generated by dividing a first coding unit 1910 may be equal to widths of other coding units 1914a and 1914c, and a height may be twice as large as heights of the other coding units 1914a and 1914 c. In this case, when the PID of the coding unit 1914b at the center position is 1, the PID of the coding unit 1914c located adjacent to the coding unit 1914b may be increased by 2 and thus may be 3. When the PID does not uniformly increase as described above, the image decoding apparatus may determine to divide the coding unit into a plurality of coding units including a coding unit having a size different from sizes of other coding units. According to an embodiment, when the division shape mode information indicates that the coding unit is divided into an odd number of coding units, the image decoding apparatus may divide the current coding unit in such a manner that a coding unit of a predetermined position (e.g., a coding unit of a center position) among the odd number of coding units has a size different from sizes of other coding units. In this case, the image decoding apparatus may determine the coding units having the center positions of different sizes by using the PIDs of the coding units. However, the PID and size or position of the coding unit of the predetermined position are not limited to the above examples, and various PIDs of the coding unit and various positions and sizes may be used.

According to the embodiment, the image decoding apparatus may use a predetermined data unit in which the coding unit starts to be recursively divided.

Fig. 20 illustrates determining a plurality of coding units based on a plurality of predetermined data units included in a picture according to an embodiment.

According to an embodiment, the predetermined data unit may be defined as a data unit for starting to recursively divide the coding unit by using at least one of the block shape information and the division shape mode information. That is, the predetermined data unit may correspond to a coding unit for determining the highest depth of a plurality of coding units divided from a current picture. In the following description, for convenience of explanation, a predetermined data unit is referred to as a reference data unit.

According to an embodiment, the reference data unit may have a predetermined size and a predetermined size shape. According to an embodiment, the reference coding unit may include M × N samples. Here, M and N may be equal to each other and may be integers expressed as powers of 2. That is, the reference data unit may have a square shape or a non-square shape, and may be divided into an integer number of coding units.

According to an embodiment, an image decoding apparatus may divide a current picture into a plurality of reference data units. According to the embodiment, the image decoding apparatus may divide a plurality of reference data units divided from a current picture by using division information on each reference data unit. The operation of dividing the reference data unit may correspond to a dividing operation using a quadtree structure.

According to an embodiment, the image decoding apparatus may determine in advance a minimum size allowed for a reference data unit included in a current picture. Accordingly, the image decoding apparatus may determine various reference data units having a size equal to or greater than the minimum size, and may determine one or more coding units by using the block shape information and the partition shape mode information with reference to the determined reference data units.

Referring to fig. 20, the image decoding apparatus may use a square reference coding unit 2000 or a non-square reference coding unit 2002. According to an embodiment, the shape and size of a reference coding unit may be determined based on various data units (e.g., sequences, pictures, slices, slice segments, maximum coding units, etc.) that can include one or more reference coding units.

According to an embodiment, the receiver 110 of the image decoding apparatus may obtain at least one of reference coding unit shape information and reference coding unit size information regarding each of various data units from a bitstream. The operation of dividing the square reference coding unit 2000 into one or more coding units has been described above with respect to the operation of dividing the current coding unit 300 of fig. 18, and the operation of dividing the non-square reference coding unit 2002 into one or more coding units has been described above with respect to the operation of dividing the current coding unit 900 or 950 of fig. 9. Therefore, a detailed description thereof will not be provided herein.

According to an embodiment, the image decoding apparatus may use the PID for identifying the size and shape of the reference coding unit to determine the size and shape of the reference coding unit according to some data units previously determined based on a predetermined condition. That is, the receiver 110 may obtain only a PID for identifying a size and a shape of a reference coding unit for each slice, slice segment, or maximum coding unit, which is a data unit (e.g., a data unit having a size equal to or smaller than a slice) satisfying a predetermined condition among various data units (e.g., a sequence, a picture, a slice segment, a maximum coding unit, etc.), from a bitstream. The image decoding apparatus can determine the size and shape of the reference data unit for each data unit satisfying a predetermined condition by using the PID. When the reference coding unit shape information and the reference coding unit size information are obtained and used from the bitstream according to each data unit having a relatively small size, the efficiency of using the bitstream may not be high, and thus only the PID may be obtained and used instead of directly obtaining the reference coding unit shape information and the reference coding unit size information. In this case, at least one of the size and shape of the reference coding unit corresponding to the PID for identifying the size and shape of the reference coding unit may be predetermined. That is, the image decoding apparatus may determine at least one of the size and the shape of the reference coding unit included in the data unit serving as the unit for obtaining the PID by selecting at least one of the size and the shape of the predetermined reference coding unit based on the PID.

According to an embodiment, the image decoding apparatus may use one or more reference coding units included in the maximum coding unit. That is, the maximum coding unit divided from the picture may include one or more reference coding units, and the coding unit may be determined by recursively dividing each reference coding unit. According to an embodiment, at least one of the width and the height of the maximum coding unit may be an integer multiple of at least one of the width and the height of the reference coding unit. According to an embodiment, the size of the reference coding unit may be obtained by dividing the maximum coding unit n times based on a quadtree structure. That is, according to various embodiments, the image decoding apparatus may determine the reference coding unit by dividing the maximum coding unit n times based on the quadtree structure, and may divide the reference coding unit based on at least one of the block shape information and the divided shape mode information.

Fig. 21 illustrates processing blocks used as a unit for determining the determination order of reference coding units included in a picture 2100 according to an embodiment.

According to an embodiment, an image decoding apparatus may determine one or more processing blocks divided from a picture. The processing block is a data unit including one or more reference coding units divided from a picture, and the one or more reference coding units included in the processing block may be determined according to a specific order. That is, the determination order of the one or more reference coding units determined in each processing block may correspond to one of various types of orders for determining the reference coding units, and may vary according to the processing block. The determined order of the reference coding units determined for each processing block may be one of various orders (e.g., a raster scan order, a zigzag scan, an N-shaped scan, an upper right diagonal scan, a horizontal scan, and a vertical scan), but is not limited to the above scan order.

According to an embodiment, an image decoding apparatus may obtain processing block size information and may determine a size of one or more processing blocks included in a picture. The image decoding apparatus may obtain processing block size information from a bitstream and may determine a size of one or more processing blocks included in a picture. The size of the processing block may be a predetermined size of the data unit indicated by the processing block size information.

According to an embodiment, the receiver 110 of the image decoding apparatus may obtain the processing block size information from the bitstream according to each specific data unit. For example, the processing block size information may be obtained from a bitstream in units of data such as images, sequences, pictures, slices, or slice segments. That is, the receiver 110 may obtain the processing block size information from the bitstream according to each of various data units, and the image decoding apparatus may determine the size of one or more processing blocks divided from the picture by using the obtained processing block size information. The size of the processing block may be an integer multiple of the size of the reference coding unit.

According to an embodiment, the image decoding apparatus may determine the sizes of the processing blocks 2102 and 2112 included in the picture 2100. For example, the image decoding apparatus may determine the size of the processing block based on processing block size information obtained from the bitstream. Referring to fig. 21, according to an embodiment, the image decoding apparatus may determine that the width of the processing blocks 2102 and 2112 is four times the width of the reference coding unit, and may determine that the height of the processing blocks 2102 and 2112 is four times the height of the reference coding unit. The image decoding apparatus may determine a determination order of one or more reference coding units in one or more processing blocks.

According to an embodiment, the image decoding apparatus may determine the processing blocks 2102 and 2112 included in the picture 2100 based on the sizes of the processing blocks, and may determine the determination order of one or more reference coding units in the processing blocks 2102 and 2112. According to an embodiment, determining the reference coding unit may include determining a size of the reference coding unit.

According to an embodiment, the image decoding apparatus may obtain determination order information of one or more reference coding units included in one or more processing blocks from a bitstream, and may determine a determination order for the one or more reference coding units based on the obtained determination order information. The determination order information may be defined to determine an order or direction of the reference coding unit in the processing block. That is, the determination order of the reference coding units may be independently determined for each processing block.

According to the embodiment, the image decoding apparatus may obtain the determination order information of the reference coding unit from the bitstream according to each specific data unit. For example, the receiver 110 may obtain the determined order information of the reference coding unit from the bitstream according to each data unit such as an image, a sequence, a picture, a slice segment, or a processing block. Since the determination order information of the reference coding unit indicates the order in which the reference coding unit is determined in the processing block, the determination order information may be obtained for each specific data unit including an integer number of processing blocks.

According to an embodiment, the image decoding apparatus may determine one or more reference encoding units based on the determined determination order.

According to an embodiment, the receiver 110 may obtain the determined order information of the reference coding units from the bitstream as information related to the processing blocks 2102 and 2112, and the image decoding apparatus may determine the determined order of one or more reference coding units included in the processing blocks 2102 and 2112 and determine one or more reference coding units included in the picture 2100 based on the determined order. Referring to fig. 21, the image decoding apparatus may determine the determination orders 2104 and 2114 of one or more reference encoding units in the processing blocks 2102 and 2112, respectively. For example, when the determination order information of the reference coding unit is obtained for each processing block, different types of determination order information of the reference coding unit may be obtained for the processing blocks 2102 and 2112. When the determination order 2104 of the reference coding units in the processing block 2102 is a raster scan order, the reference coding units included in the processing block 2102 may be determined according to the raster scan order. In contrast, when the determination order 2114 of the reference coding unit in another processing block 2112 is the backward raster scan order, the reference coding unit included in the processing block 2112 may be determined according to the backward raster scan order.

According to an embodiment, the image decoding apparatus may decode the determined one or more reference coding units. The image decoding apparatus may decode the image based on the reference encoding unit determined as described above. The method of decoding the reference coding unit may include various image decoding methods.

According to the embodiment, the image decoding apparatus may obtain block shape information indicating a shape of a current coding unit or partition shape mode information indicating a partition method of the current coding unit from a bitstream, and may use the obtained information. Block shape information or division shape mode information may be included in a bitstream related to various data units. For example, the image decoding apparatus may use block shape information or partition shape mode information included in a sequence parameter set, a picture parameter set, a video parameter set, a slice header, or a slice segment header. Further, the image decoding apparatus may obtain a syntax corresponding to the block shape information or the division shape mode information from the bitstream according to each maximum coding unit, each reference coding unit, or each processing block, and may use the obtained syntax.

While the present disclosure has been particularly shown and described with reference to embodiments thereof, it will be understood by those of ordinary skill in the art that various changes in form and details may be made therein without departing from the spirit and scope of the present disclosure as defined by the following claims. Accordingly, the embodiments should be considered in a descriptive sense only and not for purposes of limitation. Therefore, the scope of the present disclosure is defined not by the detailed description of the present disclosure but by the appended claims, and all differences within the scope will be construed as being included in the present disclosure.

The method may be implemented as a program executed in a computer, and may be implemented in a general-purpose digital computer for executing the program by using a computer readable recording medium. The computer-readable recording medium may include magnetic storage media (e.g., Read Only Memory (ROM), floppy disks, hard disks, etc.) and optical recording media (e.g., Compact Disks (CD) -ROMs, Digital Versatile Disks (DVDs), etc.).

52页详细技术资料下载

上一篇：一种医用注射器针头装配设备

下一篇：用于在虚拟现实应用程序中发送信号通知与组成图片相关联的信息的系统和方法

Image encoding method and apparatus, and image decoding method and apparatus

相关技术

网友询问留言