Intra-frame coding speed optimization method, device and medium based on historical information

文档序号：1925601 发布日期：2021-12-03 浏览：21次中文

阅读说明：本技术 基于历史信息的帧内编码速度优化方法、装置及介质 (Intra-frame coding speed optimization method, device and medium based on historical information ) 是由梁凡贾一凡于 2021-08-05 设计创作，主要内容包括：本发明公开了基于历史信息的帧内编码速度优化方法、装置及介质,方法包括：获取编码单元；当所述编码单元为第一轮编码单元时,对所述第一轮编码单元的相关指标进行计算,并根据所述相关指标的计算结果对所述第一轮编码单元进行编码；将所述相关指标的计算结果与动态阈值进行比较,确定编码单元的划分类型；其中,所述动态阈值根据编码单元的历史划分信息进动态调整；当所述编码单元为后续轮编码单元时,判断所述后续轮编码单元是否提前终止划分；若是,则返回获取编码单元的步骤；反之,则对所述后续轮编码单元进行编码；完成对所有编码单元的编码操作。本发明的兼容性好且速度快,可广泛应用于视频编码技术领域。(The invention discloses a method, a device and a medium for optimizing intra-frame coding speed based on historical information, wherein the method comprises the following steps: acquiring a coding unit; when the coding unit is a first round coding unit, calculating a related index of the first round coding unit, and coding the first round coding unit according to the calculation result of the related index; comparing the calculation result of the correlation index with a dynamic threshold value to determine the division type of the coding unit; wherein the dynamic threshold is dynamically adjusted according to historical partition information of the coding units; when the coding unit is a subsequent wheel coding unit, judging whether the subsequent wheel coding unit terminates division in advance; if yes, returning to the step of acquiring the coding unit; otherwise, encoding the subsequent wheel encoding unit; and completing the coding operation on all the coding units. The invention has good compatibility and high speed, and can be widely applied to the technical field of video coding.)

1. An intra-frame coding speed optimization method based on historical information is characterized by comprising the following steps:

acquiring a coding unit;

when the coding unit is a first round coding unit, calculating a related index of the first round coding unit, and coding the first round coding unit according to the calculation result of the related index;

comparing the calculation result of the correlation index with a dynamic threshold value to determine the division type of the coding unit; wherein the dynamic threshold is dynamically adjusted according to historical partition information of the coding units;

when the coding unit is a subsequent wheel coding unit, judging whether the subsequent wheel coding unit terminates division in advance; if yes, returning to the step of acquiring the coding unit; otherwise, encoding the subsequent wheel encoding unit;

and completing the coding operation on all the coding units.

2. The method of claim 1, wherein the method further comprises:

and judging each coding unit in the coding tree unit, and judging whether the coding unit is a first round coding unit or a subsequent round coding unit.

3. The method according to claim 1, wherein the correlation indicators comprise texture information, horizontal gradient values and vertical gradient values, and the calculating the correlation indicator of the first round of coding units comprises:

calculating horizontal gradient values of the first round of coding units;

calculating a vertical gradient value of the first round of encoding units;

calculating texture information of the first round of coding units according to the horizontal gradient value and the vertical gradient value;

wherein, the calculation formula of the horizontal gradient value Gx is as follows:

the calculation formula of the vertical gradient value Gy is as follows;

the texture information T (i, j) is calculated by the following formula:

T(i，j)＝|Gx(i，j)|+|Gy(i，j)|

where P represents a pixel matrix of 3 × 3 size centered on the pixel value of the (i, j) position; (i, j) represents the position of the jth row and ith column in the image.

4. The method of claim 3, wherein the method further comprises:

in the step of calculating the correlation index of the first round of coding units, an average texture value of a plurality of sub-units in the first round of coding units is calculated.

5. The method as claimed in claim 1, wherein the comparing the calculation result of the correlation index with a dynamic threshold to determine the partition type of the coding unit comprises:

for the first round of coding units, when the rate distortion cost is smaller than the rate distortion cost corresponding to the type to be divided, the dynamic threshold value is adjusted;

the adjustment formula of the dynamic threshold is as follows:

where Thr represents the adjusted threshold; thr _ old represents the threshold before adjustment; t represents the average texture value of the first round of coding units.

6. The method of claim 1, wherein the determining whether the subsequent round of coding units terminates partitioning early comprises:

in the sub-strategy aiming at the homogeneity, for the type to be divided, if the average texture value of the coding unit is smaller than the corresponding dynamic threshold value, the type to be divided is skipped;

in the sub-strategy aiming at the directivity, judging whether a first skipping condition is met, if so, skipping the type to be divided;

in the sub-strategy for the texture difference between the sub-parts, whether a second skipping condition is met is judged, and if yes, the current partition type is terminated in advance.

7. The method of claim 6, wherein the intra coding speed is optimized based on the history information,

the expression of the first skip condition is:

the expression of the second skip condition is:

Diff_ratio＜Thr

wherein the content of the first and second substances,

represents the average horizontal gradient value of the current coding unit;represents the average vertical gradient value of the current coding unit; thr represents the threshold of the decision; BT-V represents a binary tree vertical partition mode; TT-V represents a ternary tree vertical partition mode; BT-H represents a binary tree horizontal division mode; TT-H stands for the horizontal division mode of the ternary tree(ii) a Diff _ ratio represents the sub-block disparity; ratio _1 represents the sub-block disparity 1; ratio _2 represents the sub-block disparity 2;represents the average texture value of the first sub-block;represents the average texture value of the second sub-block;represents the average texture value of the third sub-block.

8. An intra-coding speed optimization apparatus based on history information, comprising:

a first module for obtaining an encoding unit;

a second module, configured to, when the coding unit is a first-round coding unit, calculate a correlation index of the first-round coding unit, and code the first-round coding unit according to a calculation result of the correlation index;

a third module, configured to compare the calculation result of the correlation index with a dynamic threshold, and determine a partition type of the coding unit; wherein the dynamic threshold is dynamically adjusted according to historical partition information of the coding units;

a fourth module, configured to determine whether the subsequent round of coding unit terminates partitioning in advance when the coding unit is the subsequent round of coding unit; if yes, returning to the step of acquiring the coding unit; otherwise, encoding the subsequent wheel encoding unit;

and the fifth module is used for finishing the coding operation of all the coding units.

9. An electronic device comprising a processor and a memory;

the memory is used for storing programs;

the processor executing the program realizes the method according to any one of claims 1-7.

10. A computer-readable storage medium, characterized in that the storage medium stores a program, which is executed by a processor to implement the method according to any one of claims 1-7.

Technical Field

The invention relates to the technical field of video coding, in particular to a method, a device and a medium for optimizing intra-frame coding speed based on historical information.

Background

VVC (Versatile Video coding) can improve coding efficiency while maintaining subjective and objective visual quality40% and above. The improvement of coding efficiency benefits from a number of newly adopted coding techniques and tools, such as QTMT partition scheme, multi-line reference prediction (MRL), Matrix Intra Prediction (MIP), multiple transform kernel selection (MTS), low frequency non-separable transform (LFNST), Intra sub-block partitioning (Intra)ISP) and the like. These newly adopted coding tools, while effective in improving compression efficiency, also introduce coding complexity significantly. Too high coding complexity can affect the real-time performance of coding and improve the implementation difficulty of engineering landing.

Experts have called for effective control of coding complexity for dramatically increasing coding times. According to the report, compared with HEVC, configuration in full frameRandom Access configuration (Random Access) and low latency configuration(s) (( P/B), the encoding time of VVC is increased by 25, 7 and 6 times respectively, and the encoding efficiency is correspondingly improved by about 25%, 36% and 32%. Obviously, the complexity of intra-coding increases far beyond that of inter-coding, and it is currently the most crucial and tricky way to control the complexity of intra-coding.

In the VVC fast algorithm of the conventional method, although the encoding time can be reduced by 20% -50%, the encoding loss is close to or even exceeds 1%. Considering that the overall gain of VVC intra coding compared to HEVC is only 25%, too high coding losses (e.g. greater than 1%) are unacceptable. In other words, these existing algorithms still do not achieve a satisfactory compromise and balance in coding efficiency and coding time.

Disclosure of Invention

In view of this, embodiments of the present invention provide a method, an apparatus, and a medium for optimizing intra-frame coding speed based on historical information, which are fast and have good compatibility.

One aspect of the present invention provides a method for optimizing intra-frame coding speed based on historical information, including:

acquiring a coding unit;

and completing the coding operation on all the coding units.

Optionally, the method further comprises:

and judging each coding unit in the coding tree unit, and judging whether the coding unit is a first round coding unit or a subsequent round coding unit.

Optionally, the correlation indicator includes texture information, a horizontal gradient value and a vertical gradient value, and the calculating the correlation indicator of the first round of encoding units includes:

calculating horizontal gradient values of the first round of coding units;

calculating a vertical gradient value of the first round of encoding units;

calculating texture information of the first round of coding units according to the horizontal gradient value and the vertical gradient value;

wherein, the calculation formula of the horizontal gradient value Gx is as follows:

the calculation formula of the vertical gradient value Gy is as follows;

the texture information T (i, j) is calculated by the following formula:

T(i,j)＝|Gx(i,j)|+|Gy(i,j)|

where P represents a pixel matrix of 3 × 3 size centered on the pixel value of the (i, j) position; (i, j) represents the position of the jth row and ith column in the image.

Optionally, the method further comprises:

in the step of calculating the correlation index of the first round of coding units, an average texture value of a plurality of sub-units in the first round of coding units is calculated.

Optionally, the comparing the calculation result of the correlation index with a dynamic threshold to determine the partition type of the coding unit includes:

for the first round of coding units, when the rate distortion cost is smaller than the rate distortion cost corresponding to the type to be divided, the dynamic threshold value is adjusted;

the adjustment formula of the dynamic threshold is as follows:

where Thr represents the adjusted threshold; thr _ old represents the threshold before adjustment; t represents the average texture value of the first round of coding units.

Optionally, the determining whether the subsequent round of coding units terminates partitioning in advance includes:

in the sub-strategy aiming at the directivity, judging whether a first skipping condition is met, if so, skipping the type to be divided;

in the sub-strategy for the texture difference between the sub-parts, whether a second skipping condition is met is judged, and if yes, the current partition type is terminated in advance.

Optionally, the expression of the first skip condition is:

the expression of the second skip condition is:

Diff_ratio<Thr

wherein the content of the first and second substances,

represents the average horizontal gradient value of the current coding unit;represents the average vertical gradient value of the current coding unit; thr represents the threshold of the decision; BT-V represents a binary tree vertical partition mode; TT-V represents a ternary tree vertical partition mode; BT-H represents a binary tree horizontal division mode; TT-H stands for the horizontal division mode of the ternary tree; diff _ ratio represents the sub-block disparity; ratio _1 represents the sub-block disparity 1; ratio _2 represents the sub-block disparity 2;represents the average texture value of the first sub-block;represents the average texture value of the second sub-block;represents the average texture value of the third sub-block.

Another aspect of the embodiments of the present invention further provides an apparatus for optimizing intra-frame coding speed based on historical information, including:

a first module for obtaining an encoding unit;

and the fifth module is used for finishing the coding operation of all the coding units.

In another aspect, an embodiment of the present invention further provides an electronic device, including a processor and a memory;

the memory is used for storing programs;

the processor executes the program to implement the method as described above.

In another aspect, the present invention provides a computer-readable storage medium, which stores a program, where the program is executed by a processor to implement the method described above.

The embodiment of the invention also discloses a computer program product or a computer program, which comprises computer instructions, and the computer instructions are stored in a computer readable storage medium. The computer instructions may be read by a processor of a computer device from a computer-readable storage medium, and the computer instructions executed by the processor cause the computer device to perform the foregoing method.

The embodiment of the invention firstly obtains a coding unit; when the coding unit is a first round coding unit, calculating a related index of the first round coding unit, and coding the first round coding unit according to the calculation result of the related index; comparing the calculation result of the correlation index with a dynamic threshold value to determine the division type of the coding unit; wherein the dynamic threshold is dynamically adjusted according to historical partition information of the coding units; when the coding unit is a subsequent wheel coding unit, judging whether the subsequent wheel coding unit terminates division in advance; if yes, returning to the step of acquiring the coding unit; otherwise, encoding the subsequent wheel encoding unit; and completing the coding operation on all the coding units. The invention has good compatibility and high speed.

Drawings

In order to more clearly illustrate the technical solutions in the embodiments of the present application, the drawings needed to be used in the description of the embodiments are briefly introduced below, and it is obvious that the drawings in the following description are only some embodiments of the present application, and it is obvious for those skilled in the art to obtain other drawings based on these drawings without creative efforts.

FIG. 1 is an exemplary diagram of different partition combinations of the present invention resulting in the same CU structure;

FIG. 2 is a flowchart illustrating the overall steps of an embodiment of the present invention;

FIG. 3 is a schematic diagram of TT-V division in a CU according to an embodiment of the invention.

Detailed Description

In order to make the objects, technical solutions and advantages of the present application more apparent, the present application is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the present application and are not intended to limit the present application.

The embodiment of the invention provides an intra-frame coding speed optimization method based on historical information, which comprises the following steps: