Image prediction method, encoder, decoder, and storage medium

Document No.: 1956988    Publication date: 2021-12-10

Reading note: This technology, "Image prediction method, encoder, decoder, and storage medium", was designed and created by 万帅, 霍俊彦, 马彦卓 and 张伟 on 2019-10-12. Its main content is as follows: The embodiment of the application discloses an image prediction method, an encoder, a decoder and a storage medium. The method includes: obtaining, through a prediction model, an initial prediction value of an image component to be predicted of a current block in an image; and filtering the initial prediction value to obtain a target prediction value of the image component to be predicted of the current block.

1. An image prediction method applied to an encoder, the method comprising:

obtaining, through a prediction model, an initial prediction value of an image component to be predicted of a current block in an image;

and filtering the initial prediction value to obtain a target prediction value of the image component to be predicted of the current block.

2. The method according to claim 1, wherein filtering the initial prediction value to obtain the target prediction value of the image component to be predicted of the current block comprises:

performing filtering processing on the initial prediction value by using a preset processing mode, wherein the preset processing mode comprises value correction processing;

and obtaining the target prediction value according to the processing result.

3. The method of claim 1, wherein the method further comprises:

and determining a reference value of the image component to be predicted of the current block according to the image component to be referred of the current block.

4. An image prediction method applied to a decoder, the method comprising:

obtaining, through a prediction model, an initial prediction value of an image component to be predicted of a current block in an image;

and filtering the initial prediction value to obtain a target prediction value of the image component to be predicted of the current block.

5. The method according to claim 4, wherein filtering the initial prediction value to obtain the target prediction value of the image component to be predicted of the current block comprises:

performing filtering processing on the initial prediction value by using a preset processing mode, wherein the preset processing mode comprises value correction processing;

and obtaining the target prediction value according to the processing result.

6. The method of claim 4, wherein the method further comprises:

and determining a reference value of the image component to be predicted of the current block according to the image component to be referred of the current block.

7. An encoder comprising a first prediction unit and a first processing unit, wherein,

the first prediction unit is configured to obtain, through a prediction model, an initial prediction value of an image component to be predicted of a current block in the image;

the first processing unit is configured to perform filtering processing on the initial prediction value to obtain a target prediction value of the image component to be predicted of the current block.

8. An encoder comprising a first memory and a first processor, wherein,

the first memory for storing a computer program operable on the first processor;

the first processor, when running the computer program, is configured to perform the method of any of claims 1 to 3.

9. A decoder comprising a second prediction unit and a second processing unit, wherein,

the second prediction unit is configured to obtain, through a prediction model, an initial prediction value of an image component to be predicted of a current block in the image;

and the second processing unit is configured to perform filtering processing on the initial prediction value to obtain a target prediction value of the image component to be predicted of the current block.

10. A decoder comprising a second memory and a second processor, wherein,

the second memory for storing a computer program operable on the second processor;

the second processor, when executing the computer program, is configured to perform the method of any of claims 4 to 6.

11. A computer storage medium, wherein the computer storage medium stores an image prediction program that, when executed by a first processor or a second processor, implements the method of any of claims 1 to 6.

Technical Field

The embodiments of the present application relate to the field of video encoding and decoding technologies, and in particular, to an image prediction method, an encoder, a decoder, and a storage medium.

Background

In the latest video coding standard, H.266/Versatile Video Coding (VVC), cross-component prediction is supported; among such techniques, Cross-component Linear Model Prediction (CCLM) is a typical one. With cross-component prediction, one component can be used to predict another component (or its residual), for example, the luma component can predict a chroma component, a chroma component can predict the luma component, or one chroma component can predict the other chroma component.

Different components have different statistical properties, so their statistical characteristics differ from one another. However, conventional cross-component prediction does not fully account for these differences when performing component prediction, resulting in low prediction efficiency.

Disclosure of Invention

Embodiments of the present application provide an image prediction method, an encoder, a decoder, and a storage medium, which balance the statistical characteristics of the image components after cross-component prediction and can thereby improve not only the prediction efficiency but also the coding and decoding efficiency of a video image.

The technical scheme of the embodiment of the application can be realized as follows:

in a first aspect, an embodiment of the present application provides an image prediction method applied to an encoder or a decoder, where the method includes:

obtaining, through a prediction model, an initial prediction value of an image component to be predicted of a current block in an image;

and filtering the initial prediction value to obtain a target prediction value of the image component to be predicted of the current block.

In a second aspect, an embodiment of the present application provides an encoder, which includes a first prediction unit and a first processing unit, wherein,

the first prediction unit is configured to obtain, through a prediction model, an initial prediction value of an image component to be predicted of a current block in the image;

and the first processing unit is configured to perform filtering processing on the initial prediction value to obtain a target prediction value of the image component to be predicted of the current block.

In a third aspect, embodiments of the present application provide an encoder that includes a first memory and a first processor, wherein,

a first memory for storing a computer program operable on the first processor;

a first processor for performing the method according to the first aspect when running the computer program.

In a fourth aspect, an embodiment of the present application provides a decoder, which includes a second prediction unit and a second processing unit, wherein,

the second prediction unit is configured to obtain, through a prediction model, an initial prediction value of an image component to be predicted of a current block in the image;

and the second processing unit is configured to perform filtering processing on the initial prediction value to obtain a target prediction value of the image component to be predicted of the current block.

In a fifth aspect, an embodiment of the present application provides a decoder, which includes a second memory and a second processor, wherein,

a second memory for storing a computer program operable on the second processor;

a second processor for performing the method according to the first aspect when running the computer program.

In a sixth aspect, the present application provides a computer storage medium storing an image prediction program, which when executed by a first processor or a second processor implements the method according to the first aspect.

The embodiments of the application provide an image prediction method, an encoder, a decoder and a storage medium. An initial prediction value of an image component to be predicted of a current block in an image is obtained through a prediction model; the initial prediction value is then filtered to obtain a target prediction value of the image component to be predicted of the current block. In this way, at least one image component of the current block is further filtered after it has been predicted, which balances the statistical characteristics of the image components after cross-component prediction and improves the prediction efficiency. Because the resulting target prediction value is closer to the true value, the prediction residual of the image component is smaller, fewer bits are transmitted during coding and decoding, and the coding and decoding efficiency of the video image can therefore be improved.

Drawings

Fig. 1 is a schematic structural diagram of a conventional cross-component prediction architecture provided in the related art;

fig. 2 is a block diagram illustrating a video coding system according to an embodiment of the present disclosure;

fig. 3 is a block diagram illustrating a video decoding system according to an embodiment of the present application;

fig. 4 is a schematic flowchart of an image prediction method according to an embodiment of the present disclosure;

fig. 5 is a schematic structural diagram illustrating an improved cross-component prediction architecture according to an embodiment of the present disclosure;

fig. 6 is a schematic structural diagram of an encoder according to an embodiment of the present disclosure;

fig. 7 is a schematic hardware structure diagram of an encoder according to an embodiment of the present disclosure;

fig. 8 is a schematic structural diagram of a decoder according to an embodiment of the present application;

fig. 9 is a schematic hardware structure diagram of a decoder according to an embodiment of the present disclosure.

Detailed Description

So that the manner in which the features and elements of the present embodiments can be understood in detail, a more particular description of the embodiments, briefly summarized above, may be had by reference to embodiments, some of which are illustrated in the appended drawings.

In a video image, a first image component, a second image component and a third image component are generally adopted to characterize a coding block; wherein the three image components are respectively a luminance component, a blue chrominance component and a red chrominance component, and specifically, the luminance component is generally represented by a symbol Y, the blue chrominance component is generally represented by a symbol Cb or U, and the red chrominance component is generally represented by a symbol Cr or V; thus, the video image can be represented in YCbCr format, and also in YUV format.

In the embodiment of the present application, the first image component may be a luminance component, the second image component may be a blue chrominance component, and the third image component may be a red chrominance component, but the embodiment of the present application is not particularly limited.

In order to further improve coding and decoding performance, H.266/VVC introduces the CCLM cross-component prediction technique. CCLM-based cross-component prediction can realize not only prediction from the luminance component to a chrominance component, i.e., from the first image component to the second image component or from the first image component to the third image component, but also prediction from a chrominance component to the luminance component, i.e., from the second image component to the first image component or from the third image component to the first image component, and even prediction between chrominance components, i.e., from the second image component to the third image component or from the third image component to the second image component. In the embodiments of the present application, prediction from the first image component to the second image component is taken as an example, but the technical solution of the embodiments of the present application can equally be applied to the prediction of the other image components.

Referring to Fig. 1, a schematic diagram of the composition of a conventional cross-component prediction architecture provided in the related art is shown. As shown in Fig. 1, a first image component (e.g., represented by the Y component) is used to predict a second image component (e.g., represented by the U component). Assuming that the video image is in YUV 4:2:0 format, the Y component and the U component have different resolutions; the Y component therefore needs to be down-sampled, or the U component up-sampled, to reach the target resolution of the component to be predicted, so that prediction is performed between components of the same resolution. The method of predicting the third image component (e.g., represented by the V component) from the Y component is the same.

In Fig. 1, the conventional cross-component prediction architecture 10 may include a Y-component coding block 110, a resolution adjustment unit 120, a Y1-component coding block 130, a U-component coding block 140, a prediction model 150, and a cross-component prediction unit 160. The Y component of the video image is represented by a Y-component coding block 110 of size 2N × 2N, where the bold larger box highlights the Y-component coding block 110 and the surrounding gray filled circles indicate the adjacent reference values Y(n) of the Y-component coding block 110; the U component of the video image is represented by a U-component coding block 140 of size N × N, where the bold larger box highlights the U-component coding block 140 and the surrounding gray filled circles indicate the adjacent reference values C(n) of the U-component coding block 140. Since the Y component and the U component have different resolutions, the resolution of the Y component needs to be adjusted by the resolution adjustment unit 120 to obtain an N × N Y1-component coding block 130; for the Y1-component coding block 130, the bold larger box highlights the Y1-component coding block 130, and the surrounding gray filled circles indicate its adjacent reference values Y1(n). A prediction model 150 can then be constructed from the adjacent reference values Y1(n) of the Y1-component coding block 130 and the adjacent reference values C(n) of the U-component coding block 140; based on the reconstructed Y-component pixel values of the Y1-component coding block 130 and the prediction model 150, the cross-component prediction unit 160 performs component prediction and finally outputs the U-component prediction value.
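To make this conventional flow concrete, the following Python sketch (the function names and the least-squares fit are illustrative assumptions, not taken from the VVC specification) down-samples the Y coding block, derives the linear model from the neighbouring reference values Y1(n) and C(n), and applies it to produce the U-component prediction:

```python
import numpy as np

def downsample_luma(y_block: np.ndarray) -> np.ndarray:
    """Resolution adjustment unit 120: reduce a 2Nx2N luma block to NxN
    by averaging each 2x2 group (a simple, illustrative down-sampling filter)."""
    h, w = y_block.shape
    return y_block.reshape(h // 2, 2, w // 2, 2).mean(axis=(1, 3))

def fit_linear_model(y1_ref: np.ndarray, c_ref: np.ndarray):
    """Prediction model 150: derive (alpha, beta) from the adjacent reference
    values Y1(n) and C(n) by minimizing the regression error (least squares)."""
    alpha, beta = np.polyfit(y1_ref, c_ref, deg=1)
    return float(alpha), float(beta)

def cclm_predict(y_block: np.ndarray, y1_ref: np.ndarray, c_ref: np.ndarray):
    """Cross-component prediction unit 160: U_pred = alpha * Y1 + beta."""
    y1_block = downsample_luma(y_block)
    alpha, beta = fit_linear_model(y1_ref, c_ref)
    return alpha * y1_block + beta

# Example: an 8x8 luma block predicting a 4x4 chroma block (YUV 4:2:0).
y_block = np.random.randint(0, 256, (8, 8)).astype(float)
y1_ref = np.random.randint(0, 256, 8).astype(float)   # down-sampled neighbours Y1(n)
c_ref = 0.5 * y1_ref + 10.0                           # neighbouring chroma C(n)
u_pred = cclm_predict(y_block, y1_ref, c_ref)         # initial U-component prediction
```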

With the conventional cross-component prediction architecture 10, image component prediction is performed without fully accounting for, e.g., the differences in statistical characteristics among the image components, so the prediction efficiency is low. To improve the prediction efficiency, the embodiments of the application provide an image prediction method: first, an initial prediction value of the image component to be predicted of the current block in an image is obtained through a prediction model; then, the initial prediction value is filtered to obtain a target prediction value of the image component to be predicted of the current block. In this way, at least one image component of the current block is further filtered after it has been predicted, which balances the statistical characteristics of the image components after cross-component prediction, thereby improving the prediction efficiency as well as the coding and decoding efficiency of the video image.

Embodiments of the present application will be described in detail below with reference to the accompanying drawings.

Referring to Fig. 2, a block diagram of an example of a video coding system provided in an embodiment of the present application is shown. As shown in Fig. 2, the video coding system 20 includes a transform and quantization unit 201, an intra estimation unit 202, an intra prediction unit 203, a motion compensation unit 204, a motion estimation unit 205, an inverse transform and inverse quantization unit 206, a filter control analysis unit 207, a filtering unit 208, a coding unit 209, a decoded picture buffer unit 210, and the like; the filtering unit 208 can implement deblocking filtering and Sample Adaptive Offset (SAO) filtering, and the coding unit 209 can implement header information coding and Context-based Adaptive Binary Arithmetic Coding (CABAC). For an input original video signal, a coding block can be obtained by partitioning a Coding Tree Unit (CTU); the residual pixel information obtained by intra-frame or inter-frame prediction of the coding block is then processed by the transform and quantization unit 201, which transforms the residual pixel information from the pixel domain to the transform domain and quantizes the resulting transform coefficients to further reduce the bit rate. The intra estimation unit 202 and the intra prediction unit 203 are used for intra prediction of the coding block; in particular, they determine the intra prediction mode to be used to encode the coding block. The motion compensation unit 204 and the motion estimation unit 205 perform inter-prediction coding of the received coding block relative to one or more blocks in one or more reference frames to provide temporal prediction information; the motion estimation performed by the motion estimation unit 205 is the process of generating a motion vector that estimates the motion of the coding block, and the motion compensation unit 204 then performs motion compensation based on the motion vector determined by the motion estimation unit 205. After determining the intra prediction mode, the intra prediction unit 203 also supplies the selected intra prediction data to the coding unit 209, and the motion estimation unit 205 sends the calculated motion vector data to the coding unit 209. In addition, the inverse transform and inverse quantization unit 206 is used for reconstruction of the coding block: a residual block is reconstructed in the pixel domain, blocking artifacts are removed through the filter control analysis unit 207 and the filtering unit 208, and the reconstructed residual block is then added to a predictive block in a frame of the decoded picture buffer unit 210 to generate a reconstructed video block. The coding unit 209 encodes the various coding parameters and the quantized transform coefficients; in the CABAC-based coding algorithm, the context may be based on adjacent coding blocks and may be used to encode information indicating the determined intra prediction mode; the coding unit 209 then outputs the code stream of the video signal. The decoded picture buffer unit 210 stores reconstructed video blocks for prediction reference. As video coding proceeds, new reconstructed video blocks are generated and stored in the decoded picture buffer unit 210.

Referring to Fig. 3, a block diagram of an example of a video decoding system provided in an embodiment of the present application is shown. As shown in Fig. 3, the video decoding system 30 includes a decoding unit 301, an inverse transform and inverse quantization unit 302, an intra prediction unit 303, a motion compensation unit 304, a filtering unit 305, a decoded picture buffer unit 306, and the like; the decoding unit 301 can implement header information decoding and CABAC decoding, and the filtering unit 305 can implement deblocking filtering and SAO filtering. After the input video signal has gone through the encoding process of Fig. 2, the code stream of the video signal is output; the code stream is input into the video decoding system 30 and first passes through the decoding unit 301 to obtain the decoded transform coefficients. The transform coefficients are processed by the inverse transform and inverse quantization unit 302 to produce a residual block in the pixel domain. The intra prediction unit 303 may be used to generate prediction data for the video block currently being decoded, based on the determined intra prediction mode and data from previously decoded blocks of the current frame or picture. The motion compensation unit 304 determines prediction information for the video block to be decoded by parsing the motion vector and other associated syntax elements, and uses the prediction information to generate the predictive block of the video block being decoded. A decoded video block is formed by summing the residual block from the inverse transform and inverse quantization unit 302 with the corresponding predictive block generated by the intra prediction unit 303 or the motion compensation unit 304. The decoded video block passes through the filtering unit 305 to remove blocking artifacts, which improves the video quality, and is then stored in the decoded picture buffer unit 306; the decoded picture buffer unit 306 stores reference pictures for subsequent intra prediction or motion compensation and is also used for the output of the video signal, i.e., the restored original video signal is obtained.

The embodiment of the present application is mainly applied to the portion of the intra prediction unit 203 shown in fig. 2 and the portion of the intra prediction unit 303 shown in fig. 3; that is, the embodiments of the present application may be applied to a video encoding system and a video decoding system, and the embodiments of the present application are not particularly limited.

Based on the application scenario example of fig. 2 or fig. 3, referring to fig. 4, a flowchart of an image prediction method provided in an embodiment of the present application is shown, where the method may include:

s401: obtaining an initial prediction value of a current block to be predicted of an image component in an image through a prediction model;

s402: and filtering the initial prediction value to obtain a target prediction value of the current block to-be-predicted image component.

It should be noted that a video image may be divided into a plurality of image blocks, and each image block to be encoded may be referred to as a coding block. Each coding block may include a first image component, a second image component and a third image component, and the current block is the coding block in the video image whose first image component, second image component or third image component is currently to be predicted.

It should be noted that the image prediction method in the embodiments of the present application may be applied to a video coding system, to a video decoding system, or even to both simultaneously, which is not limited in the embodiments of the present application.

In the embodiments of the application, an initial prediction value of the image component to be predicted of the current block in an image is first obtained through a prediction model; the initial prediction value is then filtered to obtain a target prediction value of the image component to be predicted of the current block. In this way, at least one image component is further filtered after it has been predicted, which balances the statistical characteristics of the image components after cross-component prediction, thereby improving the prediction efficiency as well as the coding and decoding efficiency of the video image.

Further, different image components have different statistical characteristics; for example, the luminance component has rich texture characteristics, while the chrominance component tends to be more uniform and flat. In order to better balance the statistical characteristics of the image components after cross-component prediction, characteristic statistics need to be performed on at least one image component of the current block. Therefore, in some embodiments, for S402, before the filtering of the initial prediction value, the method may further include:

performing characteristic statistics on at least one image component of the current block; wherein the at least one image component comprises an image component to be predicted and/or an image component to be referenced, the image component to be predicted being different from the image component to be referenced;

according to the result of the characteristic statistics, acquiring a reference value of the image component to be predicted of the current block and/or a reference value of the image component to be referred of the current block; the image component to be predicted is a component predicted when the prediction model is built, and the image component to be referred is a component used for prediction when the prediction model is built.

It should be noted that the at least one image component of the current block may be the image component to be predicted, the image component to be referred, or both. Assuming that prediction from the first image component to the second image component is realized through the prediction model, the image component to be predicted is the second image component and the image component to be referred is the first image component; alternatively, assuming that prediction from the first image component to the third image component is realized through the prediction model, the image component to be predicted is the third image component and the image component to be referred is still the first image component.

In this way, by performing characteristic statistics on at least one image component of the current block, according to the result of the characteristic statistics, the reference value of the image component to be predicted of the current block and/or the reference value of the image component to be referenced of the current block can be obtained.

In the embodiments of the application, an initial prediction value of the image component to be predicted of the current block in the image is obtained through the prediction model. Characteristic statistics are performed on at least one image component of the current block, where the at least one image component includes the image component to be predicted and/or the image component to be referred. A reference value of the image component to be predicted of the current block and/or a reference value of the image component to be referred of the current block is obtained according to the result of the characteristic statistics, where the image component to be predicted is the component predicted when the prediction model is built, and the image component to be referred is the component used for prediction when the prediction model is built. The initial prediction value is then filtered to obtain a target prediction value of the image component to be predicted of the current block.

Further, in order to improve the prediction efficiency, the initial prediction value of the image component to be predicted of the current block may be filtered according to the reference value of the image component to be predicted of the current block and/or the reference value of the image component to be referenced of the current block.

In some embodiments, processing the initial prediction value corresponding to the at least one image component according to the reference value of the at least one image component may include:

according to the reference value of the image component to be predicted of the current block and/or the reference value of the image component to be referred of the current block, filtering the initial predicted value by using a preset processing mode, wherein the preset processing mode at least comprises one of the following modes: filtering, grouping, value correcting, quantizing and de-quantizing;

and obtaining the target predicted value according to the processing result.

It should be noted that, after the reference value of the image component to be predicted of the current block and/or the reference value of the image component to be referred of the current block is obtained according to the result of the characteristic statistics of at least one image component of the current block, the initial prediction value may be filtered using a preset processing mode. Specifically, the initial prediction value may be processed by filtering, by grouping, by value correction, by quantization, or by inverse quantization (also referred to as dequantization), and the like, which is not specifically limited in the embodiments of the present application.

For example, in order to improve the prediction efficiency, i.e., the accuracy of the prediction value, assume that the luminance component is used to predict the chrominance component, and consider the initial prediction value of the chrominance component obtained through the prediction model. If the preset processing mode is value correction processing, then, because the luminance component and the chrominance component have different statistical characteristics, a deviation factor can be obtained from the difference between the statistical characteristics of the two image components; the initial prediction value is then corrected with the deviation factor (for example, the deviation factor is added to the initial prediction value) to balance the statistical characteristics among the image components after cross-component prediction, so that the target prediction value of the chrominance component is obtained, and this target prediction value is closer to the true value of the chrominance component. If the preset processing mode is filtering processing, the initial prediction value can be filtered according to the difference between the statistical characteristics of the two image components to balance the statistical characteristics among the image components after cross-component prediction, so that a target prediction value of the chrominance component that is closer to its true value is obtained. If the preset processing mode is grouping processing, the initial prediction value can be grouped according to the difference between the statistical characteristics of the two image components, and the target prediction value of the chrominance component can be obtained from the grouped initial prediction values; again, the target prediction value is closer to the true value of the chrominance component. In addition, since quantization and inverse quantization of the luminance component and the chrominance component are involved in determining the initial prediction value, and since the two components have different statistical characteristics, there may be a mismatch between quantization and inverse quantization. If the preset processing mode is quantization processing, the initial prediction value can be quantized to balance the statistical characteristics among the image components after cross-component prediction; if the preset processing mode is inverse quantization processing, the initial prediction value can be dequantized for the same purpose. In both cases a target prediction value of the chrominance component that is closer to its true value is obtained. In this way, the accuracy of the prediction value is improved, and so is the prediction efficiency.
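As one hedged illustration of the value-correction branch above (the way the deviation factor is estimated here is an assumption made for the sketch; the embodiment does not mandate a specific formula), the offset can be taken as the mean difference between the neighbouring chroma reference values and the chroma values the prediction model produces for those same neighbours:

```python
import numpy as np

def value_correction(initial_pred: np.ndarray,
                     c_ref: np.ndarray,
                     c_ref_pred: np.ndarray) -> np.ndarray:
    """Add a deviation factor to the initial chroma prediction.

    c_ref      -- adjacent reference values of the component to be predicted
    c_ref_pred -- values the prediction model gives for those same neighbours
    The mean difference between them serves as the deviation factor (an
    illustrative choice), which is added to the initial prediction so that
    the statistics of the predicted component better match the real data.
    """
    deviation = float(np.mean(c_ref - c_ref_pred))
    return initial_pred + deviation
```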

In the embodiments of the application, an initial prediction value of the image component to be predicted of the current block in the image is obtained through the prediction model. Characteristic statistics are performed on at least one image component of the current block, where the at least one image component includes the image component to be predicted and/or the image component to be referred. A reference value of the image component to be predicted of the current block and/or a reference value of the image component to be referred of the current block is obtained according to the result of the characteristic statistics, where the image component to be predicted is the component predicted when the prediction model is built, and the image component to be referred is the component used for prediction when the prediction model is built. The initial prediction value is filtered using a preset processing mode according to the reference value of the image component to be predicted of the current block and/or the reference value of the image component to be referred of the current block, where the preset processing mode includes at least one of the following: filtering processing, grouping processing, value modification processing, quantization processing and dequantization processing. The target prediction value is obtained according to the processing result.

Further, in order to improve the prediction efficiency, the initial prediction residual of the image component to be predicted of the current block may be filtered according to the reference value of the image component to be predicted of the current block and/or the reference value of the image component to be referenced of the current block.

In some embodiments, for S401, after obtaining, through the prediction model, the initial prediction value of the image component to be predicted of the current block in the image, the method may further include:

calculating an initial prediction residual of the image component to be predicted of the current block based on the initial prediction value;

performing filtering processing on the initial prediction residual by using a preset processing mode according to the reference value of the image component to be predicted of the current block and/or the reference value of the image component to be referred of the current block, wherein the preset processing mode at least comprises one of the following modes: filtering, grouping, value correcting, quantizing and de-quantizing;

and obtaining the target prediction residual according to the processing result.

Further, in some embodiments, for S402, obtaining the target prediction value of the image component to be predicted of the current block may include:

calculating a target prediction value of the image component to be predicted of the current block according to the target prediction residual.

It should be noted that the prediction residual is obtained from a difference between a predicted value of the image component and a true value of the image component; in order to improve the coding efficiency of the video image, it is necessary to ensure that the prediction residual transmitted by the current block is as small as possible.

In order to make the prediction residual as small as possible, on the one hand, after the initial prediction value corresponding to the image component to be predicted is obtained through the prediction model, the initial prediction value can be filtered according to a preset processing mode to obtain a target prediction value of the image component to be predicted; this target prediction value is as close as possible to the true value of the image component to be predicted, so that the prediction residual between the two is as small as possible. On the other hand, after the initial prediction value corresponding to the image component to be predicted is obtained through the prediction model, the initial prediction residual of the image component to be predicted is first determined from the difference between the initial prediction value and the true value of the image component to be predicted; the initial prediction residual is then filtered according to the preset processing mode to obtain a target prediction residual of the image component to be predicted, and the target prediction value of the image component to be predicted can be obtained according to the target prediction residual. Because the target prediction residual is as small as possible, the target prediction value of the image component to be predicted is as close as possible to its true value. That is to say, the embodiments of the present application can be applied not only to filtering the initial prediction value of the image component to be predicted of the current block, but also to filtering the initial prediction residual of the image component to be predicted of the current block. After the filtering processing, the statistical characteristics of the image components after cross-component prediction can be balanced, which improves the prediction efficiency; and because the resulting target prediction value is closer to the true value, the prediction residual of the image component to be predicted is smaller, fewer bits are transmitted in the encoding and decoding process, and the coding and decoding efficiency of the video image is also improved.
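The two alternatives described in this paragraph can be summarised as follows (an encoder-side sketch under the residual definition residual = true value − prediction; `post_filter` stands for whichever preset processing mode is selected, and all names are illustrative):

```python
import numpy as np

def target_from_filtered_prediction(initial_pred: np.ndarray, post_filter) -> np.ndarray:
    """Variant 1: filter the initial prediction value directly to obtain
    the target prediction value."""
    return post_filter(initial_pred)

def target_from_filtered_residual(initial_pred: np.ndarray,
                                  true_value: np.ndarray,
                                  post_filter) -> np.ndarray:
    """Variant 2 (encoder side, where the true value is available): filter the
    initial prediction residual, then derive the target prediction value
    from the target prediction residual."""
    initial_residual = true_value - initial_pred      # initial prediction residual
    target_residual = post_filter(initial_residual)   # target prediction residual
    return true_value - target_residual               # target prediction value
```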

Further, before the initial prediction value of the image component to be predicted of the current block is obtained, the model parameters of the prediction model also need to be determined in order to construct the prediction model. Therefore, in some embodiments, for S401, before obtaining, through the prediction model, the initial prediction value of the image component to be predicted of the current block in the image, the method may further include:

determining a reference value of the image component to be predicted of the current block, wherein the reference value of the image component to be predicted of the current block is the value of the image component to be predicted of pixels adjacent to the current block;

determining a reference value of an image component to be referred of the current block, wherein the image component to be referred of the current block is different from the image component to be predicted, and the reference value of the image component to be referred of the current block is the reference image component value of the adjacent pixel of the current block;

calculating a model parameter of the prediction model according to the reference value of the image component to be predicted of the current block and the reference value of the image component to be referred of the current block;

and constructing the prediction model according to the model parameters obtained by calculation, wherein the prediction model is used for performing cross-component prediction processing on the component of the current block to be predicted according to the component of the current block to be referred to.

It should be noted that the prediction model in the embodiments of the present application may be a linear model, such as the CCLM cross-component prediction technique; the prediction model may also be a non-linear model, such as the Multi-Model CCLM (MMLM) cross-component prediction technique, which is composed of a plurality of linear models. The embodiments of the present application are described below by taking a linear prediction model as an example, but the image prediction method of the embodiments of the present application can equally be applied to a non-linear model.

The model parameters include a first model parameter (denoted by α) and a second model parameter (denoted by β). The calculation for α and β may be in various manners, and may be a preset factor calculation model constructed by a least square method, a preset factor calculation model constructed by a maximum value and a minimum value, or even a preset factor calculation model constructed in other manners, which is not specifically limited in the embodiment of the present application.

Taking a preset factor calculation model constructed by the least squares method as an example: first, the reference value of the image component to be predicted of the current block and the reference value of the image component to be referred of the current block need to be determined, where the reference value of the image component to be referred of the current block may be the reference image component value of pixels adjacent to the current block (such as the adjacent reference values of the first image component), and the reference value of the image component to be predicted of the current block may be the to-be-predicted image component value of pixels adjacent to the current block (such as the adjacent reference values of the second image component). The model parameters of the prediction model are then derived by minimizing the regression error between the adjacent reference values of the first image component and the adjacent reference values of the second image component, specifically as shown in formula (1):
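Assuming the standard least-squares (minimized regression error) derivation used by CCLM, with the sums taken over the 2N pairs of adjacent reference values, formula (1) can be written as:

α = ( 2N·Σ L(n)·C(n) − Σ L(n) · Σ C(n) ) / ( 2N·Σ L(n)·L(n) − ( Σ L(n) )² )    (1)

β = ( Σ C(n) − α·Σ L(n) ) / 2N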

where L(n) denotes the down-sampled adjacent reference values of the first image component corresponding to the left side and the upper side of the current block, C(n) denotes the adjacent reference values of the second image component corresponding to the left side and the upper side of the current block, N is the side length of the current block of the second image component, and n = 1, 2, ..., 2N. The first model parameter α and the second model parameter β can be obtained through the calculation of formula (1). Assuming that the second image component is predicted from the first image component based on α and β, the constructed prediction model is as shown in equation (2):

Pred_C[i, j] = α·Rec_L[i, j] + β    (2)

where i and j denote the position coordinates of a pixel in the current block, i being the horizontal direction and j the vertical direction; Pred_C[i, j] denotes the prediction value of the second image component for the pixel at position [i, j] in the current block, and Rec_L[i, j] denotes the (down-sampled) reconstructed value of the first image component for the pixel at position [i, j] in the same current block.

Taking a preset factor calculation model constructed from the maximum and minimum values as an example, a simplified derivation of the model parameters is provided: by searching for the maximum adjacent reference value and the minimum adjacent reference value of the first image component, the model parameters of the prediction model can be derived according to the principle that "two points determine a line", as shown in formula (3):
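Assuming the standard two-point derivation used in VVC CCLM, formula (3) can be written as:

α = ( C_max − C_min ) / ( L_max − L_min )    (3)

β = C_min − α·L_min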

where L_max and L_min denote the maximum and minimum values found by searching the adjacent reference values of the first image component corresponding to the left side and the upper side of the down-sampled current block, and C_max and C_min denote the adjacent reference values of the second image component at the positions corresponding to the reference pixels of L_max and L_min. From L_max, L_min, C_max and C_min, the first model parameter α and the second model parameter β can also be obtained through the calculation of formula (3); based on α and β, assuming that the second image component is predicted from the first image component, the constructed prediction model is still as shown in equation (2) above.

After the prediction model is constructed, image components can be predicted according to the prediction model. For example, according to the prediction model shown in equation (2), the second image component may be predicted from the first image component, for example the chrominance component may be predicted from the luminance component to obtain an initial prediction value of the chrominance component, and the initial prediction value may then be filtered using a preset processing mode according to the reference value of the luminance component and/or the reference value of the chrominance component to obtain a target prediction value of the chrominance component. The first image component may also be predicted from the second image component, for example the luminance component may be predicted from the chrominance component to obtain an initial prediction value of the luminance component, and the initial prediction value may then be filtered using a preset processing mode according to the reference value of the luminance component and/or the reference value of the chrominance component to obtain a target prediction value of the luminance component. The third image component may even be predicted from the second image component, for example the red chrominance component may be predicted from the blue chrominance component to obtain an initial prediction value of the red chrominance component, and the initial prediction value may then be filtered using a preset processing mode according to the reference value of the blue chrominance component and/or the reference value of the red chrominance component to obtain a target prediction value of the red chrominance component. In each case the purpose of improving the prediction efficiency can be achieved.
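A compact Python sketch of the two-point derivation of formula (3) and of applying the prediction model of equation (2) follows (the function names and the handling of the degenerate flat-luma case are illustrative assumptions):

```python
import numpy as np

def derive_two_point(l_ref: np.ndarray, c_ref: np.ndarray):
    """Formula (3): derive (alpha, beta) from the maximum and minimum
    adjacent reference values of the first image component."""
    i_max, i_min = int(np.argmax(l_ref)), int(np.argmin(l_ref))
    l_max, l_min = l_ref[i_max], l_ref[i_min]
    c_max, c_min = c_ref[i_max], c_ref[i_min]
    if l_max == l_min:                       # flat neighbours: fall back to the mean
        return 0.0, float(np.mean(c_ref))
    alpha = (c_max - c_min) / (l_max - l_min)
    beta = c_min - alpha * l_min
    return float(alpha), float(beta)

def apply_prediction_model(rec_l: np.ndarray, alpha: float, beta: float) -> np.ndarray:
    """Equation (2): Pred_C[i, j] = alpha * Rec_L[i, j] + beta."""
    return alpha * rec_l + beta
```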

Further, the image components do not all have the same resolution; in order to construct the prediction model, the resolution of the image components needs to be adjusted (by up-sampling or down-sampling) to reach the target resolution of the image component to be predicted.

Optionally, in some embodiments, before the calculating the model parameters of the prediction model, the method may further include:

when the resolution of the image component to be predicted of the current block is different from the resolution of the image component to be referred to of the current block, performing resolution adjustment on the resolution of the image component to be referred to; wherein the resolution adjustment comprises an upsampling adjustment or a downsampling adjustment;

updating the reference value of the image component to be referred of the current block based on the adjusted resolution of the image component to be referred to obtain a first reference value of the image component to be referred of the current block; and the adjusted resolution of the image component to be referred is the same as the resolution of the image component to be predicted.

Optionally, in some embodiments, before the calculating the model parameters of the prediction model, the method may further include:

when the resolution of the image component to be predicted of the current block is different from the resolution of the image component to be referred to of the current block, adjusting the reference value of the image component to be referred to of the current block to obtain a first reference value of the image component to be referred to of the current block, wherein the adjusting includes one of the following steps: down-sampling filtering, up-sampling filtering, cascaded filtering of down-sampling filtering and low-pass filtering, and cascaded filtering of up-sampling filtering and low-pass filtering.

It should be noted that, if the resolution of the image component to be predicted of the current block is different from the resolution of the image component to be referred to of the current block, the resolution of the image component to be referred to may be adjusted, so that the adjusted resolution of the image component to be referred to is the same as the resolution of the image component to be predicted; the resolution adjustment here includes up-sampling adjustment or down-sampling adjustment; and updating the reference value of the image component to be referred of the current block according to the adjusted resolution of the image component to be referred to, so as to obtain the first reference value of the image component to be referred of the current block.

In addition, if the resolution of the image component to be predicted of the current block is different from the resolution of the image component to be referred to of the current block, the reference value of the image component to be referred to of the current block can be adjusted to obtain a first reference value of the image component to be referred to of the current block; the adjustment process here includes one of the following: down-sampling filtering, up-sampling filtering, cascaded filtering of down-sampling filtering and low-pass filtering, and cascaded filtering of up-sampling filtering and low-pass filtering.
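The adjustment options listed above can be sketched as follows (the specific filter taps are illustrative assumptions; VVC defines its own down-sampling filters for CCLM):

```python
import numpy as np

def low_pass_3tap(x: np.ndarray) -> np.ndarray:
    """Simple horizontal [1, 2, 1]/4 low-pass filter (illustrative)."""
    padded = np.pad(x, ((0, 0), (1, 1)), mode='edge')
    return (padded[:, :-2] + 2 * padded[:, 1:-1] + padded[:, 2:]) / 4.0

def adjust_reference(ref_plane: np.ndarray, mode: str) -> np.ndarray:
    """Adjust the reference component so that its resolution matches the
    component to be predicted; `mode` selects one of the four options."""
    if mode == "down":                       # down-sampling filtering (2x2 average)
        h, w = ref_plane.shape
        cropped = ref_plane[:h - h % 2, :w - w % 2]
        return cropped.reshape(cropped.shape[0] // 2, 2,
                               cropped.shape[1] // 2, 2).mean(axis=(1, 3))
    if mode == "up":                         # up-sampling filtering (nearest neighbour)
        return np.repeat(np.repeat(ref_plane, 2, axis=0), 2, axis=1)
    if mode == "down+lp":                    # cascade of low-pass and down-sampling filtering
        return adjust_reference(low_pass_3tap(ref_plane), "down")
    if mode == "up+lp":                      # cascade of up-sampling and low-pass filtering
        return low_pass_3tap(adjust_reference(ref_plane, "up"))
    return ref_plane
```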

Further, in some embodiments, the calculating the model parameter of the prediction model according to the reference value of the image component to be predicted of the current block and the reference value of the image component to be referred to of the current block may include:

and calculating the model parameters of the prediction model according to the reference value of the image component to be predicted of the current block and the first reference value of the image component to be referred of the current block.

It should be noted that, if the resolution of the image component to be predicted of the current block is different from the resolution of the image component to be referred of the current block, then after the updated first reference value of the image component to be referred of the current block is obtained, the model parameters of the prediction model may be calculated according to the reference value of the image component to be predicted of the current block and the first reference value of the image component to be referred of the current block.

For example, assume that the chrominance component is predicted using the luminance component, i.e., the image component to be referred is the luminance component and the image component to be predicted is the chrominance component. Since the luminance component and the chrominance component have different resolutions, once the target resolution of the chrominance component is obtained, the resolution of the luminance component does not match the target resolution and therefore needs to be adjusted, for example by down-sampling the luminance component so that its adjusted resolution matches the target resolution. Conversely, if the luminance component is predicted using the chrominance component, once the target resolution of the luminance component is obtained, the resolution of the chrominance component does not match the target resolution and needs to be adjusted, for example by up-sampling the chrominance component so that its adjusted resolution matches the target resolution. In addition, if the red chrominance component is predicted using the blue chrominance component, once the target resolution of the red chrominance component is obtained, the resolution of the blue chrominance component already matches the target resolution and does not need to be adjusted. In this way, the updated first reference value of the image component to be referred of the current block can be obtained at the same resolution, and the prediction model can be constructed to predict the image component.

In addition, in order to improve the prediction efficiency, the initial prediction value of the to-be-predicted image component of the current block may be filtered according to the reference value of the to-be-predicted image component of the current block.

In some embodiments, for S402, the filtering the initial prediction value may include:

filtering the initial prediction value according to the reference value of the image component to be predicted of the current block, to obtain the target prediction value; wherein the reference value of the image component to be predicted of the current block is obtained by performing characteristic statistics on the image component to be predicted of the image or on the image component to be predicted of the current block.

Further, filtering the initial prediction value according to the reference value of the image component to be predicted of the current block may include:

filtering the initial prediction value by using a preset processing mode according to the reference value of the image component to be predicted of the current block; wherein the preset processing mode at least comprises one of the following modes: filtering processing, grouping processing, value modification processing, quantization processing, inverse quantization processing, low-pass filtering processing and adaptive filtering processing.

In some embodiments, for S402, the filtering the initial prediction value includes:

calculating an initial prediction residual of the image component to be predicted of the current block by using the initial prediction value;

filtering the initial prediction residual by using a preset processing mode according to the reference value of the image component to be predicted of the current block; wherein the preset processing mode at least comprises one of the following modes: filtering processing, grouping processing, value modification processing, quantization processing, inverse quantization processing, low-pass filtering processing and adaptive filtering processing.

It should be noted that the preset processing mode may be filtering processing, grouping processing, value modification processing, quantization processing, inverse quantization processing, low-pass filtering processing, adaptive filtering processing, or the like. In addition, the reference value of the image component to be predicted of the current block may be obtained by performing characteristic statistics on the image component to be predicted of the image or on the image component to be predicted of the current block; that is, the characteristic statistics are not limited to the image component to be predicted of the current block and may also be extended to the image component to be predicted of the image to which the current block belongs.

In this way, for the filtering process, after the initial prediction value of the image component to be predicted of the current block is obtained, the initial prediction value can be filtered by using a preset processing mode according to the reference value of the image component to be predicted of the current block, so as to obtain the target prediction value. Alternatively, the initial prediction residual of the image component to be predicted of the current block may be calculated by using the initial prediction value, and the initial prediction residual is then filtered by using a preset processing mode according to the reference value of the image component to be predicted of the current block, so as to obtain a target prediction residual, from which the target prediction value can also be obtained.
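
A minimal sketch of these two processing paths is given below. It assumes that value correction is a simple clip to the legal sample range, that the reference value is a scalar statistic (for example the mean of the neighbouring samples of the component to be predicted), and that the residual filtering is a dead-zone thresholding; these concrete choices and the function names are assumptions made only for illustration.

```python
import numpy as np

def clip_correction(pred: np.ndarray, bit_depth: int = 10) -> np.ndarray:
    """Value-correction processing: clamp predicted samples to the legal range."""
    return np.clip(pred, 0, (1 << bit_depth) - 1)

def filter_initial_prediction(init_pred, ref_value, bit_depth=10):
    """Path 1: filter the initial predicted value directly.  'ref_value' is a
    statistic of the component to be predicted; here it steers a mild smoothing
    of the prediction toward that statistic (illustrative only)."""
    corrected = 0.875 * init_pred + 0.125 * ref_value
    return clip_correction(corrected, bit_depth)

def filter_initial_residual(init_pred, original_block, ref_value, bit_depth=10):
    """Path 2 (encoder-side view): compute the initial prediction residual from
    the initial predicted value, filter it into a target residual, and derive
    the target predicted value from that target residual."""
    init_residual = original_block - init_pred
    threshold = 0.01 * ref_value                      # hypothetical: tie the dead-zone width to the statistic
    target_residual = np.where(np.abs(init_residual) <= threshold, 0, init_residual)
    target_pred = clip_correction(original_block - target_residual, bit_depth)
    return target_pred, target_residual
```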

In order to improve the prediction efficiency, the initial prediction value of the image component to be predicted of the current block can be filtered according to the reference value of the image component to be predicted of the current block and the reference value of the image component to be referred of the current block.

In some embodiments, for S401, before obtaining the initial prediction value of the current block to be predicted of the image component in the image through the prediction model, the method may further include:

carrying out characteristic statistics on the image component to be predicted of the image;

determining a reference value of the image component to be predicted of the current block and a reference value of the image component to be referred of the current block according to the result of the characteristic statistics; wherein the image component to be referred is different from the image component to be predicted;

and calculating the model parameters of the prediction model according to the reference value of the image component to be predicted of the current block and the reference value of the image component to be referred of the current block.

Further, in some embodiments, the method may further comprise:

according to the reference value of the image component to be predicted of the current block and the reference value of the image component to be referred of the current block, filtering the initial predicted value by using a preset processing mode, wherein the preset processing mode at least comprises one of the following modes: filtering processing, grouping processing, value modification processing, quantization processing, inverse quantization processing, low-pass filtering processing and adaptive filtering processing.

It should be noted that different image components have different statistical characteristics; for example, the luminance component has rich texture, while the chrominance component tends to be more uniform and flat. In order to better balance the statistical characteristics of the image components after cross-component prediction, characteristic statistics need to be performed on at least one image component of the current block, for example on the image component to be predicted of the image. Then, according to the result of the characteristic statistics, the reference value of the image component to be predicted of the current block and the reference value of the image component to be referred to of the current block are determined. Based on these two reference values, the model parameters of the prediction model can be calculated to construct the prediction model, and the initial prediction value can be filtered, so that the statistical characteristics of the image components after cross-component prediction are balanced and the prediction efficiency is improved.
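
The embodiment does not fix how the model parameters are derived from the two sets of reference values. The sketch below assumes the familiar linear model pred_C = alpha * rec_Y' + beta and a least-squares fit over the neighbouring reference samples, which is one common way to realise the calculation described above; the function names are hypothetical.

```python
import numpy as np

def fit_linear_model(ref_luma: np.ndarray, ref_chroma: np.ndarray):
    """Least-squares fit of (alpha, beta) for pred_C = alpha * rec_Y' + beta,
    using the resolution-matched neighbouring reference samples of the two
    components (1-D arrays of equal length)."""
    x = ref_luma.astype(np.float64)
    y = ref_chroma.astype(np.float64)
    var_x = np.var(x)
    if var_x == 0:                                   # flat neighbourhood: fall back to DC prediction
        return 0.0, float(np.mean(y))
    alpha = float(np.cov(x, y, bias=True)[0, 1] / var_x)
    beta = float(np.mean(y) - alpha * np.mean(x))
    return alpha, beta

def predict_chroma(rec_luma_block: np.ndarray, alpha: float, beta: float) -> np.ndarray:
    """Apply the prediction model to the resolution-matched reconstructed luma block."""
    return alpha * rec_luma_block + beta
```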

For example, refer to fig. 5, which shows a schematic structural diagram of an improved cross-component prediction architecture provided in an embodiment of the present application. As shown in fig. 5, based on the conventional cross component prediction architecture 10 shown in fig. 1, the improved cross component prediction architecture 50 may further include a processing unit 510, where the processing unit 510 is mainly configured to perform correlation processing on the predicted value after passing through the cross component prediction unit 160 to obtain a more accurate target predicted value.

In fig. 5, assume that for the current block the U component is predicted from the Y component. Since the Y component coding block 110 and the U component coding block 140 have different resolutions, the resolution adjustment unit 120 needs to adjust the resolution of the Y component to obtain a Y1 component coding block 130 with the same resolution as the U component coding block 140. A prediction model 150 can then be constructed using the neighbouring reference values Y1(n) of the Y1 component coding block 130 and the neighbouring reference values C(n) of the U component coding block 140. According to the reconstructed Y component pixel values of the Y1 component coding block 130 and the prediction model 150, the cross component prediction unit 160 performs image component prediction to obtain an initial prediction value of the U component. In order to improve the prediction efficiency, the processing unit 510 may further perform related processing on the initial prediction value of the U component, such as filtering processing, grouping processing, value modification processing, quantization processing, inverse quantization processing, and the like, so as to obtain a target prediction value of the U component. Because the target prediction value of the U component is closer to the true value of the U component, the prediction efficiency is improved, and the coding and decoding efficiency of the video image is also improved.
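
The following sketch ties the architecture of fig. 5 together, reusing the hypothetical helpers sketched earlier (downsample_luma_2x, fit_linear_model, predict_chroma, filter_initial_prediction); the post-filtering call stands in for processing unit 510. It is a schematic view under those assumptions, not the definitive implementation.

```python
import numpy as np

def cross_component_predict_u(rec_y_block, y1_neighbours, u_neighbours, u_ref_stat, bit_depth=10):
    """End-to-end sketch of the improved architecture in fig. 5.

    rec_y_block   : reconstructed Y samples of the current block (full resolution)
    y1_neighbours : neighbouring reference values Y1(n), already resolution-matched
    u_neighbours  : neighbouring reference values C(n) of the U coding block
    u_ref_stat    : statistic of the U component used by the post-processing step
    """
    y1_block = downsample_luma_2x(rec_y_block)                   # resolution adjustment unit 120
    alpha, beta = fit_linear_model(y1_neighbours, u_neighbours)  # prediction model 150
    init_u_pred = predict_chroma(y1_block, alpha, beta)          # cross component prediction unit 160
    return filter_initial_prediction(init_u_pred, u_ref_stat, bit_depth)  # processing unit 510
```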

In the embodiment of the application, when the image prediction method is applied to the encoder side, after the target prediction value is obtained, the prediction residual can be determined from the difference between the target prediction value and the true value and written into the code stream. Meanwhile, the model parameters of the prediction model can be calculated according to the reference value of the image component to be predicted of the current block and the reference value of the image component to be referred to of the current block, and the calculated model parameters are also written into the code stream. The code stream is then transmitted from the encoder side to the decoder side. Correspondingly, when the image prediction method is applied to the decoder side, the prediction residual and the model parameters of the prediction model are obtained by parsing the code stream, so that the prediction model can be constructed. Thus, on the decoder side, the initial prediction value of the image component to be predicted of the current block is obtained through the prediction model, and the initial prediction value is then filtered to obtain the target prediction value of the image component to be predicted of the current block.
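
The sketch below only illustrates this encoder/decoder symmetry; the code stream is modelled as a plain Python dictionary rather than real entropy coding, and the helpers it calls are the hypothetical ones introduced above.

```python
def encode_block(original_u, target_u_pred, alpha, beta):
    """Encoder side: the prediction residual is the difference between the true U
    samples and the target predicted value; residual and model parameters are
    written into the (dictionary-shaped) code stream."""
    return {"residual": original_u - target_u_pred, "alpha": alpha, "beta": beta}

def decode_block(bitstream, rec_y1_block, u_ref_stat, bit_depth=10):
    """Decoder side: parse the model parameters and the prediction residual,
    rebuild the target predicted value through the same prediction-plus-filtering
    path, and reconstruct the U block."""
    init_u_pred = predict_chroma(rec_y1_block, bitstream["alpha"], bitstream["beta"])
    target_u_pred = filter_initial_prediction(init_u_pred, u_ref_stat, bit_depth)
    return target_u_pred + bitstream["residual"]
```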

The embodiment provides an image prediction method: an initial prediction value of the image component to be predicted of the current block in an image is obtained through a prediction model, and the initial prediction value is filtered to obtain a target prediction value of the image component to be predicted of the current block. In this way, after at least one image component of the current block is predicted, filtering is further applied to that component, so that the statistical characteristics of the image components after cross-component prediction are balanced and the prediction efficiency is improved. Because the obtained target prediction value is closer to the true value, the prediction residual of the image component is smaller, fewer bits need to be transmitted during encoding and decoding, and the coding and decoding efficiency of the video image can be improved.

Based on the same inventive concept of the foregoing embodiment, refer to fig. 6, which shows a schematic structural diagram of an encoder 60 according to an embodiment of the present application. The encoder 60 may comprise a first prediction unit 601 and a first processing unit 602, wherein,

the first prediction unit 601 is configured to obtain an initial prediction value of a current block to be predicted of an image component in the image through a prediction model;

the first processing unit 602 is configured to perform filtering processing on the initial prediction value to obtain a target prediction value of the to-be-predicted image component of the current block.

In the above scheme, referring to fig. 6, the encoder 60 may further include a first statistical unit 603 and a first obtaining unit 604, wherein,

the first statistical unit 603 is configured to perform characteristic statistics on at least one image component of the current block; wherein the at least one image component comprises an image component to be predicted and/or an image component to be referenced, the image component to be predicted being different from the image component to be referenced;

the first obtaining unit 604 is configured to obtain, according to a result of the characteristic statistics, a reference value of a to-be-predicted image component of the current block and/or a reference value of a to-be-referenced image component of the current block; the image component to be predicted is a component predicted when the prediction model is built, and the image component to be referred is a component used for prediction when the prediction model is built.

In the above solution, the first processing unit 602 is configured to perform filtering processing on the initial prediction value by using a preset processing mode according to the reference value of the image component to be predicted of the current block and/or the reference value of the image component to be referred to of the current block, where the preset processing mode at least includes one of the following modes: filtering, grouping, value correcting, quantizing and de-quantizing;

the first obtaining unit 604 is configured to obtain the target predicted value according to the processing result.

In the above solution, referring to fig. 6, the encoder 60 may further include a calculating unit 605 configured to calculate an initial prediction residual of the image component to be predicted of the current block based on the initial prediction value;

the first processing unit 602 is further configured to perform filtering processing on the initial prediction residual by using a preset processing mode according to a reference value of the to-be-predicted image component of the current block and/or a reference value of the to-be-referenced image component of the current block, where the preset processing mode at least includes one of the following: filtering, grouping, value correcting, quantizing and de-quantizing;

the first obtaining unit 604 is further configured to obtain the target prediction residual according to the processing result.

In the above solution, the calculating unit 605 is further configured to calculate a target prediction value of the to-be-predicted image component of the current block according to the target prediction residual.

In the above solution, referring to fig. 6, the encoder 60 may further include a first determining unit 606 and a first constructing unit 607, wherein,

the first determining unit 606 is configured to determine a reference value of a to-be-predicted image component of the current block, wherein the reference value of the to-be-predicted image component of the current block is the to-be-predicted image component value of a pixel adjacent to the current block; and determining a reference value of an image component to be referred to of the current block, wherein the image component to be referred to of the current block is different from the image component to be predicted, and the reference value of the image component to be referred to of the current block is the reference image component value of the adjacent pixel of the current block;

the calculating unit 605 is further configured to calculate a model parameter of the prediction model according to the reference value of the image component to be predicted of the current block and the reference value of the image component to be referred of the current block;

the first constructing unit 607 is configured to construct the prediction model according to the calculated model parameter, where the prediction model is used to perform cross-component prediction processing on the to-be-predicted image component of the current block according to the to-be-referenced image component of the current block.

In the above scheme, referring to fig. 6, the encoder 60 may further include a first adjusting unit 608 configured to perform resolution adjustment on the resolution of the image component to be referred to when the resolution of the image component to be predicted of the current block is different from the resolution of the image component to be referred to of the current block; wherein the resolution adjustment comprises an upsampling adjustment or a downsampling adjustment; updating the reference value of the image component to be referred of the current block based on the adjusted resolution of the image component to be referred to obtain a first reference value of the image component to be referred of the current block; and the adjusted resolution of the image component to be referred is the same as the resolution of the image component to be predicted.

In the above solution, the first adjusting unit 608 is further configured to, when the resolution of the image component to be predicted of the current block is different from the resolution of the image component to be referred to of the current block, perform adjustment processing on the reference value of the image component to be referred to of the current block to obtain a first reference value of the image component to be referred to of the current block, where the adjustment processing includes one of: down-sampling filtering, up-sampling filtering, cascaded filtering of down-sampling filtering and low-pass filtering, and cascaded filtering of up-sampling filtering and low-pass filtering.
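
One of the adjustment processes listed above is the cascade of down-sampling filtering and low-pass filtering. The sketch below shows one hypothetical realisation on a row of neighbouring reference samples, assuming 2:1 averaging followed by a [1, 2, 1]/4 smoothing filter; the specific taps are illustrative, not mandated by the embodiment.

```python
import numpy as np

def downsample_then_lowpass(ref_samples: np.ndarray) -> np.ndarray:
    """Cascaded filtering of down-sampling and low-pass filtering applied to a
    row of neighbouring reference samples."""
    assert ref_samples.size % 2 == 0, "an even number of reference samples is assumed"
    x = ref_samples.astype(np.float64)
    down = (x[0::2] + x[1::2]) / 2.0                              # down-sampling filtering
    padded = np.pad(down, 1, mode="edge")                         # replicate edge samples
    return (padded[:-2] + 2.0 * padded[1:-1] + padded[2:]) / 4.0  # low-pass filtering
```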

In the above solution, the calculating unit 605 is further configured to calculate the model parameter of the prediction model according to the reference value of the image component to be predicted of the current block and the first reference value of the image component to be referred of the current block.

In the above solution, the first processing unit 602 is further configured to perform filtering processing on the initial prediction value according to a reference value of a to-be-predicted image component of the current block, so as to obtain the target prediction value; wherein the reference value of the current block to-be-predicted image component is obtained by performing characteristic statistics on the image to-be-predicted image component or the current block to-be-predicted image component.

In the above solution, the first processing unit 602 is further configured to perform filtering processing on the initial prediction value by using a preset processing mode according to a reference value of a to-be-predicted image component of the current block; wherein the preset processing mode at least comprises one of the following modes: filtering processing, grouping processing, value modification processing, quantization processing, inverse quantization processing, low-pass filtering processing and adaptive filtering processing.

In the above solution, the calculating unit 605 is further configured to calculate, by using the initial prediction value, an initial prediction residual of a to-be-predicted image component of the current block;

the first processing unit 602 is further configured to perform filtering processing on the initial prediction residual by using a preset processing mode according to the reference value of the to-be-predicted image component of the current block; wherein the preset processing mode at least comprises one of the following modes: filtering processing, grouping processing, value modification processing, quantization processing, inverse quantization processing, low-pass filtering processing and adaptive filtering processing.

In the above solution, the first statistical unit 603 is further configured to perform characteristic statistics on a to-be-predicted image component of the image;

the first determining unit 606 is further configured to determine, according to a result of the characteristic statistics, a reference value of a to-be-predicted image component of the current block and a reference value of a to-be-referenced image component of the current block; wherein the image component to be referred is different from the image component to be predicted;

the calculating unit 605 is further configured to calculate a model parameter of the prediction model according to the reference value of the image component to be predicted of the current block and the reference value of the image component to be referred of the current block.

In the above solution, the first processing unit 602 is further configured to perform filtering processing on the initial prediction value by using a preset processing mode according to the reference value of the image component to be predicted of the current block and the reference value of the image component to be referred to of the current block, where the preset processing mode at least includes one of the following modes: filtering processing, grouping processing, value modification processing, quantization processing, inverse quantization processing, low-pass filtering processing and adaptive filtering processing.

It is understood that in the embodiments of the present application, a "unit" may be a part of a circuit, a part of a processor, a part of a program or software, and the like, and may also be a module, and may also be non-modular. Moreover, each component in the embodiment may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit can be realized in a form of hardware or a form of a software functional module.

Based on such understanding, the technical solution of this embodiment, in essence or in the part contributing to the prior art, or all or part of the technical solution, may be embodied in the form of a software product. The software product is stored in a storage medium and includes several instructions for causing a computer device (which may be a personal computer, a server, a network device, or the like) or a processor to execute all or part of the steps of the method of this embodiment. The aforementioned storage medium includes various media capable of storing program code, such as a USB flash drive, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk, or an optical disc.

Accordingly, embodiments of the present application provide a computer storage medium storing an image prediction program, which when executed by at least one processor implements the steps of the method of the foregoing embodiments.

Based on the above composition of the encoder 60 and the computer storage medium, refer to fig. 7, which shows a specific hardware structure of the encoder 60 provided in the embodiment of the present application. It may include: a first communication interface 701, a first memory 702, and a first processor 703; the components are coupled together through a first bus system 704. It can be understood that the first bus system 704 is used to enable connection and communication between these components. In addition to a data bus, the first bus system 704 also includes a power bus, a control bus, and a status signal bus. However, for clarity of illustration, the various buses are all labeled as the first bus system 704 in fig. 7.

a first communication interface 701, configured to receive and transmit signals during information transmission and reception with other external network elements;

a first memory 702 for storing a computer program capable of running on the first processor 703;

a first processor 703, configured to execute, when running the computer program:

obtaining an initial prediction value of a current block to be predicted of an image component in an image through a prediction model;

and filtering the initial prediction value to obtain a target prediction value of the current block to-be-predicted image component.

It will be appreciated that the first memory 702 in the embodiments of the present application may be a volatile memory or a non-volatile memory, or may include both volatile and non-volatile memory. The non-volatile memory may be a Read-Only Memory (ROM), a Programmable ROM (PROM), an Erasable PROM (EPROM), an Electrically Erasable PROM (EEPROM), or a flash memory. The volatile memory may be a Random Access Memory (RAM), which is used as an external cache. By way of illustration and not limitation, many forms of RAM are available, such as Static RAM (SRAM), Dynamic RAM (DRAM), Synchronous DRAM (SDRAM), Double Data Rate SDRAM (DDR SDRAM), Enhanced SDRAM (ESDRAM), Synchlink DRAM (SLDRAM), and Direct Rambus RAM (DRRAM). The first memory 702 of the systems and methods described herein is intended to comprise, without being limited to, these and any other suitable types of memory.

The first processor 703 may be an integrated circuit chip having signal processing capability. In implementation, the steps of the above method may be completed by integrated logic circuits of hardware in the first processor 703 or by instructions in the form of software. The first processor 703 may be a general-purpose processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA) or other programmable logic device, a discrete gate or transistor logic device, or a discrete hardware component, and may implement or perform the methods, steps, and logic blocks disclosed in the embodiments of the present application. A general-purpose processor may be a microprocessor, or the processor may be any conventional processor or the like. The steps of the method disclosed in connection with the embodiments of the present application may be directly embodied as being executed by a hardware decoding processor, or executed by a combination of hardware and software modules in a decoding processor. The software module may be located in a storage medium well known in the art, such as a RAM, a flash memory, a ROM, a PROM, an electrically erasable programmable memory, or a register. The storage medium is located in the first memory 702, and the first processor 703 reads the information in the first memory 702 and completes the steps of the above method in combination with its hardware.

It is to be understood that the embodiments described herein may be implemented in hardware, software, firmware, middleware, microcode, or any combination thereof. For a hardware implementation, the Processing units may be implemented within one or more Application Specific Integrated Circuits (ASICs), Digital Signal Processors (DSPs), Digital Signal Processing Devices (DSPDs), Programmable Logic Devices (PLDs), Field Programmable Gate Arrays (FPGAs), general purpose processors, controllers, micro-controllers, microprocessors, other electronic units configured to perform the functions described herein, or a combination thereof. For a software implementation, the techniques described herein may be implemented with modules (e.g., procedures, functions, and so on) that perform the functions described herein. The software codes may be stored in a memory and executed by a processor. The memory may be implemented within the processor or external to the processor.

Optionally, as another embodiment, the first processor 703 is further configured to, when running the computer program, perform the method of any of the foregoing embodiments.

The present embodiment provides an encoder, which may include a first prediction unit and a first processing unit, where the first prediction unit is configured to obtain, through a prediction model, an initial prediction value of the image component to be predicted of the current block in an image, and the first processing unit is configured to filter the initial prediction value to obtain a target prediction value of the image component to be predicted of the current block. In this way, after at least one image component of the current block is predicted, filtering is further applied to that component, so that the statistical characteristics of the image components after cross-component prediction are balanced and the prediction efficiency is improved. Because the obtained target prediction value is closer to the true value, the prediction residual of the image component is smaller, fewer bits need to be transmitted during encoding and decoding, and the coding and decoding efficiency of the video image can be improved.

Based on the same inventive concept of the foregoing embodiment, refer to fig. 8, which shows a schematic structural diagram of a decoder 80 provided in an embodiment of the present application. The decoder 80 may comprise a second prediction unit 801 and a second processing unit 802, wherein,

the second prediction unit 801 is configured to obtain an initial prediction value of a current block to be predicted of an image component in the image through a prediction model;

the second processing unit 802 is configured to perform filtering processing on the initial prediction value to obtain a target prediction value of the to-be-predicted image component of the current block.

In the above scheme, referring to fig. 8, the decoder 80 may further include a second statistics unit 803 and a second obtaining unit 804, wherein,

the second statistical unit 803 is configured to perform characteristic statistics on at least one image component of the current block; wherein the at least one image component comprises an image component to be predicted and/or an image component to be referenced, the image component to be predicted being different from the image component to be referenced;

the second obtaining unit 804 is configured to obtain, according to a result of the characteristic statistics, a reference value of the image component to be predicted of the current block and/or a reference value of the image component to be referred to of the current block; the image component to be predicted is a component predicted when the prediction model is built, and the image component to be referred is a component used for prediction when the prediction model is built.

In the above solution, the second processing unit 802 is configured to perform filtering processing on the initial prediction value by using a preset processing mode according to the reference value of the image component to be predicted of the current block and/or the reference value of the image component to be referred to of the current block, where the preset processing mode at least includes one of the following modes: filtering, grouping, value correcting, quantizing and de-quantizing;

the second obtaining unit 804 is configured to obtain the target predicted value according to the processing result.

In the above scheme, referring to fig. 8, the decoder 80 may further include an analyzing unit 805 configured to analyze the code stream to obtain an initial prediction residual of the image component to be predicted of the current block;

the second processing unit 802 is further configured to perform filtering processing on the initial prediction residual according to a reference value of a to-be-predicted image component of the current block and/or a reference value of a to-be-referenced image component of the current block by using a preset processing mode, where the preset processing mode at least includes one of: filtering, grouping, value correcting, quantizing and de-quantizing;

the second obtaining unit 804 is further configured to obtain the target prediction residual according to the processing result.

In the above scheme, referring to fig. 8, the decoder 80 may further include a second construction unit 806, wherein,

the analysis unit 805 is further configured to analyze the code stream to obtain a model parameter of the prediction model;

the second constructing unit 806 is configured to construct the prediction model according to the analyzed model parameter, where the prediction model is used to perform cross-component prediction processing on the to-be-predicted image component of the current block according to the to-be-referenced image component of the current block.

In the above scheme, referring to fig. 8, the decoder 80 may further include a second adjustment unit 807 configured to perform resolution adjustment on the resolution of the image component to be referred to when the resolution of the image component to be predicted of the current block is different from the resolution of the image component to be referred to of the current block; wherein the resolution adjustment comprises an upsampling adjustment or a downsampling adjustment; updating the reference value of the image component to be referred of the current block based on the adjusted resolution of the image component to be referred to obtain a first reference value of the image component to be referred of the current block; and the adjusted resolution of the image component to be referred is the same as the resolution of the image component to be predicted.

In the above solution, the second adjusting unit 807 is further configured to, when the resolution of the image component to be predicted of the current block is different from the resolution of the image component to be referred to of the current block, perform an adjusting process on the reference value of the image component to be referred to of the current block to obtain the first reference value of the image component to be referred to of the current block, where the adjusting process includes one of: down-sampling filtering, up-sampling filtering, cascaded filtering of down-sampling filtering and low-pass filtering, and cascaded filtering of up-sampling filtering and low-pass filtering.

In the above solution, the second processing unit 802 is further configured to perform filtering processing on the initial prediction value according to a reference value of a to-be-predicted image component of the current block to obtain the target prediction value; wherein the reference value of the current block to-be-predicted image component is obtained by performing characteristic statistics on the image to-be-predicted image component or the current block to-be-predicted image component.

In the above solution, the second processing unit 802 is further configured to perform filtering processing on the initial prediction value by using a preset processing mode according to the reference value of the to-be-predicted image component of the current block; wherein the preset processing mode at least comprises one of the following modes: filtering processing, grouping processing, value modification processing, quantization processing, inverse quantization processing, low-pass filtering processing and adaptive filtering processing.

In the above scheme, the parsing unit 805 is configured to parse a code stream to obtain an initial prediction residual of a to-be-predicted image component of the current block;

the second processing unit 802 is further configured to perform filtering processing on the initial prediction residual error by using a preset processing mode according to the reference value of the to-be-predicted image component of the current block; wherein the preset processing mode at least comprises one of the following modes: filtering processing, grouping processing, value modification processing, quantization processing, inverse quantization processing, low-pass filtering processing and adaptive filtering processing.

In the above scheme, referring to fig. 8, the decoder 80 may further include a second determining unit 808, wherein,

the second statistical unit 803 is further configured to perform characteristic statistics on the image component to be predicted of the image;

the second determining unit 808 is further configured to determine, according to a result of the characteristic statistics, a reference value of a to-be-predicted image component of the current block and a reference value of a to-be-referenced image component of the current block; wherein the image component to be referred is different from the image component to be predicted.

In the above solution, the second processing unit 802 is further configured to perform filtering processing on the initial prediction value by using a preset processing mode according to the reference value of the image component to be predicted of the current block and the reference value of the image component to be referred to of the current block, where the preset processing mode at least includes one of the following modes: filtering processing, grouping processing, value modification processing, quantization processing, inverse quantization processing, low-pass filtering processing and adaptive filtering processing.

It is understood that in this embodiment, a "unit" may be a part of a circuit, a part of a processor, a part of a program or software, etc., and may also be a module, or may also be non-modular. Moreover, each component in the embodiment may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit can be realized in a form of hardware or a form of a software functional module.

The integrated unit, if implemented in the form of a software functional module and not sold or used as a stand-alone product, may be stored in a computer readable storage medium. Based on such an understanding, the present embodiment provides a computer storage medium storing an image prediction program that, when executed by a second processor, implements the method of any of the preceding embodiments.

Based on the above composition of the decoder 80 and the computer storage medium, refer to fig. 9, which shows a specific hardware structure of the decoder 80 provided in the embodiment of the present application. It may include: a second communication interface 901, a second memory 902, and a second processor 903; the components are coupled together through a second bus system 904. It can be understood that the second bus system 904 is used to enable connection and communication between these components. In addition to a data bus, the second bus system 904 also includes a power bus, a control bus, and a status signal bus. However, for clarity of illustration, the various buses are all labeled as the second bus system 904 in fig. 9.

a second communication interface 901, configured to receive and send signals in a process of receiving and sending information with other external network elements;

a second memory 902 for storing a computer program capable of running on the second processor 903;

a second processor 903, configured to execute, when running the computer program:

obtaining an initial prediction value of a current block to be predicted of an image component in an image through a prediction model;

and filtering the initial prediction value to obtain a target prediction value of the current block to-be-predicted image component.

Optionally, as another embodiment, the second processor 903 is further configured to execute the method in any one of the foregoing embodiments when the computer program is executed.

It is to be understood that the second memory 902 has hardware functionality similar to that of the first memory 702, and the second processor 903 has hardware functionality similar to that of the first processor 703; and will not be described in detail herein.

The present embodiment provides a decoder, which may include a second prediction unit and a second processing unit, where the second prediction unit is configured to obtain, through a prediction model, an initial prediction value of the image component to be predicted of the current block in an image, and the second processing unit is configured to filter the initial prediction value to obtain a target prediction value of the image component to be predicted of the current block. In this way, after at least one image component of the current block is predicted, filtering is further applied to that component, so that the statistical characteristics of the image components after cross-component prediction are balanced, thereby not only improving the prediction efficiency but also improving the coding and decoding efficiency of the video image.

It should be noted that, in the present application, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other like elements in a process, method, article, or apparatus that comprises the element.

The above-mentioned serial numbers of the embodiments of the present application are merely for description and do not represent the merits of the embodiments.

The methods disclosed in the several method embodiments provided in the present application may be combined arbitrarily without conflict to obtain new method embodiments.

Features disclosed in several of the product embodiments provided in the present application may be combined in any combination to yield new product embodiments without conflict.

The features disclosed in the several method or apparatus embodiments provided in the present application may be combined arbitrarily, without conflict, to arrive at new method embodiments or apparatus embodiments.

The above description is only for the specific embodiments of the present application, but the scope of the present application is not limited thereto, and any person skilled in the art can easily conceive of the changes or substitutions within the technical scope of the present application, and shall be covered by the scope of the present application. Therefore, the protection scope of the present application shall be subject to the protection scope of the claims.

Industrial applicability

In the embodiment of the application, an initial prediction value of the image component to be predicted of the current block in an image is first obtained through a prediction model; the initial prediction value is then filtered to obtain a target prediction value of the image component to be predicted of the current block. In this way, after at least one image component of the current block is predicted, filtering is further applied to that component, so that the statistical characteristics of the image components after cross-component prediction are balanced and the prediction efficiency is improved. Because the obtained target prediction value is closer to the true value, the prediction residual of the image component is smaller, fewer bits need to be transmitted during encoding and decoding, and the coding and decoding efficiency of the video image can be improved.
