interaction of intra block copy with optional temporal motion vector prediction

文档序号：1712120 发布日期：2019-12-13 浏览：19次中文

阅读说明：本技术 帧内块复制与可选时域运动矢量预测的交互 (interaction of intra block copy with optional temporal motion vector prediction ) 是由张凯张莉刘鸿彬王悦于 2019-06-05 设计创作，主要内容包括：描述了在视频编码中应用帧内块复制(IBC)的设备、系统和方法。总体来说,描述了将IBC与用于视频编码和解码的现有运动补偿算法相结合的方法。在一个典型的方面,一种使用IBC进行视频编码的方法,包括：确定是否使用运动补偿算法对当前图片的当前块进行编码；以及基于该确定,通过将帧内块复制选择性地应用到当前块对当前块进行编码。在一个典型的方面,另一种使用IBC进行视频编码方法,包括：确定是否使用帧内块复制对当前图片的当前块进行编码；以及基于该确定,通过将运动补偿算法选择性地应用到当前块对当前块进行编码。(Apparatus, systems, and methods are described that apply Intra Block Copy (IBC) in video coding. In general, methods are described that combine IBC with existing motion compensation algorithms for video encoding and decoding. In one exemplary aspect, a method for video encoding using IBC includes: determining whether a current block of a current picture is encoded using a motion compensation algorithm; and encoding the current block by selectively applying intra block copy to the current block based on the determination. In one exemplary aspect, another method for video encoding using IBC includes: determining whether to encode a current block of a current picture using intra block copy; and encoding the current block by selectively applying a motion compensation algorithm to the current block based on the determination.)

1. a method of decoding visual information, comprising:

determining from an encoded representation that a decoded block representing a portion of the visual information is encoded using an optional temporal motion vector prediction (ATMVP) encoding technique;

Determining that spatial neighboring blocks of the decoded block are encoded using an Intra Block Copy (IBC) encoding technique;

Determining that the spatial neighboring block cannot provide a motion vector to derive a temporal vector for the decoded block; and

-decoding the encoded representation by deriving the temporal vector using an ATMVP decoding technique corresponding to the ATMVP coding technique and by avoiding the use of the spatial neighboring blocks to provide the motion vector for the decoded block.

2. A method of decoding visual information, comprising:

Determining from an encoded representation that a decoded block representing a portion of the visual information is encoded using an optional temporal motion vector prediction (ATMVP) encoding technique;

determining that a collocated block in a reference picture is encoded using an Intra Block Copy (IBC) encoding technique;

Determining that the collocated block in a reference picture cannot be used to derive a motion vector for a sub-block of the decoded block; and

Decoding the encoded representation by using an ATMVP decoding technique corresponding to the ATMVP encoding technique and by avoiding using the collocated block in the reference picture to provide temporal motion vector candidates for the sub-blocks of the decoded block.

3. the method of any of claims 1 to 2, comprising:

Obtaining a plurality of blocks representing an encoding of a first picture from the encoded visual information;

Decoding an initial block of the plurality of blocks; and

in decoding the initial block, a first block of a remainder of the plurality of blocks is decoded based on a result of the decoding.

4. The method of any of claims 1 to 3, comprising:

Obtaining a plurality of blocks representing an encoding of a second picture from the encoded visual information;

Decoding an encoding of the second picture associated with the visual information using the ATMVP decoding technique by performing a two-step process on the encoded decoded block of the plurality of blocks that represents the second picture, the two-step process comprising:

In a first step of the two-step process, obtaining a decoded reference picture of the encoded visual information, a decoded corresponding block of the reference picture corresponding to the decoded block, and a temporal vector containing motion vectors of the spatial neighboring blocks, wherein the spatial neighboring blocks are spatial neighbors of the decoded block;

In a second step of the two-step process, obtaining a plurality of sub-blocks representing the decoded block, identifying a corresponding sub-block for each of the plurality of sub-blocks in the reference picture of the visual information, obtaining a motion vector for the corresponding sub-block, and deriving the motion vector for each of the plurality of sub-blocks based on the motion vector for the corresponding sub-block; and

Decoding the plurality of sub-blocks based on a motion vector of the sub-block and a corresponding sub-block in the reference picture of the visual information.

5. the method of claim 4, comprising:

it is assumed that the spatial neighboring blocks are not encoded using the IBC encoding technique.

6. The method of any of claims 4 to 5, comprising:

A decoding indicator that designates the spatial neighboring block as unavailable or intra-coded for use in the first step of an ATMVP decoding technique.

7. The method of any of claims 4 to 6, comprising:

A decoding indicator designating the corresponding sub-block as unavailable or intra-coded for use in the second step of the ATMVP decoding technique.

8. The method of any of claims 4 to 5, comprising:

A decoding indicator that designates the spatial neighboring block as using an intra coding technique in the first step of an ATMVP decoding technique.

9. the method of any of claims 4 to 5, comprising:

a decoding indicator that designates the corresponding sub-block as using an intra coding technique in the second step of an ATMVP decoding technique.

10. the method of any of claims 1 to 9, comprising:

in the case where the corresponding block of the encoded block is encoded using the IBC encoding technique, the motion vector of the corresponding sub-block is copied to a sub-block of the encoded block without scaling the motion vector of the corresponding sub-block.

11. The method of claim 4, comprising:

Decoding an indicator associated with the corresponding block, the indicator indicating that the corresponding block was encoded using the IBC encoding technique; and

Decoding the decoded block using the second picture as the reference picture.

12. The method according to any one of claims 4 to 11, comprising:

Decoding an indicator associated with the corresponding sub-block, the indicator indicating that the corresponding sub-block was encoded using the IBC encoding technique; and

Decoding the sub-blocks with the corresponding sub-blocks encoded using the IBC encoding technique.

13. The method of claim 4, comprising:

Copying a motion vector of the corresponding sub-block to a sub-block of the decoded block without scaling the motion vector of the corresponding sub-block if the corresponding block of the decoded block is encoded using the IBC encoding technique.

14. The method of claim 13, comprising:

changing the reference picture to the second picture including the decoded block.

15. The method of claim 4, comprising:

deriving the motion vector for the sub-block of the decoded block based on the motion vector for at least one corresponding sub-block encoded using the IBC encoding technique.

16. A method of encoding visual information, comprising:

Encoding the visual information into a plurality of encoded pictures and a plurality of indicators indicative of one or more encoding techniques applied, the plurality of indicators including an Intra Block Copy (IBC) technique indicator and an optional temporal motion vector prediction (ATMVP) technique indicator, wherein a first block of a first picture associated with the visual information is encoded using the IBC technique and a second block of a second picture associated with the visual information is encoded using the ATMVP technique, wherein the IBC technique encodes the first block of the first picture using a different block of the first picture and the ATMVP technique encodes the second picture using a third picture associated with the visual information.

17. the method of claim 16, comprising:

dividing the first picture into a plurality of blocks;

Encoding an initial block of the plurality of blocks; and

in encoding the initial block, a first block of the plurality of blocks is encoded based on the initial block.

18. The method of any of claims 16 to 17, comprising:

encoding the second picture associated with the visual information using the ATMVP technique by dividing the second picture into a plurality of blocks including encoded blocks and performing a two-step process comprising:

in a first step of the two-step process, identifying a reference picture of the visual information, a corresponding block of the reference picture corresponding to the block being encoded, and a temporal vector comprising motion vectors of spatially neighboring blocks, wherein the spatially neighboring blocks are spatial neighbors of the block being encoded;

In a second step of the two-step process, dividing a block to be encoded into a plurality of sub-blocks, identifying a corresponding sub-block for each of the plurality of sub-blocks in the reference picture of the visual information, obtaining a motion vector of the corresponding sub-block, and deriving a motion vector for each of the plurality of sub-blocks based on the motion vector of the corresponding sub-block; and

Encoding the sub-block of the plurality of sub-blocks based on the motion vector of the sub-block and the corresponding sub-block in the reference picture of the visual information.

19. The method of claim 18, comprising:

The spatial neighboring blocks are required to be not encoded using the IBC technique.

20. The method according to claim 18 or 19, comprising:

In the event that the spatial neighboring block of the block being encoded is encoded using an IBC technique, encoding an indicator that designates the spatial neighboring block as unavailable or intra-coded for use in the first step of an ATMVP technique.

21. The method of any of claims 18 to 20, comprising:

in a case that the reference picture of the visual information is encoded using an IBC technique, encoding an indicator that designates the corresponding sub-block as unavailable or intra-coded for use in the second step of the ATMVP technique.

22. The method according to claim 18 or 19, comprising:

In the case where the spatial neighboring blocks of the block being encoded are encoded using IBC techniques, an indicator is encoded that designates the spatial neighboring blocks as intra-coded for use in the first step of the ATMVP technique.

23. the method of any of claims 18 to 20, comprising:

in a case where the reference picture of the visual information is encoded using an IBC technique, encoding an indicator that designates the corresponding sub-block as intra-coded for use in the second step of the ATMVP technique.

24. the method of claim 18, comprising:

In the case where the corresponding block of the encoded block is encoded using the IBC technique, the motion vector of the corresponding sub-block is copied to a sub-block of the encoded block without scaling the motion vector of the corresponding sub-block.

25. the method of claim 24, comprising:

Altering the reference picture to the second picture including the encoded block.

26. The method of any of claims 18 to 25, comprising:

Deriving the motion vector for the sub-block of the encoded block based on the motion vector for at least one corresponding sub-block encoded using the IBC technique.

27. A video processing apparatus comprising a processor configured to implement the method of one or more of claims 1 to 26.

28. A computer readable medium having stored thereon processor executable code, which when executed, causes a processor to implement a method as claimed in one or more of claims 1 to 26.

Technical Field

this patent document relates generally to video coding techniques.

Background

motion compensation is a technique in video processing that predicts frames in a video given previous and/or future frames by taking into account the motion of the camera and/or objects in the video. Motion compensation may be used in the encoding and decoding of video data to achieve video compression.

Disclosure of Invention

Devices, systems, and methods related to intra block copy for motion compensation are described.

In one exemplary aspect, the disclosed techniques may be used to provide a video encoding method using intra block copy. The method comprises the following steps: determining whether a current block of a current picture is encoded using a motion compensation algorithm; and encoding the current block by selectively applying intra block copy to the current block based on the determination.

In another exemplary aspect, another method of video encoding using intra block copy may be provided using the disclosed techniques. The method comprises the following steps: determining whether to encode a current block of a current picture using intra block copy; and encoding the current block by selectively applying a motion compensation algorithm to the current block based on the determination.

in yet another exemplary aspect, a video decoding method using intra block copy may be provided using the disclosed techniques. The method comprises the following steps: determining whether a current block of a current picture is decoded using a motion compensation algorithm; and decoding the current block by selectively applying intra block copy to the current block based on the determination.

In yet another exemplary aspect, another method of video decoding using intra block copy may be provided using the disclosed techniques. The method comprises the following steps: determining whether to decode a current block of a current picture using intra block copy; and decoding the current block by selectively applying a motion compensation algorithm to the current block based on the determination.

In yet another exemplary aspect, a method of decoding visual information is disclosed. The method comprises the following steps: determining from the encoded representation that a decoded block representing a portion of the visual information is encoded using an optional temporal motion vector prediction (ATMVP) encoding technique; determining that spatial neighboring blocks of the decoded block are encoded using an Intra Block Copy (IBC) encoding technique; determining that a spatial neighboring block cannot provide a motion vector to derive a temporal vector for the decoded block; and deriving a temporal vector by providing the motion vector for the decoded block using an ATMVP decoding technique corresponding to the ATMVP coding technique and by avoiding the use of spatial neighboring blocks.

in yet another exemplary aspect, another method of decoding visual information is disclosed. The method comprises the following steps: determining from the encoded representation that a decoded block representing a portion of the visual information is encoded using an optional temporal motion vector prediction (ATMVP) encoding technique; determining that a collocated block in a reference picture is encoded using an Intra Block Copy (IBC) encoding technique; determining that a collocated block in a reference picture cannot be used to derive a motion vector for a sub-block of a decoded block; and providing temporal motion vector candidates for sub-blocks of the decoded block by using an ATMVP decoding technique corresponding to the ATMVP coding technique and by avoiding the use of collocated blocks in the reference picture.

in yet another exemplary aspect, another visual information processing method is disclosed. The method comprises the following steps: encoding visual information into a plurality of encoded pictures and a plurality of indicators, the plurality of indicators indicating one or more encoding techniques applied, the plurality of indicators including an Intra Block Copy (IBC) technique indicator and an ATMVP technique indicator, wherein a first block of a first picture associated with the visual information is encoded using an IBC technique and a second block of a second picture associated with the visual information is encoded using an ATMVP technique, wherein the IBC technique encodes the first block of the first picture using a different block of the first picture and the ATMVP technique encodes the second picture using a third picture associated with the visual information. In yet another exemplary aspect, the above-described method is embodied in the form of processor executable code and stored in a computer readable program medium.

In yet another exemplary aspect, an apparatus configured or operable to perform the above method is disclosed. The apparatus may include a processor programmed to implement the method.

In yet another exemplary aspect, a video decoder apparatus as described herein may be implemented.

the above aspects, as well as other aspects and features of the disclosed technology, are described in more detail in the accompanying drawings, the description, and the claims.

Drawings

fig. 1 illustrates an example of an intra block copy technique.

fig. 2 shows an example of motion prediction using an optional temporal motion vector prediction (ATMVP) algorithm for a Coding Unit (CU).

fig. 3 illustrates a flow diagram of an example method for video encoding using intra block copy in accordance with the disclosed techniques.

fig. 4 illustrates a flow diagram of another example method of video encoding using intra block copy in accordance with the disclosed techniques.

fig. 5 illustrates a flow diagram of an example method for video decoding using intra block copy in accordance with the disclosed techniques.

Fig. 6 illustrates a flow diagram of another example method for video decoding using intra block copy in accordance with the disclosed techniques.

FIG. 7 is a block diagram illustrating an example of the structure of a computer system or other control device that may be used to implement various portions of the disclosed technology.

FIG. 8 illustrates a block diagram of an example embodiment of a mobile device that may be used to implement various portions of the disclosed technology.

FIG. 9 is a flow diagram of an example method of visual information processing.

FIG. 10 is a flow diagram of an example method of visual information processing.

Detailed Description

Section headings are used herein for ease of understanding, and do not limit the scope of the techniques and embodiments discussed in the sections to only that section.

Due to the increasing demand for high resolution visual information such as video, pictures, three-dimensional scenes, and the like, video coding methods and techniques are ubiquitous in modern technology. The techniques described herein may be applied to a variety of visual information, including videos, pictures, three-dimensional scenes, and the like. The picture of visual information may be a frame in a video, a portion of a picture, an object in a three-dimensional scene, a portion of a three-dimensional scene, and so on. A block may be a portion of a picture of visual information, such as a Coding Unit (CU), a Largest Coding Unit (LCU), a sample, a Prediction Unit (PU), etc., as described herein. The sub-blocks of visual information may be PUs, such as sub-CUs, samples, etc. The PU may be a pixel, voxel, or minimum-resolution quantum of visual information. Video codecs typically include electronic circuits or software that compress or decompress digital video and are continually being improved to provide higher coding efficiency. Video codecs convert uncompressed video into a compressed format, or vice versa. There is a complex relationship between video quality, the amount of data used to represent the video (as determined by the bit rate), the complexity of the encoding and decoding algorithms, susceptibility to data loss and errors, ease of editing, random access, and end-to-end delay (latency). The compression format typically conforms to a standard video compression specification, such as the High Efficiency Video Coding (HEVC) standard (also known as h.265 or MPEG-H part 2), the general video coding standard to be finalized, or other current and/or future video coding standards.

embodiments of the disclosed techniques may be applied to existing video coding standards (e.g., HEVC, h.265) and future standards to improve runtime performance. In this document, section headings are used to enhance the readability of the description, and do not in any way limit the discussion or embodiments (and/or implementations) to the respective sections.

1. Examples of reference pictures and reference picture lists

In HEVC, there are two types of reference pictures, short-term and long-term. When a reference picture is no longer needed for prediction reference, it may be marked as "unused for reference". HEVC introduces a completely new reference picture management method, called Reference Picture Set (RPS) or buffer description.

The process of marking a picture as "for short-term reference", "for long-term reference", or "unused for reference" is done using the RPS concept. An RPS is a set of picture indicators that signal in each slice (slice) header and consists of a set of short-term pictures and a set of long-term pictures. After decoding the first slice header of a picture, the pictures in the DPB will be marked as specified by the RPS. The pictures indicated by the short-term picture part of the RPS in the DPB are kept as short-term pictures. The short-term or long-term pictures indicated by the long-term picture part of the RPS in the DPB are converted to or kept as long-term pictures. Finally, pictures in the DPB that have no indicator in the RPS are marked as unused for reference. Therefore, all decoded pictures that may be used as references for any subsequent picture decoding order prediction must be included in the RPS.

the RPS consists of a set of Picture Order Count (POC) values used to identify pictures in the DPB. In addition to signaling POC information, the RPS also sends a flag for each picture. Each flag indicates whether the current picture has a corresponding picture available or unavailable for reference. It should be noted that even if a reference picture is signaled as unavailable for the current picture, it is still retained in the DPB and may be available later for reference and used for decoding future pictures.

from the POC information and the availability flag, five reference picture lists shown in table 1 may be created. The list RefPicSetStCurrBefore consists of short-term pictures that are available for reference by the current picture and have POC values lower than the POC value of the current picture. RefPicSetStCurrAfter consists of available short-term pictures with POC values higher than the POC value of the current picture. RefPicSetStFoll is a list of all short-term pictures that contain reference pictures that are not available for the current picture, but can be used as reference pictures to decode subsequent pictures in decoding order. Finally, the lists RefPicSetLtCurr and refpicsetltfol include long-term pictures that are available for reference by the current picture and are not available for reference by the current picture, respectively.

Table 1 list of reference picture lists

1.1 short-term and long-term reference Picture examples

the syntax of the generic sequence parameter set is shown below:

the syntax of the generic slice segment (slice segment) header is shown as follows:

The semantics used in the above syntax table are defined as follows:

num _ short _ term _ ref _ pic _ sets specifies the number of st _ ref _ pic _ set () syntax structures contained in the SPS. The value of num _ short _ term _ ref _ pic _ sets should be between 0 and 64 (including 0 and 64).

In some embodiments, the decoder may allocate memory for the total number of num _ short _ term _ ref _ pic _ sets +1st _ ref _ pic _ set () syntax structures because one st _ ref _ pic _ set () syntax structure may be signaled directly in the slice header of the current picture. The st _ ref _ pic _ set () syntax structure that signals directly in the slice header of the current picture has an index equal to num _ short _ term _ ref _ pic _ set.

long _ term _ ref _ pics _ present _ flag equal to 0 specifies that inter prediction of any coded picture in CVS does not use long-term reference pictures. long _ term _ ref _ pics _ present _ flag equal to 1 specifies that inter prediction of one or more coded pictures in CVS may use long-term reference pictures.

num _ long _ term _ ref _ pics _ SPS specifies the number of candidate long-term reference pictures specified in the SPS. The value of num _ long _ term _ ref _ pics _ sps should be between 0 and 32 (including 0 and 32).

lt _ ref _ pic _ poc _ lsb _ SPS [ i ] specifies the picture order count module MaxPicOrderCntLsb of the i-th candidate long-term reference picture in the SPS. The number of bits used to represent lt _ ref _ pic _ poc _ lsb _ sps [ i ] is equal to log2_ max _ pic _ order _ cnt _ lsb _ minus4+ 4.

used _ by _ curr _ pic _ lt _ SPS _ flag [ i ] equal to 0 specifies that the i-th candidate long-term reference picture specified in the SPS is not used for reference by the picture in its long-term Reference Picture Set (RPS) that contains its i-th candidate long-term reference picture specified in the SPS.

short _ term _ ref _ pic _ set _ SPS _ flag equal to 1 specifies that the short-term RPS of the current picture is derived based on one st _ ref _ pic _ set () syntax structure in the active SPS identified by the syntax element short _ term _ ref _ pic _ set _ idx in the slice header. short _ term _ ref _ pic _ set _ sps _ flag equal to 0 specifies that the short-term RPS of the current picture is derived based on the st _ ref _ pic _ set () syntax structure contained directly in the current picture slice header. When num _ short _ term _ ref _ pic _ sets is equal to 0, the value of short _ term _ ref _ pic _ set _ sps _ flag should be equal to 0.

short _ term _ ref _ pic _ set _ idx specifies an index of a syntax structure st _ ref _ pic _ set () used to derive a short-term RPS of a current picture to a list of syntax structures st _ ref _ pic _ set () contained in an active SPS. The syntax element short _ term _ ref _ pic _ set _ idx is represented by a Ceil (log2(num _ short _ term _ ref _ pic _ set)) bit. When not present, the value of short _ term _ ref _ pic _ set _ idx is inferred to be equal to 0. The value of short _ term _ ref _ pic _ set _ idx should be in the range of 0 to num _ short _ term _ ref _ pic _ sets-1 (including 0 and num _ short _ term _ ref _ pic _ sets-1).

In some embodiments, the variable currrpsiidx is derived as follows:

-if short _ term _ ref _ pic _ set _ sps _ flag is equal to 1, set currrpidx equal to short _ term _ ref _ pic _ set _ idx.

else, set currrpidsidx equal to num _ short _ term _ ref _ pic _ sets.

num _ long _ term _ SPS specifies the number of entries in the long-term RPS of the current picture derived based on the candidate long-term reference pictures specified in the active SPS. The value of num _ long _ term _ sps should be in the range of 0 to num _ long _ term _ ref _ pics _ sps (including 0 and num _ long _ term _ ref _ pics _ sps). When not present, the value of num _ long _ term _ sps is inferred to be equal to 0.

num _ long _ term _ pics specifies the number of entries in the long-term RPS of the current picture that are signaled directly in the slice header. When not present, the value of num _ long _ term _ pics is inferred to be equal to 0.

In some embodiments, when nuh _ layer _ id is equal to 0, the value of num _ long _ term _ pics should be less than or equal to sps _ max _ dec _ pic _ buffering _ minus1[ Temporalld ] -NumNegative Pics [ CurrrpSidx ] -NumPositivePics [ CurrrRsIdx ] -num _ long _ term _ sps-TwoVersionsOfCurrDecPicFlag.

lt _ idx _ SPS [ i ] assigns the index of the ith entry in the long-term RPS of the current picture to the list of candidate long-term reference pictures specified in the active SPS. The number of bits used to represent lt _ idx _ sps [ i ] is equal to Ceil (Log2(num _ long _ term _ ref _ pics _ sps)). When not present, the value of lt _ idx _ sps [ i ] is inferred to be equal to 0. The value of lt _ idx _ sps [ i ] should be in the range of 0 to num _ long _ term _ ref _ pics _ sps-1 (including 0 and num _ long _ term _ ref _ pics _ sps-1).

poc _ lsb _ lt [ i ] specifies the value of the picture order count module MaxPicOrderCntLsb of the ith entry in the long-term RPS of the current picture. The length of the poc _ lsb _ lt [ i ] syntax element is log2_ max _ pic _ order _ cnt _ lsb _ minus4+4 bits.

used _ by _ curr _ pic _ lt _ flag [ i ] equal to 0 specifies that the i-th entry in the long-term RPS of the current picture is not used by the current picture for reference.

In some embodiments, the variables PocLsbLt [ i ] and UsedByCurrPicLt [ i ] are derived as follows:

-if i is less than num _ long _ term _ sps, setting PocLsbLt [ i ] equal to lt _ ref _ pic _ poc _ lsb _ sps [ lt _ idx _ sps [ i ] ], and setting usedbycurrrpiclt [ i ] equal to used _ by _ curr _ pic _ lt _ sps _ flag [ lt _ idx _ sps [ i ] ].

-otherwise, PocLsbLt [ i ] is set equal to poc _ lsb _ lt [ i ], and UsedByCurrPicLt [ i ] is set equal to used _ by _ curr _ pic _ lt _ flag [ i ].

a delta _ poc _ msb _ present _ flag [ i ] equal to 1 means that a delta _ poc _ msb _ cycle _ lt [ i ] exists. delta _ poc _ msb _ present _ flag [ i ] equal to 0 specifies that delta _ poc _ msb _ cycle _ lt [ i ] is not present.

in some embodiments, prevTid0Pic is taken as the previous picture in decoding order, which has a temporalld equal to 0 and is not a RASL, RADL, or SLNR picture. Let setOfPrevPocVals be the set comprising:

PicOrderCntVal of prevTid0Pic,

-PicOrderCntVal for each picture in the RPS of prevTid0Pic,

-PicOrderCntVal of each picture following prevTid0Pic in decoding order and preceding the current picture in decoding order.

in some embodiments, delta _ poc _ msb _ present _ flag [ i ] should be equal to 1 when there are more than one value in setofprevpockvvals (where the value module MaxPicOrderCntLsb equals PocLsbLt [ i ]).

delta _ poc _ msb _ cycle _ lt [ i ] is used to determine the value of the most significant bit of the picture order count value of the ith entry in the current picture long-term RPS. When delta _ poc _ msb _ cycle _ lt [ i ] is not present, it is inferred to be equal to 0.

In some embodiments, the variable deltapocmcsbcllt [ i ] is derived as follows:

If (i ═ 0| | | | | i ═ num _ long _ term _ sps)

DeltaPocMsbCycleLt[i]＝delta_poc_msb_cycle_lt[i]

Otherwise DeltaPocMgCyclt [ i ] ═ delta _ poc _ msb _ cycle _ lt [ i ] + DeltaPocMgbCyclt [ i-1]

1.2 Motion Vector Prediction (MVP) example between short-term and long-term reference pictures

In some embodiments, motion vector prediction is only allowed when the target reference picture type and the prediction reference picture type are the same. In other words, when the types are different, motion vector prediction is not allowed.

Advanced Motion Vector Prediction (AMVP)Is an example of motion vector prediction including existing implementations. Relevant portions of the existing AMVP implementations are as follows.

The motion vector mvLXA and the availability flag availableFlagLXA are derived by the following sequential steps:

(1) The sample position (xNbA0, yNbA0) is set equal to (xPbb-1, yPb + nPbH), and the sample position (xNbA1, yNbA1) is set equal to (xNbA0, yNbA 0-1).

(7) When availableflagflag lxa equals 0, the following applies for (xNbAk, yNbAk), from (xNbA0, yNbA0) to (xNbA1, yNbA1) or until availableflalag lxa equals 1:

-when availableAk equals TRUE and availableFlagLXA equals 0, the following applies:

If PredFlagLX [ xNbAk ] [ yNbAk ] is equal to 1 and LongTermRefPic (currPic, currPb, refIdxLX, RefPicListX) is equal to LongTermRefPic (currPic, currPb, RefIdxLX [ xNbAk ] [ yNbAk ], RefPicListX), then availableFlagLXA is set to 1 and the following assignment is made:

mvLXA＝MvLX[xNbAk][yNbAk]

refIdxA＝RefIdxLX[xNbAk][yNbAk]

refPicListA＝RefPicListX

Otherwise, availableflag lxa is set to 1 when predflagilly [ xNbAk ] [ yNbAk ] (where Y ═ X) equals 1 and longtermrfpic (currPic, currPb, refIdxLX, RefPicListX) equals longtermrfpic (currPic, currPb, RefIdxLY [ xNbAk ] [ yNbAk ], RefPicListY).

The motion vector mvLXB and the availability flag availableFlagLXB are derived in the following sequence of steps:

(1) Sample positions (xNbB0, yNbB0), (xNbB1, yNbB1) and (xNbB2, yNbB2) were set to (xpB + nPbW, yPb-1), (xpB + nPbW-1, yPb-1) and (xpB-1, yPb-1), respectively.

(5) When isscaledflag lx is equal to 0, availableFlagLXB is set to 0, and the following applies to (xNbBk, yNbBk), from (xNbB0, yNbB0) to (xNbB2, yNbB2) or until availableFlagLXB is equal to 1:

-invoking the availability derivation process of the prediction block specified in clause 6.4.2 with luma position (xCb, yCb), current luma coding block size nCbS, luma position (xPb, yPb), luma prediction block width nbbw, luma prediction block height nbbh, luma position (xNbY, yNbY) set equal to (xnbbbk, ynbbbk), and partition index partIdx as input, and assigning the output to the prediction block availability flag availableBk.

-when availableBk equals TRUE and availableFlagLXB equals 0, the following applies:

If PredFlagLX [ xNbBk ] [ yNbBk ] is equal to 1 and LongTermRefPic (currPic, currPb, refIdxLX, RefPicListX) is equal to LongTermRefPic (currPic, currPb, RefIdxLX [ xNbBk ] [ yNbBk ], RefPicListX), then availableFlagLXB is set equal to 1 and the following allocation is made:

mvLXB＝MvLX[xNbBk][yNbBk]

refIdxB＝RefIdxLX[xNbBk][yNbBk]

refPicListB＝RefPicListX

otherwise, when predflagglly [ xNbBk ] [ yNbBk ] (where Y ═ X) equals 1 and longtermrfpic (currPic, currPbrefIdxLX, RefPicListX) equals longtermrfpic (currPic, currPb, RefIdxLY [ xNbBk ] [ yNbBk ], RefPicListY), availableflagflag lxb is set equal to 1 and the following assignments are made:

mvLXB＝MvLY[xNbBk][yNbBk]。

temporal Motion Vector Prediction (TMVP)is another example of motion vector prediction that includes existing implementations. Relevant portions of the current TMVP implementations are described below.

The variables mvLXCol and availableFlagLXCol are derived as follows:

if LongTermRefPic (currPic, currPb, refIdxLX, LX) is not equal to LongTermRefPic (ColPic, colPb, refIdxClol, listCol), then both components of mvLXClE are set equal to 0 and availableFlagLXClE is set equal to 0.

otherwise, the variable availableFlagLXCol is set to 1, and refPicListCol [ refIdxCol ] is set to a picture with refIdxCol as a reference index in a reference picture list listCol of a slice containing the prediction block colPb among the collocated pictures specified by the ColPic.

2. example embodiments of Intra Block Copy (IBC)

intra Block Copy (IBC) extends the concept of motion compensation from inter coding to intra coding. As shown in fig. 1, when IBC is applied, a current block is predicted by a reference block in the same picture. The samples in the reference block must have been reconstructed before the current block is encoded or decoded. While IBC is not as effective for most camera-captured sequences, it shows significant coding gain for screen content. The reason is that there are many repetitive patterns in the screen content picture, such as icons and text characters. IBC can effectively eliminate redundancy between these overlapping modes.

in HEVC-SCC, if a current picture is selected as a reference picture, an intra Coding Unit (CU) may be used. In this case, the MV is renamed to a Block Vector (BV), and the BV always has integer-pel precision. To be compatible with the main specification HEVC, the current picture is marked as a "long-term" reference picture in the Decoded Picture Buffer (DPB). It should be noted that similarly, in the multiview/3D video coding standard, the inter-view reference picture is also labeled as a "long-term" reference picture.

2.1 embodiments of Picture marking when IBC is enabled

IBC-related semantics in PPS. PPS _ curr _ pic _ ref _ enabled _ flag equal to 1 specifies that the picture referencing the PPS may be included in the reference picture list of the picture itself slice. PPS _ curr _ pic _ ref _ enabled _ flag equal to 0 specifies that the picture referencing the PPS is never included in the reference picture list of the picture slice. When not present, the value of pps _ curr _ pic _ ref _ enabled _ flag is inferred to be equal to 0.

It is a requirement that the value of pps _ curr _ pic _ ref _ enabled _ flag should be equal to 0 when sps _ curr _ pic _ ref _ enabled _ flag is equal to 0.

the variable TwoVersionsOfCurrDecPicFlag is derived as follows:

TwoVersionsOfCurrDecPicFlag＝pps_curr_pic_ref_enabled_flag&&(sample_adaptive_offset_enabled_flag||！pps_deblocking_filter_disabled_flag||deblocking_filter_override_enabled_flag)

The value of TwoVersionOfCurrDecPicFlag should be equal to 0 when sps _ max _ dec _ pic _ buffering _ minus1[ TemporalId ] is equal to 0.

And (5) decoding.After invoking the loop filtering process, the currently decoded picture is stored in an empty picture storage buffer in the DPB, the fullness of the DPB is increased by 1, this picture is marked as "used for short-term reference".

When TwoVersionsOfCurrDecPicFlag is equal to 1, the currently decoded picture before the in-loop filtering process call specified in clause f.8.7[1] is stored in the empty picture storage buffer in the DPB, the fullness of the DPB is increased by 1, this picture is marked as "for long-term reference".

3. Joint Exploration Model (JEM) examples

In some embodiments, reference software named Joint Exploration Model (JEM) is used to explore future video coding techniques. In JEM, sub-block based prediction is used in a variety of coding tools, such as affine prediction, optional temporal motion vector prediction (ATMVP), spatial-temporal motion vector prediction (STMVP), bi-directional optical flow (BIO), frame rate up-conversion (FRUC), Locally Adaptive Motion Vector Resolution (LAMVR), Overlapped Block Motion Compensation (OBMC), Local Illumination Compensation (LIC), and decoder-side motion vector optimization (DMVR).

3.1 example of optional temporal motion vector prediction (ATMVP)

In the ATMVP method, a Temporal Motion Vector Prediction (TMVP) method is modified by acquiring multiple sets of motion information (including motion vectors and reference indices) from blocks smaller than a current CU.

Fig. 2 shows an example of ATMVP motion prediction processing of CU 600. The ATMVP method predicts the motion vector of sub-CU 601 within CU 600 in two steps. The first step is to identify a corresponding block 651 in the reference picture 650 with the time domain vector. The reference picture 650 is also referred to as a motion source picture. The second step is to divide the current CU 600 into sub-CUs 601 and obtain the motion vector and reference index of each sub-CU from the block corresponding to each sub-CU.

In a first step, the reference picture 650 and the corresponding block are determined by motion information of spatially neighboring blocks of the current CU 600. To avoid the repeated scanning process of the neighboring blocks, the first MERGE candidate in the MERGE candidate list of the current CU 600 is used. The first available motion vector and its associated reference index are set as the temporal vector and index to the motion source picture. In this way, the corresponding block can be identified more accurately than the TMVP, where the corresponding block (sometimes referred to as a collocated block) is always located in the lower right corner or center position with respect to the current CU.

In a second step, the corresponding block of sub-CU 651 is identified by the temporal vector in motion source picture 650 by adding the temporal vector to the coordinates of the current CU. For each sub-CU, the motion information of the sub-CU is derived using the motion information of its corresponding block (e.g., the smallest motion grid covering the center sample). After identifying the motion information for the nxn block, it is converted into a motion vector and reference index for the current sub-CU, as in the TMVP method of HEVC, where motion scaling and other processing is applied. For example, the decoder checks whether a low delay condition is met (e.g., POC of all reference pictures of the current picture is less than POC of the current picture) and predicts a motion vector MVy for each sub-CU, possibly using a motion vector MVx (e.g., a motion vector corresponding to reference picture list X) (e.g., X equals 0 or 1 and Y equals 1-X).

4. Example method of IBC in video coding

Fig. 3 shows a flow diagram of an exemplary method for video encoding using intra block copy. The method 300 includes determining whether to encode a current block of a current picture using a motion compensation algorithm at step 310. The method 300 includes, at step 320, encoding the current block by selectively applying intra block copy to the current block based on the determination. More generally, whether to apply intra block copy to a current block is based on whether to encode the current block using a particular motion compensation algorithm.

Fig. 4 illustrates a flow diagram of another example method for video encoding using intra block copy. The method 400 includes, at step 410, determining whether to encode a current block of a current picture using intra block copy. The method 400 includes, at step 420, encoding the current block by selectively applying a motion compensation algorithm to the current block based on the determination. More generally, whether to encode the current block using a motion compensation algorithm is based on whether to encode the current block using intra block copy.

Fig. 5 illustrates a flow diagram of an exemplary method for video decoding using intra block copy. The method 500 includes determining whether to decode a current block of a current picture using a motion compensation algorithm at step 510. The method 500 includes, at step 520, decoding the current block by selectively applying intra block copy to the current block based on the determination. More generally, whether to apply intra block copy to the current block is based on whether to decode the current block using a particular motion compensation algorithm.

fig. 6 illustrates a flow diagram of another exemplary method for video decoding using intra block copy. The method 600 includes, at step 610, determining whether to decode a current block of a current picture using intra block copy. The method 600 includes, at step 620, decoding the current block by selectively applying a motion compensation algorithm to the current block based on the determination. More generally, whether to decode the current block using a motion compensation algorithm is based on whether to decode the current block using intra block copy.

The methods 300, 400, 500, 605, 900 and 1000 described in the context of fig. 3-6 and 9-10 may further include the step of determining whether the motion compensation algorithm is compatible with intra block copy. The compatibility of intra block copy and motion compensation algorithms for different specific motion compensation algorithms is illustrated in the following example.

example 1: it is proposed that the time domain vectors used in the first step of ATMVP cannot be derived from neighboring blocks coded with IBC. In one example, the current picture is taken as its reference pictureneighboring blocks of a slice are marked as "unavailable" or intra-coded in the first step of ATMVP.

Example 2: it is proposed to mark the corresponding block of the sub-CU as "unavailable" or "intra coded" if the corresponding block of the sub-CU is coded using IBC in the second step of ATMVP.

example 3: alternatively, it is proposed that if the corresponding block is encoded using IBC in the second step of ATMVP, the motion information of the corresponding block of the sub-CU is copied to the sub-CU without any scaling. The sub-CU applies IBC with the same MV as the corresponding block, but the reference picture is changed to the current picture.

example 4: more than one ATMVP candidate may be added, one of which may be derived from temporal neighboring blocks using the method described above, and another one of which may be derived from temporal neighboring blocks (if the collocated sub-CUs are IBC coded, at least one sub-CU has a different method of deriving motion information of the sub-CU).

Listed below are some examples of the techniques described in this application, listed in terms of clauses. A tile, as used herein, may be a contiguous or non-contiguous set of pixels, voxels, sub-pixels, and/or sub-voxels. For example, the blocks may be rectilinear, such as a 4x 4 square, a 6x 4 rectangle, or curved, such as an ellipse.

A portion of the visual information used in this application may be a subset of the visual information. The encoded representation used in this application may be a bitstream representing visual information that has been encoded using one of the techniques described in this application. The indication used in this application may be a flag or field in the coded representation or may be a plurality of separate flags or fields.

The decoding techniques used in this application may be applied by a decoder and may be implemented in hardware or software. The decoding technique may undo everything the encoder does in the reverse order. When a suitable decoding technique is applied to the encoded representation, the result is that visual information can be obtained. The initial block of the plurality of blocks used in this application is the block that appears before the first block in the encoded representation. The collocated reference picture may be a reference picture used to encode/decode a block being encoded/decoded.

1. a method (e.g., method 900 depicted in fig. 9) of decoding visual information, comprising:

Determining (902), from an encoded representation, that a decoded block representing a portion of the visual information is encoded using an optional temporal motion vector prediction (ATMVP) encoding technique;

Determining (904) spatial neighboring blocks of the decoded block are encoded using an Intra Block Copy (IBC) encoding technique;

determining (906) that the spatial neighboring block cannot provide a motion vector to derive a temporal vector for the decoded block; and

-decoding (908) the encoded representation by deriving the temporal vector using an ATMVP decoding technique corresponding to the ATMVP coding technique and by avoiding the use of the spatial neighboring blocks to provide the motion vector for the decoded block.

2. a method (e.g., method 1000 depicted in fig. 10) of decoding visual information, comprising:

determining (1002), from an encoded representation, that a decoded block representing a portion of the visual information is encoded using an optional temporal motion vector prediction (ATMVP) encoding technique;

determining (1004) that a collocated block in a reference picture is encoded using an Intra Block Copy (IBC) encoding technique;

Determining (1006) that the collocated block in a reference picture cannot be used to derive a motion vector for a sub-block of the decoded block; and

Decoding (1008) the encoded representation by providing temporal motion vector candidates for the sub-blocks of the decoded block using an ATMVP decoding technique corresponding to the ATMVP encoding technique and by avoiding using the collocated block in the reference picture.

3. the method of any of clauses 1-2, comprising:

Obtaining a plurality of blocks representing an encoding of a first picture from the encoded visual information;

Decoding an initial block of the plurality of blocks; and

In decoding the initial block, a first block of a remainder of the plurality of blocks is decoded based on a result of the decoding.

4. the method of any of clauses 1-3, comprising:

obtaining a plurality of blocks representing an encoding of a second picture from the encoded visual information;

decoding an encoding of the second picture associated with the visual information using the ATMVP decoding technique by performing a two-step process on the encoded decoded block of the plurality of blocks that represents the second picture, the two-step process comprising:

Decoding the plurality of sub-blocks based on a motion vector of the sub-block and a corresponding sub-block in the reference picture of the visual information.

5. the method of clause 4, including:

It is assumed that the spatial neighboring blocks are not encoded using the IBC encoding technique.

6. The method of any of clauses 4-5, including:

A decoding indicator that designates the spatial neighboring block as unavailable or intra-coded for use in the first step of an ATMVP decoding technique.

7. the method of any of clauses 4-6, including:

a decoding indicator designating the corresponding sub-block as unavailable or intra-coded for use in the second step of the ATMVP decoding technique.

8. the method of any of clauses 4-5, including:

a decoding indicator that designates the spatial neighboring block as using an intra coding technique in the first step of an ATMVP decoding technique.

9. The method of any of clauses 4-5, including:

a decoding indicator that designates the corresponding sub-block as using an intra coding technique in the second step of an ATMVP decoding technique.

10. The method of any of clauses 1-9, comprising:

In the case where the corresponding block of the encoded block is encoded using the IBC encoding technique, the motion vector of the corresponding sub-block is copied to a sub-block of the encoded block without scaling the motion vector of the corresponding sub-block.

11. The method of clause 4, including:

Decoding an indicator associated with the corresponding block, the indicator indicating that the corresponding block was encoded using the IBC encoding technique; and decoding the decoded block using the second picture as the reference picture.

12. The method of any of clauses 4-11, including:

Decoding an indicator associated with the corresponding sub-block, the indicator indicating that the corresponding sub-block was encoded using the IBC encoding technique; and decoding the sub-blocks with the corresponding sub-blocks encoded using the IBC encoding technique.

13. The method of clause 4, including:

copying a motion vector of the corresponding sub-block to a sub-block of the decoded block without scaling the motion vector of the corresponding sub-block if the corresponding block of the decoded block is encoded using the IBC encoding technique.

14. the method of clause 13, including:

Changing the reference picture to the second picture including the decoded block.

15. The method of clause 4, including:

Deriving the motion vector for the sub-block of the decoded block based on the motion vector for at least one corresponding sub-block encoded using the IBC encoding technique.

Further aspects and variations of the method of clauses 1-15 are described in the previous section. The decoding method may be implemented by a video decoder, which may include a hardware platform such as described with respect to fig. 7 and 8.

16. a method of encoding visual information, comprising:

17. The method of clause 16, including:

Dividing the first picture into a plurality of blocks;

Encoding an initial block of the plurality of blocks; and

in encoding the initial block, a first block of the plurality of blocks is encoded based on the initial block.

18. The method of any of clauses 16-17, comprising:

In a first step of the two-step process, identifying a reference picture of the visual information, a corresponding block of the reference picture corresponding to the block being encoded, and a temporal vector comprising motion vectors of spatially neighboring blocks, wherein the spatially neighboring blocks are spatial neighbors of the block being encoded;

in a second step of the two-step process, dividing a block to be encoded into a plurality of sub-blocks, identifying a corresponding sub-block for each of the plurality of sub-blocks in the reference picture of the visual information, obtaining a motion vector of the corresponding sub-block, and deriving a motion vector for each of the plurality of sub-blocks based on the motion vector of the corresponding sub-block; and

Encoding the sub-block of the plurality of sub-blocks based on the motion vector of the sub-block and the corresponding sub-block in the reference picture of the visual information.

19. the method of clause 18, including:

The spatial neighboring blocks are required to be not encoded using the IBC technique.

20. The method of clause 18 or 19, comprising:

21. the method of any of clauses 18-20, comprising:

In a case that the reference picture of the visual information is encoded using an IBC technique, encoding an indicator that designates the corresponding sub-block as unavailable or intra-coded for use in the second step of the ATMVP technique.

22. The method of clause 18 or 19, comprising:

23. the method of any of clauses 18-20, comprising:

24. The method of clause 18, including:

25. The method of clause 24, including:

Altering the reference picture to the second picture including the encoded block.

26. The method of any of clauses 18-25, comprising:

deriving the motion vector for the sub-block of the encoded block based on the motion vector for at least one corresponding sub-block encoded using the IBC technique.

Further aspects and variations of the method of clauses 16-26 are described in the previous section. The decoding method may be implemented by a video decoder, which may include a hardware platform such as described with respect to fig. 7 and 8.

27. A video processing apparatus comprising a processor configured to implement the method of one or more of clauses 1-26.

28. a computer readable medium having stored thereon processor executable code which when executed causes a processor to implement the method of one or more of clauses 1-26.

5. Example embodiments of the disclosed technology

FIG. 7 is a block diagram illustrating an example of a structure of a computer system or other control device 700 that may be used to implement various portions of the disclosed technology, including (but not limited to) methods 300, 400, 500, and 600. In fig. 7, computer system 700 includes one or more processors 705 and memory 710 connected by an interconnect 725. Interconnect 725 may represent any one or more separate physical buses, point-to-point connections, or both, connected by appropriate bridges, adapters, or controllers. Thus, interconnect 725 may include, for example, a system bus, a Peripheral Component Interconnect (PCI) bus, a hyper transport or Industry Standard Architecture (ISA) bus, a Small Computer System Interface (SCSI) bus, a Universal Serial Bus (USB), an IIC (I2C) bus, or an Institute of Electrical and Electronics Engineers (IEEE) standard 674 bus (sometimes referred to as a "firewire").

the processor 705 may include a Central Processing Unit (CPU) to control overall operation of, for example, a host. In some embodiments, the processor 1005 accomplishes this by executing software or firmware stored in the memory 710. Processor 1005 may be or include one or more programmable general purpose or special purpose microprocessors, Digital Signal Processors (DSPs), programmable controllers, Application Specific Integrated Circuits (ASICs), Programmable Logic Devices (PLDs), and the like, or a combination of such devices.

The memory 710 may be or include the main memory of the computer system. Memory 710 represents any suitable form of Random Access Memory (RAM), Read Only Memory (ROM), flash memory, etc., or combination of these devices. In use, memory 710 may contain, among other things, a set of machine instructions that, when executed by processor 705, cause processor 705 to perform operations to implement embodiments of the disclosed technology.

Also connected to the processor 705 via an interconnect 725 is an (optional) network adapter 715. Network adapter 715 provides computer system 700 with the ability to communicate with remote devices, such as storage clients and/or other storage servers, and may be, for example, an ethernet adapter or a fibre channel adapter.

Fig. 8 illustrates a block diagram of an example embodiment of a mobile device 800 that may be used to implement various portions of the disclosed technology, including (but not limited to) methods 300, 400, 500, and 600. The mobile device 800 may be a laptop, smartphone, tablet, camera, or other device capable of processing video. The mobile device 800 includes a processor or controller 801 to process data and a memory 802 in communication with the processor 801 to store and/or buffer data. For example, the processor 801 may include a Central Processing Unit (CPU) or a microcontroller unit (MCU). In some implementations, the processor 801 may include a Field Programmable Gate Array (FPGA). In some implementations, the mobile device 800 includes or communicates with a Graphics Processing Unit (GPU), a Video Processing Unit (VPU), and/or a wireless communication unit to implement various visual and/or communication data processing functions of the smartphone device. For example, the memory 802 may include and store processor-executable code that, when executed by the processor 801, configures the mobile device 800 to perform various operations, such as receiving information, commands, and/or data, processing information and data, and transmitting or providing processed information/data to another device, such as an actuator or external display.

To support various functions of the mobile device 800, the memory 802 may store information and data such as instructions, software, values, pictures and other data that are processed or referenced by the processor 801. For example, the storage function of the memory 802 may be implemented using various types of Random Access Memory (RAM) devices, Read Only Memory (ROM) devices, flash memory devices, and other suitable storage media. In some embodiments, the mobile device 800 includes an input/output (I/O) unit 803 to interface the processor 801 and/or the memory 802 with other modules, units, or devices. For example, the I/O unit 803 may interface with the processor 801 and the memory 802 to utilize various wireless interfaces compatible with typical data communication standards, e.g., between one or more computers and user equipment in the cloud. In some implementations, the mobile device 800 may interface with other devices through the I/O unit 803 using a wired connection. The mobile device 800 may also interface with other external interfaces (e.g., data stores) and/or a visual or audio display device 804 to retrieve and transmit data and information that may be processed by a processor, stored in a memory, or displayed on the display device 804 or an output unit of an external device. For example, the display device 804 may display a video frame that includes blocks (CU, PU, or TU) based on whether a motion compensation algorithm is used and intra block copy is applied in accordance with the disclosed techniques.

In some embodiments, a video decoder device may implement a video decoding method in which intra block replication as described herein is used for video decoding. The method may be similar to methods 300, 400, 500, and 600 described above.

In some embodiments, a decoder-side method of video decoding may improve video quality using intra block copy by using a motion compensation algorithm to determine whether a current block of a current picture is to be decoded, and based on the determination, decode the current block by selectively applying intra block copy to the current block.

In other embodiments, a decoder-side method of video decoding may improve video quality using intra block copy by determining whether to decode a current block of a current picture using intra block copy, and based on the determination, decode the current block by selectively applying a motion compensation algorithm to the current block.

In some embodiments, the video decoding method may be implemented using a decoding apparatus implemented on a hardware platform, as described in fig. 7 and 8.

the following is the improvement measured by incorporating IBC into VTM-1.0, which is the reference software for the video coding standard known as Universal video coding (VVC). VTM stands for VVC test model.

In the above table, "Y", "U", "V" stands for color in a YUV color coding system that encodes a color picture or video in consideration of human perception. EncT and DecT denote the ratio of encoding and decoding time using IBC to encoding and decoding time without IBC, respectively. In particular, the present invention relates to a method for producing,

EncT ═ test encoding time (TestEncodingTime)/(anchor encoding time) anchoreencodingtime

DecT ═ test coding time (TestDecodingTime)/(anchor coding time) anchoredecodingtime

various classes (such as a1, a2, etc.) represent a set of standard video sequences for testing the performance of various video coding techniques. The negative percentages under the columns "Y", "U", "V" represent the bit rate savings when IBC is added to VTM-1.0. The percentage over 100% under the EncT and DecT columns indicates how much slower to encode/decode using IBC than without IBC. For example, 150% means that encoding/decoding using IBC is 50% slower than encoding/decoding without IBC. A percentage below 100% indicates how faster the encoding/decoding is using IBC than without IBC. The two classes shown in the table above (class F and SCC) indicate a bit rate saving of over 3%.

From the foregoing, it will be appreciated that specific embodiments of the disclosed technology have been described herein for purposes of illustration, but that various modifications may be made without deviating from the scope of the invention. Accordingly, the disclosed technology is not limited, except as by the appended claims.

The implementation and functional operations of the subject matter described in this patent document can be implemented in various systems, digital electronic circuitry, or computer software, firmware, or hardware, including the structures disclosed in this specification and their structural equivalents, or in combinations of one or more of them. Implementations of the subject matter described in this specification can be implemented as one or more computer program products, i.e., one or more modules of computer program instructions encoded on a tangible and non-transitory computer readable medium for execution by, or to control the operation of, data processing apparatus. The computer readable medium can be a machine-readable storage device, a machine-readable storage substrate, a memory device, a composition of matter effecting a machine-readable propagated signal, or a combination of one or more of them. The term "data processing unit" or "data processing apparatus" includes all apparatus, devices, and machines for processing data, including by way of example a programmable processor, a computer, or multiple processors or groups of computers. The apparatus can include, in addition to hardware, code that creates an execution environment for the computer program, e.g., code that constitutes processor firmware, a protocol stack, a database management system, an operating system, or a combination of one or more of them.

A computer program (also known as a program, software application, script, or code) can be written in any form of programming language, including compiled or interpreted languages, and it can be deployed in any form, including as a stand-alone program or as a module, component, subroutine, or other unit suitable for use in a computing environment. A computer program does not necessarily correspond to a file in a file system. A program can be stored in a portion of a file that holds other programs or data (e.g., one or more scripts stored in a markup language document), in a single file dedicated to the program, or in multiple coordinated files (e.g., files that store one or more modules, sub programs, or portions of code). A computer program can be deployed to be executed on one computer or on multiple computers that are located at one site or distributed across multiple sites and interconnected by a communication network.

The processes and logic flows described in this specification can be performed by one or more programmable processors executing one or more computer programs to perform functions by operating on input data and generating output. The processes and logic flows can also be performed by, and apparatus can also be implemented as, special purpose logic circuitry, e.g., an FPGA (field programmable gate array) or an ASIC (application-specific integrated circuit).

Processors suitable for the execution of a computer program include, by way of example, both general and special purpose microprocessors, and any one or more of any kind of digital computer. Generally, a processor will receive instructions and data from a read-only memory or a random access memory or both. The essential elements of a computer are a processor for executing instructions and one or more memory devices for storing instructions and data. Generally, a computer will also include, or be operatively coupled to receive data from or transfer data to, or both, one or more mass storage devices for storing data, e.g., magnetic, magneto-optical disks, or optical disks. However, a computer does not necessarily have such a device. Computer-readable media suitable for storing computer program instructions and data include all forms of non-volatile memory, media and memory devices, including by way of example semiconductor memory devices, e.g., EPROM, EEPROM, and flash memory devices. The processor and the memory can be supplemented by, or incorporated in, special purpose logic circuitry.

the specification and drawings are to be regarded in an illustrative manner, with an exemplary meaning being exemplary. As used herein, the singular forms "a," "an," and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise. Moreover, use of "or" is intended to include "and/or" unless the context clearly indicates otherwise.

while this patent document contains many specifics, these should not be construed as limitations on the scope of any invention or of what may be claimed, but rather as descriptions of features specific to particular embodiments of particular inventions. Certain features that are described in this patent document in the context of separate embodiments can also be implemented in combination in a single embodiment. Conversely, various functions described in the context of a single embodiment can also be implemented in multiple embodiments separately or in any suitable subcombination. Furthermore, although features may be described above as acting in certain combinations and even initially claimed as such, one or more features from a claim combination can in some cases be excised from the combination, and the claimed combination may be directed to a subcombination or variation of a subcombination.

Also, while operations are depicted in the drawings in a particular order, this should not be understood as requiring that such operations be performed in the particular order shown or in sequential order, or that all illustrated operations be performed, to achieve desirable results. Moreover, the separation of various system components in the embodiments described herein should not be understood as requiring such separation in all embodiments.

Only a few implementations and examples have been described and other implementations, enhancements and variations can be made based on what is described and illustrated in this patent document.

33页详细技术资料下载

上一篇：一种医用注射器针头装配设备

下一篇：帧内块复制与帧间编码工具的交互

interaction of intra block copy with optional temporal motion vector prediction

相关技术

网友询问留言