Content-aware point cloud compression method and apparatus using HEVC tiles

Document No.: 639640    Publication date: 2021-05-11

Note: This technology, "Content-aware point cloud compression method and apparatus using HEVC tiles", was designed and created by Arash Vosoughi, Sehoon Yea, and Shan Liu on 2019-09-27. The main content includes: A method includes receiving a data cloud containing a plurality of data points. The method also includes identifying each data point that includes a region of interest (ROI), and dividing the data cloud into an ROI cloud and one or more non-ROI clouds. The method includes performing a patch generation process on the ROI cloud, the patch generation process including generating an ROI patch from each data point that includes the ROI. The method includes performing a patch packing process on the ROI cloud, the patch packing process including: (i) mapping each ROI patch to a two-dimensional (2D) map, (ii) determining whether at least two ROI patches of the plurality of ROI patches are located in more than one tile of the map, and (iii) in response to determining that the at least two ROI patches are located in more than one tile, moving each of the ROI patches into one tile.

1. A method performed by a video encoder, comprising:

receiving a data cloud comprising a plurality of data points representing a three-dimensional (3D) space;

identifying each data point comprising a region of interest (ROI) associated with the data cloud;

dividing the data cloud into an ROI cloud and one or more non-ROI clouds, the ROI cloud comprising data points that each include the ROI;

performing a patch generation process on the ROI cloud, the patch generation process comprising generating an ROI patch from each data point comprising the ROI; and

performing a patch packing process on the ROI cloud, the patch packing process comprising:

(i) mapping each ROI patch to a two-dimensional (2D) map, the 2D map comprising a plurality of tiles arranged as a grid in the 2D map,

(ii) determining whether at least two ROI patches of the plurality of ROI patches are located in more than one tile, and

(iii) in response to determining that the at least two ROI patches are located in more than one tile, moving each of the ROI patches from the plurality of tiles into one tile.

2. The method of claim 1, further comprising:

performing a patch generation process on each non-ROI cloud, the patch generation process comprising creating a non-ROI patch for each data point that does not include an ROI, and

performing a patch packing process on each non-ROI cloud, the patch packing process comprising mapping each of the non-ROI patches to one or more empty spaces in the two-dimensional map that do not include an ROI patch.

3. The method of claim 1, wherein the patch packing process for the ROI cloud and the patch packing process for each non-ROI cloud are performed in parallel.

4. The method of claim 2, further comprising:

compressing the tile containing each of the ROI patches according to a first compression rate; and

compressing respective tiles of the plurality of tiles that do not contain an ROI patch according to a second compression rate that is higher than the first compression rate.

5. The method of claim 1, further comprising:

determining whether the ROI is larger than each tile included in the 2D map; and

in response to determining that the ROI is larger than each tile included in the 2D map, partitioning the ROI cloud into one or more sub-ROI clouds,

wherein the patch generation process and the patch packing process are performed on each of the one or more sub-ROI clouds.

6. The method of claim 5, wherein the patch packing process is performed in parallel on the one or more sub-ROI clouds.

7. The method of claim 1, further comprising:

determining whether the video encoder specifies a size of each tile in the 2D map; and

in response to determining that the video encoder does not specify a size of each tile in the 2D map, setting a height of the tile including the ROI patches such that the ROI patches are bounded by the tile including the ROI patches.

8. The method of claim 7, further comprising:

in response to determining that the video encoder does not specify a size of each tile in the 2D map, setting a width of the tile including the ROI patches such that the ROI patches are bounded by the tile including the ROI patches.

9. The method of claim 2, wherein the data cloud includes a plurality of ROIs, the data cloud is divided into a plurality of ROI clouds, each ROI cloud corresponds to a respective ROI, and the patch generation process and the patch packing process are performed on each ROI cloud.

10. The method of claim 9, wherein the patch packing process performed on each ROI cloud causes each ROI to be mapped to a different tile in the 2D map.

11. A video encoder, comprising:

a processing circuit configured to:

receive a data cloud comprising a plurality of data points representing a three-dimensional (3D) space;

identify each data point comprising a region of interest (ROI) associated with the data cloud;

divide the data cloud into an ROI cloud and one or more non-ROI clouds, the ROI cloud comprising data points that each include an ROI;

perform a patch generation process on the ROI cloud, the patch generation process comprising generating an ROI patch from each data point comprising the ROI; and

perform a patch packing process on the ROI cloud, the patch packing process comprising:

(i) mapping each ROI patch to a two-dimensional (2D) map, the 2D map comprising a plurality of tiles arranged as a grid in the 2D map,

(ii) determining whether at least two ROI patches of the plurality of ROI patches are located in more than one tile, and

(iii) in response to determining that the at least two ROI patches are located in more than one tile, moving each of the ROI patches from the plurality of tiles into one tile.

12. The video encoder of claim 11, wherein the processing circuit is further configured to:

perform a patch generation process on each non-ROI cloud, the patch generation process comprising creating a non-ROI patch for each data point that does not include an ROI, and

perform a patch packing process on each non-ROI cloud, the patch packing process comprising mapping each of the non-ROI patches to one or more empty spaces in the two-dimensional map that do not include an ROI patch.

13. The video encoder of claim 11, wherein the patch packing process for the ROI cloud and the patch packing process for each non-ROI cloud are performed in parallel.

14. The video encoder of claim 12, wherein the processing circuit is further configured to:

compress the tile containing each of the ROI patches according to a first compression rate; and

compress respective tiles of the plurality of tiles that do not contain an ROI patch according to a second compression rate that is higher than the first compression rate.

15. The video encoder of claim 11, wherein the processing circuit is further configured to:

determine whether the ROI is larger than each tile included in the 2D map; and

in response to determining that the ROI is larger than each tile included in the 2D map, partition the ROI cloud into one or more sub-ROI clouds,

wherein the patch generation process and the patch packing process are performed on each of the one or more sub-ROI clouds.

16. The video encoder of claim 15, wherein the patch packing process is performed in parallel on the one or more sub-ROI clouds.

17. The video encoder of claim 11, wherein the processing circuit is further configured to:

determine whether the video encoder specifies a size of each tile in the 2D map; and

in response to determining that the video encoder does not specify a size of each tile in the 2D map, set a height of the tile including the ROI patches such that the ROI patches are bounded by the tile including the ROI patches.

18. The video encoder of claim 17, wherein the processing circuit is further configured to:

in response to determining that the video encoder does not specify a size of each tile in the 2D map, set a width of the tile including the ROI patches such that the ROI patches are bounded by the tile including the ROI patches.

19. The video encoder of claim 12, wherein the data cloud comprises a plurality of ROIs, the data cloud is divided into a plurality of ROI clouds, each ROI cloud corresponds to a respective ROI, and the patch generation process and the patch packing process are performed on each ROI cloud.

20. A non-transitory computer-readable medium storing instructions that, when executed by a processor in a video encoder, cause the processor to perform a method comprising:

receiving a data cloud comprising a plurality of data points representing a three-dimensional (3D) space;

identifying each data point comprising a region of interest (ROI) associated with the data cloud;

dividing the data cloud into an ROI cloud and one or more non-ROI clouds, the ROI cloud comprising data points that each include the ROI;

performing a patch generation process on the ROI cloud, the patch generation process comprising generating an ROI patch from each data point comprising the ROI; and

performing a patch packing process on the ROI cloud, the patch packing process comprising:

(i) mapping each ROI patch to a two-dimensional (2D) map, the 2D map comprising a plurality of tiles arranged as a grid in the 2D map,

(ii) determining whether at least two ROI patches of the plurality of ROI patches are located in more than one tile, and

(iii) in response to determining that the at least two ROI patches are located in more than one tile, moving each of the ROI patches from the plurality of tiles into one tile.

Technical Field

The present disclosure describes embodiments that relate generally to video encoding.

Background

The background description provided herein is for the purpose of generally presenting the context of the disclosure. Work of the presently named inventors, to the extent it is described in this background section, as well as aspects of the description that may not otherwise qualify as prior art at the time of filing, are neither expressly nor impliedly admitted as prior art against the present disclosure.

The three-dimensional (3D) representation of the world enables a more immersive form of interaction and communication, and also enables machines to understand, interpret, and explore the world. Point clouds have become one such form of 3D-enabled representation. The Moving Picture Experts Group (MPEG) has identified many use cases related to point cloud data and placed corresponding requirements on point cloud representation and compression.

Disclosure of Invention

According to an exemplary embodiment, a method performed by a video encoder includes receiving a data cloud comprising a plurality of data points representing a 3D space. The method also includes identifying each data point including a region of interest (ROI) associated with the data cloud. The method also includes partitioning the data cloud into an ROI cloud and one or more non-ROI clouds, the ROI cloud including each data point containing the ROI. The method also includes performing a patch generation process on the ROI cloud, the patch generation process including generating an ROI patch from each data point that includes the ROI. The method also includes performing a patch packing process on the ROI cloud, the patch packing process including: (i) mapping each ROI patch to a two-dimensional (2D) map, the 2D map comprising a plurality of tiles arranged as a grid in the 2D map, (ii) determining whether at least two of the plurality of ROI patches are located in more than one tile, and (iii) moving each of the ROI patches from the plurality of tiles to one tile in response to determining that the at least two ROI patches are located in more than one tile.

According to an exemplary embodiment, a video encoder includes processing circuitry configured to receive a data cloud including a plurality of data points representing a 3D space. The processing circuitry is further configured to identify each data point including a region of interest (ROI) associated with the data cloud. The processing circuitry is further configured to partition the data cloud into an ROI cloud and one or more non-ROI clouds, the ROI cloud including each data point containing an ROI. The processing circuitry is further configured to perform a patch generation process on the ROI cloud, the patch generation process including generating an ROI patch from each data point that includes the ROI. The processing circuitry is further configured to perform a patch packing process on the ROI cloud, the patch packing process comprising: (i) mapping each ROI patch to a two-dimensional (2D) map, the 2D map comprising a plurality of tiles arranged as a grid in the 2D map, (ii) determining whether at least two of the plurality of ROI patches are located in more than one tile, and (iii) moving each of the ROI patches from the plurality of tiles to one tile in response to determining that the at least two ROI patches are located in more than one tile.

According to an exemplary embodiment, a non-transitory computer readable medium is provided having instructions stored thereon which, when executed by a processor in a video encoder, cause the processor to perform a method. The method includes receiving a data cloud comprising a plurality of data points representing a 3D space. The method also includes identifying each data point including a region of interest (ROI) associated with the data cloud. The method also includes partitioning the data cloud into an ROI cloud and one or more non-ROI clouds, the ROI cloud including each data point containing the ROI. The method also includes performing a patch generation process on the ROI cloud, the patch generation process including generating an ROI patch from each data point that includes the ROI. The method also includes performing a patch packing process on the ROI cloud, the patch packing process including: (i) mapping each ROI patch to a two-dimensional (2D) map, the 2D map comprising a plurality of tiles arranged as a grid in the 2D map, (ii) determining whether at least two of the plurality of ROI patches are located in more than one tile, and (iii) moving each of the ROI patches from the plurality of tiles to one tile in response to determining that the at least two ROI patches are located in more than one tile.

Drawings

Other features, properties, and various advantages of the disclosed subject matter will become more apparent from the following detailed description and the accompanying drawings, in which:

FIG. 1A illustrates an exemplary point cloud.

FIG. 1B illustrates a recursive subdivision process according to one embodiment.

Fig. 2 shows an exemplary video codec according to an embodiment.

Fig. 3 shows an example of mapping patches onto a 2D grid.

Fig. 4A and 4B illustrate an exemplary patch packing according to an embodiment of the present disclosure.

Fig. 5A and 5B illustrate an exemplary patch packing according to an embodiment of the present disclosure.

Fig. 6A and 6B illustrate an exemplary patch packing according to an embodiment of the present disclosure.

FIG. 7 shows an example 3D ROI according to one embodiment.

Fig. 8A and 8B illustrate an exemplary patch packing according to an embodiment of the present disclosure.

FIG. 9 illustrates an exemplary patch packing order according to one embodiment.

Fig. 10 illustrates an exemplary process performed by a video codec according to one embodiment.

Fig. 11 shows an exemplary process performed by a video codec according to one embodiment.

Fig. 12 shows an exemplary process performed by a video codec according to one embodiment.

Fig. 13 shows an exemplary process performed by a video codec according to one embodiment.

Fig. 14A and 14B illustrate an exemplary patch packing with multiple ROIs according to an embodiment of the disclosure.

FIG. 15 is a schematic diagram of a computer system, according to one embodiment.

Detailed Description

Point cloud data is used to represent three-dimensional (3D) scenes or objects in some emerging applications, such as the following: immersive Virtual Reality (VR)/Augmented Reality (AR)/Mixed Reality (MR), automotive/robotic navigation, medical imaging, and so forth. The point cloud comprises a collection of individual 3D points. Each point is associated with a set of 3D coordinates indicating the 3D location of the point and a number of other attributes, such as color, surface normal, opacity, reflectivity, etc. In various embodiments, the input point cloud data may be quantized and then sorted into a 3D voxel grid that may be described using an octree data structure. The resulting voxelized octree facilitates traversal, search, and access to the quantized point cloud data.

A point cloud is a set of points in 3D space, each point having associated attributes, such as color, material attributes, and the like. FIG. 1A shows an exemplary point cloud having points P0 through P8. The point cloud may be used to reconstruct an object or scene as a combination of these points. Multiple cameras and depth sensors may be used in various settings to capture the point cloud, and the point cloud may be composed of thousands of points to truly represent the reconstructed scene.

Compression techniques are needed to reduce the amount of data required to represent a point cloud. Techniques for lossy compression of point clouds are therefore needed for real-time communication and six-degree-of-freedom (6DoF) virtual reality. In addition, techniques for lossless point cloud compression are sought in the context of dynamic mapping for autonomous driving and cultural heritage applications, among others. Furthermore, standards are needed to address compression of geometry and attributes (e.g., color and reflectivity), scalable/progressive coding, coding of sequences of point clouds captured over time, and random access to subsets of a point cloud.

Fig. 1B shows an example of the 2D occupancy map 110. The occupancy map may be a binary 2D image, where 1 and 0 represent occupied and unoccupied pixels, respectively. Backprojection may be used to reconstruct the point cloud using the 2D occupancy map 110 and the geometric video.

According to some embodiments, a video codec compresses the geometry, motion, and texture information of a dynamic point cloud as three separate video sequences. The additional metadata needed to interpret the three video sequences (i.e., the occupancy map and the auxiliary patch information) is compressed separately. The metadata typically represents a small fraction of the overall bitstream and can be efficiently encoded/decoded using a software implementation. The bulk of the information is processed by the video codec.

Fig. 2 shows an embodiment of a video codec 200. A point cloud frame is input into the patch generation unit 202, which generates patches from the point cloud frame. After patch generation, the packing unit 204 receives the output of the patch generation unit 202 and performs the packing process on the patches. The output of the packing unit 204 is fed to a texture image generation unit 208. The texture image generation unit 208 receives the smoothed geometry from the smoothing unit 210 and outputs the texture image to the image padding unit 212. For example, the geometry is first reconstructed using the decompressed geometry video and the decompressed occupancy map, and geometric smoothing is applied to the resulting cloud to mitigate distortion caused by video compression artifacts at patch boundaries. The geometry image generation unit 206 receives the point cloud frame, the patch information, and the output of the packing unit 204 as inputs. The patch information may include, for example, the patch origin and its offset from the image origin, the patch size, and the like. The geometry image generation unit 206 outputs the geometry image to the image padding unit 212. The image padding unit 212 also receives the occupancy map. The image padding unit 212 outputs the padded geometry image and the padded texture image to the video compression unit 218. The video compression unit 218 outputs the compressed geometry video and the compressed texture video to the multiplexer 220. The video compression unit 218 also feeds the reconstructed geometry image back to the smoothing unit 210. The occupancy map compression unit 214 receives the patch information and outputs a compressed occupancy map to the multiplexer 220. The auxiliary patch information compression unit 216 receives the patch information and outputs the compressed auxiliary patch information to the multiplexer 220. The multiplexer 220 outputs the compressed bitstream.

According to some embodiments, the patch generation process decomposes the point cloud into a minimum number of patches with smooth boundaries, while also minimizing reconstruction errors. The encoder may implement various methods to generate this type of decomposition.

According to some embodiments, the packing process maps the patches onto a 2D grid, as shown in fig. 3, while minimizing the unused space and ensuring that every M × M (e.g., 16 × 16) block of the grid is associated with a unique patch. M may be an encoder-defined parameter that is encoded in the bitstream and sent to the decoder. In fig. 3, the patches can be easily distinguished from the background. In some examples, the occupancy map is a binary image with exactly the same size as the image in fig. 3, where any pixel belonging to a patch is set to 1 and any pixel belonging to the background is set to 0. The aforementioned occupancy map is a full-resolution map. For lossy compression, however, the full-resolution occupancy map may be downsampled and then compressed; at the decoder, the occupancy map is decompressed and upsampled back to the original full resolution. For lossless compression, the occupancy map is encoded at the original full resolution.

Some models use a packing strategy that iteratively attempts to insert patches into a w × h grid, where w and h are user-defined parameters corresponding to the resolution of the geometry/texture/motion video images to be encoded. The patch position is determined by an exhaustive search performed in raster scan order. The first position that guarantees an overlap-free insertion of the patch is selected, and the grid cells covered by the patch are marked as used. If there is no empty space in the current image resolution that can accommodate the patch, the height h of the grid is temporarily doubled and the search is performed again. At the end of the process, h is clipped to fit the grid cells that are actually used. For a video sequence, a process that determines w and h for the entire group of pictures (GOP) may be used.
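As an illustration of this iterative packing strategy, the following Python sketch places patches one by one in raster-scan order on an occupancy grid of M × M blocks, doubling the grid height when a patch does not fit. The patch representation and helper names are illustrative assumptions made for this sketch, not the reference V-PCC implementation.

```python
import numpy as np

def pack_patches(patch_sizes, w, h, block=16):
    """Greedy raster-scan packing of patches, each given as (width, height)
    in units of `block` x `block` cells, onto a w x h pixel grid.
    Returns the chosen (x, y) block position of each patch and the clipped
    grid height.  Patches are assumed pre-sorted by size and no wider than
    the grid."""
    grid_w, grid_h = w // block, h // block
    used = np.zeros((grid_h, grid_w), dtype=bool)   # occupancy of the block grid
    positions = []
    for pw, ph in patch_sizes:
        placed = False
        while not placed:
            # exhaustive search in raster-scan order for the first free spot
            for y in range(grid_h - ph + 1):
                for x in range(grid_w - pw + 1):
                    if not used[y:y + ph, x:x + pw].any():
                        used[y:y + ph, x:x + pw] = True
                        positions.append((x, y))
                        placed = True
                        break
                if placed:
                    break
            if not placed:                           # no room: temporarily double the height
                used = np.vstack([used, np.zeros_like(used)])
                grid_h *= 2
    # clip the height down to the last occupied block row
    occupied_rows = np.nonzero(used.any(axis=1))[0]
    final_h = (int(occupied_rows.max()) + 1) * block if occupied_rows.size else 0
    return positions, final_h
```

For example, pack_patches([(4, 3), (2, 2)], w=64, h=64) returns the block positions of the two patches together with the clipped height of the resulting grid.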

Furthermore, some models segment the input cloud into several patches and pack the patches into two 2D images (geometry and attributes), which are then compressed using High Efficiency Video Coding (HEVC). Given the specification of an ROI in the form of a 3D bounding box, a content-aware point cloud compression system is needed to implement the following functions:

1. the ROI is encoded with higher quality than other portions of the point cloud.

2. The ROI is encoded independently of the rest of the point cloud to facilitate spatial random access without full decoding.

3. Independent encoding of ROIs needs to be coordinated with any system requirements regarding independent (parallel) encoding/decoding.

4. Multiple ROIs need to be supported.

Embodiments of the present disclosure use HEVC tiles to enable content-aware point cloud compression that provides the above desirable features. A significant advantage of the disclosed embodiments is that only one instance of the compression model needs to run, without enlarging the size of the packed image.

According to some embodiments, content-aware coding is achieved by using HEVC tiles, where a given 3D ROI is projected into one tile (or a few tiles) and that tile is coded with higher quality. One problem with current compression models is that there is no guarantee that the 3D ROI is projected to neighboring 2D locations. In this regard, while it is possible to fit the ROI completely into a single tile, the projection of a given ROI may span multiple tiles. In a content-aware video compression system, when this occurs, a lower quantization parameter (QP) must be selected for more than one tile, which can lead to performance degradation given a limited bitrate budget.

In some embodiments, to use HEVC tiles for efficient content-aware point cloud compression, a given 3D ROI is fit into as few HEVC tiles as possible. A tile map (e.g., a map of 2 × 2 tiles) may be provided with priorities, and the ROI may be fit into the one or more tiles having higher priority. After the patches are sorted by size, they can be packed into the 2D grid.

In some embodiments, the patches that intersect a given ROI specified by a 3D bounding box are found. This results in two sets of patches: patches that intersect the ROI (i.e., ROI patches) and patches that do not intersect the ROI (i.e., non-ROI patches). The ROI patches and the non-ROI patches may be packed according to patch size. For example, a first ROI patch that is larger than a second ROI patch may be given a higher priority and therefore be packed before the second ROI patch, to use the available space more efficiently. A condition may be specified that the 2D bounding box of an ROI patch does not extend into neighboring tiles.
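A minimal sketch of this classification step is shown below, assuming each patch carries the 3D bounding box of the points it covers and its projected 2D size (an illustrative representation, not the codec's actual data structures). Patches whose boxes intersect the ROI box form the ROI set, and each set is sorted so that larger patches are packed first.

```python
def boxes_intersect(a, b):
    """Axis-aligned 3D box intersection test; boxes are (min_xyz, max_xyz) tuples."""
    (amin, amax), (bmin, bmax) = a, b
    return all(amin[i] <= bmax[i] and bmin[i] <= amax[i] for i in range(3))

def split_and_sort_patches(patches, roi_box):
    """Split patches into ROI / non-ROI sets and sort each by 2D area, largest first."""
    roi_patches = [p for p in patches if boxes_intersect(p["bbox3d"], roi_box)]
    non_roi_patches = [p for p in patches if not boxes_intersect(p["bbox3d"], roi_box)]
    by_size = lambda p: p["width"] * p["height"]   # 2D footprint of the projected patch
    return (sorted(roi_patches, key=by_size, reverse=True),
            sorted(non_roi_patches, key=by_size, reverse=True))
```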

FIG. 4A shows an example patch packing for the "soldier" point cloud. The boundaries of each tile are indicated by dashed lines, and each patch is bounded by a 2D bounding box drawn with solid black lines. Here, the ROI is selected as the soldier's head. As shown in fig. 4A, there are three ROI patches, each bounded by a 2D box, and the three ROI patches span three different tiles. Fig. 5A and 6A show other examples in which ROI patches span more than one tile after packing.

According to some embodiments, the ROI and the remaining points are each treated as a separate point cloud. For example, the point cloud is divided into an ROI cloud and a non-ROI cloud. The ROI cloud includes the points of the point cloud that are contained in the ROI, and the non-ROI cloud includes the points that are not contained in the ROI. A patch generation process is performed on the ROI cloud and the non-ROI cloud to generate ROI patches and non-ROI patches, respectively. After the patch generation process, the ROI patches are fit into as few tiles as possible. After the ROI patches are mapped, the non-ROI patches are mapped to the 2D grid. Fig. 4B, 5B, and 6B illustrate the resulting 2D grids after performing the packing process on the ROI patches and non-ROI patches generated from the separate ROI and non-ROI clouds, respectively. As shown in fig. 4B, 5B, and 6B, grouping (i.e., placing) the ROI patches together into a single tile results in more efficient compression, since the tiles that do not contain ROI patches can be compressed at a lower rate (lower quality), thereby saving more bits for the tile that contains the ROI patches.

According to some embodiments, the point cloud includes a single ROI and has a tile map pre-specified by system requirements. The V-PCC anchor patch generation process and packing scheme will typically project a given 3D ROI over the entire 2D grid of the projected image. This behavior of the anchor generation and packing does not make efficient use of HEVC tiles. For example, fig. 7 shows a point cloud in which the ROI is shown in light gray (i.e., the region within box 700). Fig. 8A shows the resulting packed image when the ROI cloud is not separated from the non-ROI cloud, and fig. 8B shows the resulting packed image when the ROI cloud is separated from the non-ROI cloud. As shown in fig. 8A, the projected ROI is spread over the entire 2D grid, whereas in fig. 8B the projected ROI is placed in a single tile.

In some embodiments, the point cloud is divided into an ROI cloud and a non-ROI cloud. The ROI cloud includes all ROI points of the point cloud, and the non-ROI cloud includes all non-ROI points. Patches for the ROI cloud and the non-ROI cloud may be generated independently, which provides a significant advantage: it guarantees that any single patch belongs either entirely to the ROI cloud or entirely to the non-ROI cloud. Thus, when the patch generation process is performed independently on the ROI cloud and the non-ROI cloud, two sets of patches are produced: (1) a set of ROI patches, in which all points of any ROI patch belong to the ROI cloud, and (2) a set of non-ROI patches, in which all points of any non-ROI patch belong to the non-ROI cloud.
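The cloud-level split described above amounts to a point-in-box test, sketched below; the array layout (an N × 3 coordinate array) is an assumption made for illustration.

```python
import numpy as np

def split_cloud(points, roi_min, roi_max):
    """Split an (N, 3) array of point coordinates into an ROI cloud and a
    non-ROI cloud using the ROI's axis-aligned 3D bounding box."""
    points = np.asarray(points, dtype=float)
    inside = np.all((points >= np.asarray(roi_min)) &
                    (points <= np.asarray(roi_max)), axis=1)
    return points[inside], points[~inside]   # (ROI cloud, non-ROI cloud)
```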

Table 1 shows an example patch syntax.

TABLE 1

roi_patch_metadata_enabled_flag indicates whether ROI patch metadata is enabled. This flag is used to indicate that the point cloud has been separated into separate ROI and non-ROI clouds.

roi_patch_metadata_present_flag indicates whether any ROI(s) are present.

number_of_roi_patches[r] indicates the number of patches belonging to the r-th ROI. The value of number_of_roi_patches[r] should be in the range of 1 to 2^32 - 1, inclusive.
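A hedged sketch of how an encoder might carry these three syntax elements alongside the patch data is shown below; the container class and its defaults are illustrative assumptions, not the normative bitstream syntax.

```python
from dataclasses import dataclass, field
from typing import List

@dataclass
class RoiPatchMetadata:
    """Illustrative container mirroring the syntax elements described above."""
    roi_patch_metadata_enabled_flag: bool = False   # ROI/non-ROI separation in use
    roi_patch_metadata_present_flag: bool = False   # at least one ROI is present
    number_of_roi_patches: List[int] = field(default_factory=list)  # one count per ROI

    def validate(self):
        for r, n in enumerate(self.number_of_roi_patches):
            # each count must lie in [1, 2^32 - 1] per the constraint above
            assert 1 <= n <= 2**32 - 1, f"number_of_roi_patches[{r}] out of range"
```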

According to some embodiments, the ROI patches are packed into as few tiles as possible based on the system-specified tile map. An exemplary tile map may have a size of W × H, i.e., H rows and W columns of tiles. The H row heights may be set as tileRowHeightArray = {h_1, h_2, ..., h_H} and the W column widths as tileColumnWidthArray = {w_1, w_2, ..., w_W} (see fig. 9). FIG. 9 shows the proposed patch packing order (indicated by the dashed lines) when the system specifies a tile map; the example in fig. 9 is a 4 × 6 tile map, where W is 4 and H is 6. When the point cloud has multiple ROIs, the patch packing order may remain unchanged.
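The tile geometry implied by these two arrays can be sketched as follows; the rectangle representation is an assumption made for illustration, and the order in which the tiles are visited is whatever order the dashed lines in fig. 9 prescribe.

```python
from itertools import accumulate

def build_tile_map(tile_row_heights, tile_col_widths):
    """Return a dict mapping (row, col) to the pixel rectangle
    (x, y, width, height) of each tile in an H x W tile map."""
    y_offsets = [0] + list(accumulate(tile_row_heights))
    x_offsets = [0] + list(accumulate(tile_col_widths))
    tiles = {}
    for r, h in enumerate(tile_row_heights):
        for c, w in enumerate(tile_col_widths):
            tiles[(r, c)] = (x_offsets[c], y_offsets[r], w, h)
    return tiles

# Example: a 4 x 6 tile map as in fig. 9 (W = 4 columns, H = 6 rows),
# here with uniform 256-pixel rows and columns purely as placeholder values.
tile_map = build_tile_map([256] * 6, [256] * 4)
```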

Based on the tile map, the ROI patches and the non-ROI patches may be packed onto the tile map according to the process shown in fig. 10, which may be performed by the video codec 200. The process may begin at step S1000, in which an input cloud having p points and an ROI specified by a 3D bounding box are given. The point cloud of p points may be divided into an ROI cloud and a non-ROI cloud. The process proceeds to step S1002, in which a plurality of ROI patches are generated. The ROI patches may be generated according to the process shown in Table 1. The process proceeds to step S1004, in which a plurality of non-ROI patches are generated. In this regard, steps S1002 and S1004 result in separate generation of the ROI patches and the non-ROI patches.

The process proceeds from step S1004 to step S1006, in which the variable k is set to 0. The process proceeds to step S1008, in which the variable k is incremented by 1. The process proceeds to step S1010, in which the k-th ROI patch is packed in the order shown in fig. 9. The process proceeds to step S1012, in which it is determined whether k is equal to the number of ROI patches. If k is not equal to the number of ROI patches, the process returns from step S1012 to step S1008. Thus, steps S1006 to S1012 result in the packing of each ROI patch.

If k is equal to the number of ROI patches, the process proceeds to step S1014, in which k is set to 0. The process proceeds to step S1016, in which k is incremented by 1. The process proceeds to step S1018, in which the k-th non-ROI patch is packed into an empty space. The process proceeds to step S1020, in which it is determined whether k is equal to the number of non-ROI patches. If k is not equal to the number of non-ROI patches, the process returns from step S1020 to step S1016. Thus, steps S1014 to S1020 result in the packing of each non-ROI patch. If k is equal to the number of non-ROI patches, the process shown in FIG. 10 ends. Although the process illustrated in fig. 10 is described for a single ROI, when the point cloud has a plurality of ROIs, steps S1002 to S1012 may be performed for each ROI included in the point cloud.
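A compact structural outline of the flow of fig. 10 is sketched below. It reuses the split_cloud helper sketched earlier and assumes hypothetical generate_patches, place_in_roi_tiles, and place_in_empty_space hooks standing in for the patch generation and placement routines; it is an outline of steps S1000 to S1020, not the reference encoder.

```python
def pack_point_cloud(points, roi_box,
                     generate_patches, place_in_roi_tiles, place_in_empty_space):
    """Outline of fig. 10: split the cloud, generate ROI and non-ROI patches
    separately, pack every ROI patch first in the tile order of fig. 9, and
    only then fill the remaining empty space with non-ROI patches."""
    roi_cloud, non_roi_cloud = split_cloud(points, *roi_box)   # S1000
    roi_patches = generate_patches(roi_cloud)                  # S1002
    non_roi_patches = generate_patches(non_roi_cloud)          # S1004
    for patch in roi_patches:                                  # S1006 to S1012
        place_in_roi_tiles(patch)        # pack the k-th ROI patch in fig. 9 order
    for patch in non_roi_patches:                              # S1014 to S1020
        place_in_empty_space(patch)      # pack the k-th non-ROI patch into empty space
```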

According to some embodiments, the ROI cloud is divided into several smaller sub-clouds, a process that may be referred to as chunking. For example, when the ROI cloud is determined to be larger than a tile, the ROI cloud is divided into smaller chunks. In some embodiments, the patch generation process is performed independently on each chunk. Chunking produces smaller ROI patches, which improves the packing so that more of the space in the 2D grid is filled by the projected ROI cloud.

In some embodiments, the ROI cloud is divided into chunks by finding an eigenvector of the ROI cloud and performing the chunking along the axis corresponding to the eigenvector. In another embodiment, the chunking is performed by finding a bounding box of the ROI points in the ROI cloud and finding the longest axis of the bounding box, along which the chunking may be performed. For example, referring to fig. 1B, the chunking may be performed in a direction along the longest axis of the bounding box. In another embodiment, the chunking may be performed along one, two, or all three axes of the bounding box. The chunking may be performed uniformly or non-uniformly based on a criterion such as the local density of points. For example, regions of the ROI cloud with higher point density may be divided into smaller chunks than regions of the ROI cloud with lower point density. Furthermore, one or more of the above embodiments may be combined; for example, the chunking may be performed uniformly or non-uniformly along the longest axis of the bounding box based on certain criteria.
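The bounding-box variant of this chunking can be sketched as follows; uniform splitting along the longest axis is shown, with the number of chunks chosen by the caller (the names and array layout are illustrative assumptions).

```python
import numpy as np

def chunk_roi_cloud(points, num_chunks):
    """Uniformly split an (N, 3) ROI cloud into `num_chunks` sub-clouds along
    the longest axis of its axis-aligned bounding box."""
    points = np.asarray(points, dtype=float)
    mins, maxs = points.min(axis=0), points.max(axis=0)
    axis = int(np.argmax(maxs - mins))                    # longest bounding-box axis
    edges = np.linspace(mins[axis], maxs[axis], num_chunks + 1)
    chunks = []
    for i, (lo, hi) in enumerate(zip(edges[:-1], edges[1:])):
        # include the upper edge only for the last chunk so points land in exactly one chunk
        upper = points[:, axis] <= hi if i == num_chunks - 1 else points[:, axis] < hi
        chunks.append(points[(points[:, axis] >= lo) & upper])
    return chunks
```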

FIG. 11 illustrates an embodiment of a process that performs packing on an ROI cloud divided into chunks. The process shown in fig. 11 may be performed by the video codec 200. The process may begin at step S1100, in which an input cloud of p points and an ROI specified by a 3D bounding box are given. The process may proceed to step S1102, in which the ROI cloud is divided into a number of chunks (C). The ROI cloud may be chunked when it is determined that the ROI cannot fit into an HEVC tile.

The process proceeds to step S1104, in which ROI patches are generated for each chunk and the variable c is set to 0. The ROI patches for each chunk may be generated according to the process shown in Table 1. The process proceeds to step S1106, in which the variable c is incremented by 1 and the variable k is set to 0. Steps S1108, S1110, and S1112 are performed in the same manner as steps S1008, S1010, and S1012, respectively. In step S1112, if k is equal to the number of patches of the c-th chunk, the process proceeds to step S1114, in which it is determined whether c is equal to the number of chunks (C). Thus, steps S1106 to S1114 result in packing each ROI patch of each chunk.

If c is equal to the number of chunks (C), the process proceeds from step S1114 to step S1116. Steps S1116, S1118, S1120, and S1122 are performed in the same manner as steps S1014, S1016, S1018, and S1020, respectively. After step S1122 is executed, the process shown in fig. 11 ends. Although the process illustrated in fig. 11 is described for a single ROI, when the point cloud has a plurality of ROIs, steps S1102 to S1114 may be performed for each ROI included in the point cloud.

According to some embodiments, the system does not specify a tile map. In this case, the tile map is not fixed and can be designed flexibly. When the system does not specify a tile map, the point cloud may still be divided into an ROI cloud and a non-ROI cloud as described above, with patch generation performed independently on each individual cloud.

When there is a single ROI, the tile map may include one horizontal tile that encompasses all ROI patches and another horizontal tile that encompasses all non-ROI patches, where each horizontal tile spans the width of the image and the tiles are stacked on top of each other. In another example, when there is a single ROI, one tile may encompass all ROI patches and another tile may encompass all non-ROI patches, where the tiles are connected together and the combined length of the connected tiles spans the width of the image.

Fig. 12 shows an embodiment in which ROI patches and non-ROI patches are packed onto a 2D grid when no tile map is specified. As shown in fig. 12, all the steps of the process shown in fig. 10 are included. Further, if it is determined in step S1012 that all ROI patches have been packed (i.e., k = N_ROI), the process proceeds to step S1200 to set the bottom tile boundary to the maximum height of the ROI patch bounding boxes. In this regard, in step S1200, the size of the tile containing all ROI patches may be set such that the tile includes all ROI patches. Further, if it is determined in step S1020 that all non-ROI patches have been packed (i.e., k = N_non-ROI), the process proceeds to step S1202, in which it is determined whether additional tiles need to be added to the 2D grid. For example, when a tile map is not specified, the tile map may be designed to use the available space efficiently. An efficient tile map design may be a map of 2 × 1 tiles, with the top tile containing the ROI patches and the bottom tile containing the non-ROI patches (e.g., the design resulting from the process shown in fig. 12). To further benefit from the available design flexibility, the ROI patches may be packed into a tile having the smallest possible size. In this case, if the ROI patches do not span the entire width of the canvas, a vertical tile boundary may be placed immediately after the rightmost ROI patch to create one additional tile on the right side, which is empty. The advantage of this approach is better compression efficiency: the texture/geometry images are background-filled with redundant information, and when each tile is encoded/decoded independently, background filling of the empty tile is avoided. If it is determined that additional tiles need to be added to the 2D grid, the process proceeds to step S1204, in which the maximum bounding box width of the ROI patches is derived and a vertical tile boundary is placed at the derived width. Although the process illustrated in fig. 12 is described for a single ROI, when the point cloud has a plurality of ROIs, steps S1002 to S1012 and S1200 may be performed for each ROI included in the point cloud.
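Steps S1200 to S1204 can be sketched as follows, assuming each packed ROI patch records its placed position and 2D bounding-box size (an illustrative representation): the horizontal tile boundary is set at the maximum bottom edge of the ROI patch boxes, and an optional vertical boundary is placed at their maximum right edge to carve out an empty tile.

```python
def derive_tile_boundaries(packed_roi_patches, canvas_width):
    """Return (horizontal_boundary_y, vertical_boundary_x_or_None) for a
    2 x 1 tile layout: ROI patches above the horizontal boundary, non-ROI
    patches below it.  A vertical boundary is added only when the ROI
    patches do not span the full canvas width, creating an extra empty tile."""
    bottom = max(p["y"] + p["height"] for p in packed_roi_patches)   # step S1200
    right = max(p["x"] + p["width"] for p in packed_roi_patches)     # input to S1204
    vertical = right if right < canvas_width else None               # S1202 / S1204
    return bottom, vertical
```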

FIG. 13 shows an embodiment in which ROI chunking is performed and no tile map is specified. As shown in fig. 13, all the steps of the process shown in fig. 11 are included. Further, fig. 13 includes steps S1300 and S1304, which are performed in the same manner as steps S1200 and S1204, respectively, as described above. Although the process illustrated in fig. 13 is described for a single ROI, when the point cloud has a plurality of ROIs, steps S1102 to S1114 and step S1300 may be performed for each ROI included in the point cloud.

In some embodiments, to support the random access feature for a single ROI, the indices of the ROI patches are sent to the decoder so that the decoder decodes only the ROI patches to reconstruct the ROI cloud. Using the indices of the ROI patches, the decoder can determine the tiles populated by the ROI patches and decode only those tiles. Alternatively, instead of an index, a flag may be encoded for each patch to indicate whether the patch is an ROI patch or a non-ROI patch. As another alternative, the indices of the ROI patches are changed to [0, number of ROI patches - 1], and only the "number of ROI patches" is sent to the decoder; the decoder then knows that the first "number of ROI patches" patches are all ROI patches and reconstructs the ROI cloud by decoding those patches.
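Under the last alternative (ROI patches re-indexed to the front of the patch list and only the count transmitted), a decoder-side sketch of the random-access decision might look like this; decode_patch is an assumed hook for the per-patch decoding routine.

```python
def reconstruct_roi(num_roi_patches, all_patches, decode_patch):
    """Reconstruct only the ROI cloud: with the ROI patches re-indexed to
    [0, num_roi_patches - 1], the first num_roi_patches patches are decoded
    and the remaining (non-ROI) patches are skipped."""
    return [decode_patch(p) for p in all_patches[:num_roi_patches]]
```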

According to some embodiments, the point cloud may include a plurality of ROIs. When multiple ROIs are included in the point cloud, rather than creating only one set of ROI patches, multiple sets of ROI patches may be created (e.g., one set of ROI patches per ROI cloud). When a tile map is specified, the sets may be packed set by set according to the process shown in fig. 10; when a tile map is not specified, the sets may be packed set by set according to the process shown in fig. 12. Likewise, when a tile map is specified, chunking may be applied to each ROI cloud and the chunks packed according to the process shown in fig. 11; when a tile map is not specified, chunking may be applied to each ROI cloud and the chunks packed according to the process shown in fig. 13.

Fig. 14A shows an embodiment in which each ROI of multiple ROIs is placed in a tile that spans the width of the image. Fig. 14B shows an embodiment in which each ROI is placed in vertically stacked tiles and the empty space is divided into additional tiles. The tile boundaries in fig. 14A and 14B are depicted with dashed lines.

According to some embodiments, in order to support the random access feature for multiple ROIs, the indices of the ROI patches are modified. The patches of ROI #1 may be indexed by [0, number of patches of ROI #1 - 1], the patches of ROI #2 may be indexed by [number of patches of ROI #1, number of patches of ROI #1 + number of patches of ROI #2 - 1], and so on. Only the number of patches per ROI may be sent to the decoder, and the decoder may determine the patch indices of a particular ROI and reconstruct that ROI accordingly.

In an exemplary decoding process, the number of patches per ROI may be specified as an integer array as shown below:

A=[n_0,n_1,n_2,…,n_(R-1)],

where n_r indicates the number of patches of the r-th ROI.

The decoder knows that the patch indices of the r-th ROI lie in the following range:

B = [n_0 + n_1 + … + n_(r-1), n_0 + n_1 + … + n_r - 1].

Thus, in some embodiments, to decode the r-th ROI, the decoder only needs to decode the patches whose indices fall within range B.
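A small sketch of this index-range computation (with r counted from 0, as in the array A above):

```python
def roi_patch_index_range(A, r):
    """Given A = [n_0, ..., n_(R-1)] (patch counts per ROI), return the
    inclusive patch-index range [start, end] of the r-th ROI."""
    start = sum(A[:r])          # n_0 + n_1 + ... + n_(r-1)
    end = sum(A[:r + 1]) - 1    # n_0 + n_1 + ... + n_r - 1
    return start, end

# Example: A = [5, 3, 4] -> ROI 1 occupies patch indices 5 through 7.
assert roi_patch_index_range([5, 3, 4], 1) == (5, 7)
```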

The techniques described above may be implemented as computer software using computer readable instructions and physically stored on one or more computer readable media. For example, fig. 15 illustrates a computer system (1500) suitable for implementing certain embodiments of the disclosed subject matter.

The computer software may be encoded using any suitable machine code or computer language, which may be compiled, linked, or otherwise processed to create code comprising instructions that may be executed directly, or through interpretation, microcode execution, and the like, by one or more computer Central Processing Units (CPUs), Graphics Processing Units (GPUs), and so forth.

The instructions may be executed on various types of computers or components thereof, including, for example, personal computers, tablet computers, servers, smart phones, gaming devices, internet of things devices, and so forth.

The components of computer system (1500) shown in FIG. 15 are exemplary in nature and are not intended to suggest any limitation as to the scope of use or functionality of the computer software implementing embodiments of the present disclosure. Neither should the configuration of the components be interpreted as having any dependency or requirement relating to any one or combination of components illustrated in the exemplary embodiments of the computer system (1500).

The computer system (1500) may include some human interface input devices. Such human interface input devices may be responsive to input by one or more human users, for example by: tactile input (e.g., keystrokes, strokes, data glove movements), audio input (e.g., voice, clapping hands), visual input (e.g., gestures), olfactory input (not depicted). The human interface device may also be used to capture certain media that are not necessarily directly related to human conscious input, such as audio (e.g., voice, music, ambient sounds), images (e.g., scanned images, captured images from a still image camera), video (e.g., two-dimensional video, three-dimensional video including stereoscopic video), and so forth.

The input human interface device may comprise one or more of the following (only one of each shown): keyboard (1501), mouse (1502), touch pad (1503), touch screen (1510), data glove (not shown), joystick (1505), microphone (1506), scanner (1507), camera (1508).

The computer system (1500) may also include some human interface output devices. Such human interface output devices may stimulate the senses of one or more human users, for example through tactile output, sound, light, and smell/taste. Such human interface output devices may include tactile output devices (e.g., tactile feedback from the touch screen (1510), the data glove (not shown), or the joystick (1505), though tactile feedback devices that do not serve as input devices are also possible), audio output devices (e.g., speakers (1509), headphones (not shown)), and visual output devices (e.g., screens (1510) including CRT screens, LCD screens, plasma screens, and OLED screens, each with or without touch-screen input capability and each with or without tactile feedback capability), some of which are capable of outputting two-dimensional visual output, or output in more than three dimensions, through devices such as stereoscopic image output, virtual reality glasses (not depicted), holographic displays and smoke boxes (not depicted), and printers (not depicted).

The computer system (1500) may also include human-accessible storage devices and their associated media: for example, optical media including CD/DVD ROM/RW (1520) with CD/DVD or similar media (1521), thumb drives (1522), removable hard drives or solid-state drives (1523), conventional magnetic media such as tapes and floppy disks (not shown), specialized ROM/ASIC/PLD-based devices such as security dongles (not shown), and so forth.

Those skilled in the art will also appreciate that the term "computer-readable medium" used in connection with the presently disclosed subject matter does not encompass transmission media, carrier waves, or other transitory signals.

The computer system (1500) may also include an interface to one or more communication networks. The networks may be, for example, wireless, wired, or optical. The networks may further be local, wide area, metropolitan, vehicular and industrial, real-time, delay-tolerant, and so on. Examples of networks include local area networks such as Ethernet and wireless LANs, cellular networks including GSM, 3G, 4G, 5G, LTE, and the like, TV wired or wireless wide-area digital networks including cable TV, satellite TV, and terrestrial broadcast TV, and vehicular and industrial networks including CANBus. Certain networks typically require external network interface adapters attached to certain general-purpose data ports or peripheral buses (1549) (e.g., USB ports of the computer system (1500)); as described below, other network interfaces are typically integrated into the core of the computer system (1500) by attachment to a system bus (for example, an Ethernet interface in a PC computer system or a cellular network interface in a smartphone computer system). Using any of these networks, the computer system (1500) may communicate with other entities. Such communication may be unidirectional, receive only (e.g., broadcast TV), unidirectional, send only (e.g., CANbus to certain CANbus devices), or bidirectional, for example to other computer systems using a local or wide area digital network. As described above, certain protocols and protocol stacks may be used on each of those networks and network interfaces.

The human interface devices, human-accessible storage devices, and network interfaces described above may be attached to the core (1540) of the computer system (1500).

The core (1540) may include one or more Central Processing Units (CPUs) (1541), Graphics Processing Units (GPUs) (1542), special purpose programmable processing units in the form of Field Programmable Gate Arrays (FPGAs) (1543), hardware accelerators (1544) for certain tasks, and so forth. These devices, as well as Read Only Memory (ROM) (1545), random access memory (1546), internal mass storage (1547), such as internal non-user accessible hard drives, SSDs, etc., may be connected by a system bus (1548). In some computer systems, the system bus (1548) may be accessed in the form of one or more physical plugs to enable expansion by additional CPUs, GPUs, and the like. The peripheral devices may be connected directly to the system bus (1548) of the core or through a peripheral bus (1549). The architecture of the peripheral bus includes PCI, USB, etc.

The CPU (1541), GPU (1542), FPGA (1543) and accelerator (1544) may execute certain instructions, which may be combined to make up the computer code described above. The computer code may be stored in ROM (1545) or RAM (1546). Transitional data may also be stored in RAM (1546), while persistent data may be stored, for example, in internal mass storage (1547). Fast storage and retrieval to any storage device may be done by using a cache, which may be closely associated with: one or more CPUs (1541), GPUs (1542), mass storage (1547), ROMs (1545), RAMs (1546), and the like.

The computer-readable medium may have thereon computer code for performing various computer-implemented operations. The media and computer code may be those specially designed and constructed for the purposes of the present disclosure, or they may be of the kind well known and available to those having skill in the computer software arts.

By way of non-limiting example, a computer system having the architecture (1500), and in particular the core (1540), may provide functionality as a result of one or more processors (including CPUs, GPUs, FPGAs, accelerators, etc.) executing software embodied in one or more tangible computer-readable media. Such computer-readable media may be media associated with the user-accessible mass storage described above, as well as certain non-transitory memory of the core (1540), such as core-internal mass storage (1547) or ROM (1545). Software implementing various embodiments of the present disclosure may be stored in such devices and executed by the core (1540). The computer-readable medium may include one or more memory devices or chips, according to particular needs. The software may cause the core (1540), and in particular the processors therein (including CPUs, GPUs, FPGAs, etc.), to perform certain processes or certain portions of certain processes described herein, including defining data structures stored in RAM (1546) and modifying such data structures according to processes defined by the software. Additionally or alternatively, the functionality provided by the computer system may be caused by logic hardwired or otherwise embodied in circuitry (e.g., accelerator (1544)) that may operate in place of or in conjunction with software to perform certain processes or certain portions of certain processes described herein. Where appropriate, reference to portions of software may encompass logic and vice versa. Where appropriate, reference to portions of a computer-readable medium may include circuitry (e.g., an Integrated Circuit (IC)) that stores software for execution, circuitry that embodies logic for execution, or both. The present disclosure includes any suitable combination of hardware and software.

While this disclosure has described several exemplary embodiments, there are alterations, permutations, and various substitute equivalents, which fall within the scope of this disclosure. It will thus be appreciated that those skilled in the art will be able to devise numerous systems and methods which, although not explicitly shown or described herein, embody the principles of the disclosure and are thus within the spirit and scope of the disclosure.

(1) A method performed by a video encoder, comprising: receiving a data cloud comprising a plurality of data points representing a three-dimensional (3D) space; identifying each data point comprising a region of interest (ROI) associated with the data cloud; dividing the data cloud into an ROI cloud and one or more non-ROI clouds, the ROI cloud comprising data points that each include the ROI; performing a patch generation process on the ROI cloud, the patch generation process comprising generating an ROI patch from each data point comprising the ROI; and performing a patch packing process on the ROI cloud, the patch packing process comprising: (i) mapping each ROI patch to a two-dimensional (2D) map, the 2D map comprising a plurality of tiles arranged as a grid in the 2D map, (ii) determining whether at least two ROI patches of the plurality of ROI patches are located in more than one tile, and (iii) moving each of the ROI patches from the plurality of tiles to one tile in response to determining that the at least two ROI patches are located in more than one tile.

(2) The method according to feature (1), further comprising: performing a patch generation process on each non-ROI cloud, the patch generation process comprising creating a non-ROI patch for each data point that does not include an ROI, and performing a patch packing process on each non-ROI cloud, the patch packing process comprising mapping each of the non-ROI patches to one or more empty spaces in the two-dimensional map that do not include an ROI patch.

(3) The method according to feature (1) or (2), wherein the patch packing process for the ROI cloud and the patch packing process for each non-ROI cloud are performed in parallel.

(4) The method according to feature (2) or (3), further comprising: compressing the tile containing each of the ROI patches according to a first compression rate; and compressing respective tiles of the plurality of tiles that do not contain an ROI patch according to a second compression rate higher than the first compression rate.

(5) The method according to any one of features (1) to (4), further comprising: determining whether the ROI is larger than each tile included in the 2D map; and in response to determining that the ROI is larger than each tile included in the 2D map, dividing the ROI cloud into one or more sub-ROI clouds, wherein the patch generation process and the patch packing process are performed on each of the one or more sub-ROI clouds.

(6) The method according to feature (5), wherein the patch packing process is performed in parallel on the one or more sub-ROI clouds.

(7) The method according to any one of features (1) to (6), further comprising: determining whether the video encoder specifies a size of each tile in the 2D map; and in response to determining that the video encoder does not specify a size of each tile in the 2D map, setting a height of the tile including the ROI patches such that the ROI patches are bounded by the tile including the ROI patches.

(8) The method according to feature (7), further comprising: in response to determining that the video encoder does not specify a size of each tile in the 2D map, setting a width of the tile including the ROI patches such that the ROI patches are bounded by the tile including the ROI patches.

(9) The method according to any one of features (2) to (8), wherein the data cloud includes a plurality of ROIs, the data cloud is divided into a plurality of ROI clouds, each ROI cloud corresponds to a respective ROI, and the patch generation process and the patch packing process are performed on each ROI cloud.

(10) The method according to feature (9), wherein the patch packing process performed on each ROI cloud causes each ROI to be mapped to a different tile in the 2D map.

(11) A video encoder comprising processing circuitry configured to: receive a data cloud comprising a plurality of data points representing a three-dimensional (3D) space; identify each data point comprising a region of interest (ROI) associated with the data cloud; divide the data cloud into an ROI cloud and one or more non-ROI clouds, the ROI cloud comprising data points that each include an ROI; perform a patch generation process on the ROI cloud, the patch generation process comprising generating an ROI patch from each data point comprising the ROI; and perform a patch packing process on the ROI cloud, the patch packing process comprising: (i) mapping each ROI patch to a two-dimensional (2D) map, the 2D map comprising a plurality of tiles arranged as a grid in the 2D map, (ii) determining whether at least two ROI patches of the plurality of ROI patches are located in more than one tile, and (iii) moving each of the ROI patches from the plurality of tiles to one tile in response to determining that at least two ROI patches are located in more than one tile.

(12) The video encoder according to feature (11), wherein the processing circuitry is further configured to: perform a patch generation process on each non-ROI cloud, the patch generation process comprising creating a non-ROI patch for each data point that does not include an ROI, and perform a patch packing process on each non-ROI cloud, the patch packing process comprising mapping each of the non-ROI patches to one or more empty spaces in the two-dimensional map that do not include an ROI patch.

(13) The video encoder according to feature (11) or (12), wherein the patch packing process for the ROI cloud and the patch packing process for each non-ROI cloud are performed in parallel.

(14) The video encoder according to feature (12) or (13), wherein the processing circuitry is further configured to: compress the tile containing each of the ROI patches according to a first compression rate; and compress respective tiles of the plurality of tiles that do not contain an ROI patch according to a second compression rate higher than the first compression rate.

(15) The video encoder according to any one of features (11) to (14), wherein the processing circuitry is further configured to: determine whether the ROI is larger than each tile included in the 2D map; and in response to determining that the ROI is larger than each tile included in the 2D map, divide the ROI cloud into one or more sub-ROI clouds, wherein the patch generation process and the patch packing process are performed on each of the one or more sub-ROI clouds.

(16) The video encoder according to feature (15), wherein the patch packing process is performed in parallel on the one or more sub-ROI clouds.

(17) The video encoder according to any one of features (11) to (16), wherein the processing circuitry is further configured to: determine whether the video encoder specifies a size of each tile in the 2D map; and in response to determining that the video encoder does not specify a size of each tile in the 2D map, set a height of the tile including the ROI patches such that the ROI patches are bounded by the tile including the ROI patches.

(18) The video encoder according to feature (17), wherein the processing circuitry is further configured to: in response to determining that the video encoder does not specify a size of each tile in the 2D map, set a width of the tile including the ROI patches such that the ROI patches are bounded by the tile including the ROI patches.

(19) The video encoder according to any one of features (12) to (18), wherein the data cloud comprises a plurality of ROIs, the data cloud is divided into a plurality of ROI clouds, each ROI cloud corresponds to a respective ROI, and the patch generation process and the patch packing process are performed on each ROI cloud.

(20) A non-transitory computer-readable medium storing instructions that, when executed by a processor in a video encoder, cause the processor to perform a method comprising: receiving a data cloud comprising a plurality of data points representing a three-dimensional (3D) space; identifying each data point comprising a region of interest (ROI) associated with the data cloud; dividing the data cloud into an ROI cloud and one or more non-ROI clouds, the ROI cloud comprising data points that each include the ROI; performing a patch generation process on the ROI cloud, the patch generation process comprising generating an ROI patch from each data point comprising the ROI; and performing a patch packing process on the ROI cloud, the patch packing process comprising: (i) mapping each ROI patch to a two-dimensional (2D) map, the 2D map comprising a plurality of tiles arranged as a grid in the 2D map, (ii) determining whether at least two ROI patches of the plurality of ROI patches are located in more than one tile, and (iii) moving each of the ROI patches from the plurality of tiles to one tile in response to determining that the at least two ROI patches are located in more than one tile.
