Enhanced region-oriented encapsulation and view-independent high-efficiency video coding media profile

Document No.: 1472545 Publication date: 2020-02-21

Reading note: This technology, "Enhanced region-oriented encapsulation and view-independent high-efficiency video coding media profile", was created by Ye-Kui Wang and Thomas Stockhammer on 2018-07-10. Main content: The present invention provides a device for processing media content, which may be configured to: obtain, from a region-oriented encapsulation box within a video file, a first set of values indicative of a first size and a first position of a first encapsulated region of media content, and a second set of values indicative of a second size and a second position of a second encapsulated region of the media content, wherein the first set of values and the second set of values are in units relative to an upper-left corner luma sample of a decapsulated picture; decapsulate the first encapsulated region to produce a first decapsulated region; form a first projected region from the first decapsulated region; decapsulate the second encapsulated region to produce a second decapsulated region; and form a second projected region from the second decapsulated region, the second projected region being different from the first projected region.

1. A method of processing media content, the method comprising:

obtaining, from a region-oriented encapsulation box within a video file, a first set of values indicative of a first size and a first position of a first encapsulated region of media content, and a second set of values indicative of a second size and a second position of a second encapsulated region of the media content, wherein the first set of values and the second set of values are in units relative to an upper-left corner luma sample of a decapsulated picture that includes the first encapsulated region and the second encapsulated region;

decapsulating the first encapsulated region to produce a first decapsulated region;

forming a first projected region from the first decapsulated region;

decapsulating the second encapsulated region to produce a second decapsulated region; and

forming a second projected region from the second decapsulated region, the second projected region being different from the first projected region.
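
The steps of claim 1 can be illustrated with a minimal Python sketch. It is not the claimed implementation: it assumes the picture is a plain list of luma-sample rows, that decapsulation is a simple crop with no resampling or rotation, and that each region carries a hypothetical destination (proj_top, proj_left) in the projected picture. The names EncapsulatedRegion, decapsulate_region, form_projected_region and process are illustrative only.

```python
from dataclasses import dataclass
from typing import List

Picture = List[List[int]]  # rows of luma samples


@dataclass
class EncapsulatedRegion:
    # Size and position, in luma samples measured from the
    # upper-left corner of the picture holding the region.
    width: int
    height: int
    top: int
    left: int
    # Hypothetical destination in the projected picture (an assumption,
    # not a field named in the claims).
    proj_top: int
    proj_left: int


def decapsulate_region(picture: Picture, r: EncapsulatedRegion) -> Picture:
    """Crop one encapsulated region out of the picture."""
    rows = picture[r.top:r.top + r.height]
    return [row[r.left:r.left + r.width] for row in rows]


def form_projected_region(projected: Picture, region: Picture,
                          r: EncapsulatedRegion) -> None:
    """Copy a decapsulated region to its place in the projected picture."""
    for dy, row in enumerate(region):
        start = r.proj_left
        projected[r.proj_top + dy][start:start + len(row)] = row


def process(picture: Picture, regions: List[EncapsulatedRegion],
            proj_width: int, proj_height: int) -> Picture:
    """Decapsulate every region and assemble the projected picture."""
    projected = [[0] * proj_width for _ in range(proj_height)]
    for r in regions:
        form_projected_region(projected, decapsulate_region(picture, r), r)
    return projected


# Two regions of a 4x2 picture that swap places in the projected picture.
picture = [[1, 2, 3, 4],
           [5, 6, 7, 8]]
regions = [
    EncapsulatedRegion(width=2, height=2, top=0, left=0, proj_top=0, proj_left=2),
    EncapsulatedRegion(width=2, height=2, top=0, left=2, proj_top=0, proj_left=0),
]
projected = process(picture, regions, proj_width=4, proj_height=2)
```

In this toy input the two encapsulated regions map to two different projected regions, which mirrors the final limitation of claim 1.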

2. The method of claim 1, wherein the first set of values includes a first width value, a first height value, a first top value, and a first left value, and wherein the second set of values includes a second width value, a second height value, a second top value, and a second left value, the method further comprising:

determining a first width of the first encapsulated region from the first width value;

determining a first height of the first encapsulated region from the first height value;

determining a first top offset for the first encapsulated region from the first top value;

determining a first left-side offset of the first encapsulated region from the first left value;

determining a second width of the second encapsulated region from the second width value;

determining a second height of the second encapsulated region from the second height value;

determining a second top offset for the second encapsulated region from the second top value; and

determining a second left-side offset of the second encapsulated region from the second left value.

3. The method of claim 2, wherein the first width value comprises a packed_reg_width[i] value, the first height value comprises a packed_reg_height[i] value, the first top value comprises a packed_reg_top[i] value, the first left value comprises a packed_reg_left[i] value, the second width value comprises a packed_reg_width[j] value, the second height value comprises a packed_reg_height[j] value, the second top value comprises a packed_reg_top[j] value, and the second left value comprises a packed_reg_left[j] value.
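
As a rough illustration of claims 2 and 3, the following Python sketch reads the width, height, top offset and left-side offset of regions i and j from already-parsed field arrays. It assumes the box has been parsed into a dictionary keyed by the field names of claim 3; the binary layout of the box is not shown, and RegionGeometry, region_geometry and the sample values are hypothetical.

```python
from typing import Dict, List, NamedTuple


class RegionGeometry(NamedTuple):
    width: int
    height: int
    top_offset: int
    left_offset: int


def region_geometry(fields: Dict[str, List[int]], index: int) -> RegionGeometry:
    """Determine size and position of one encapsulated region from parsed fields."""
    return RegionGeometry(
        width=fields["packed_reg_width"][index],
        height=fields["packed_reg_height"][index],
        top_offset=fields["packed_reg_top"][index],
        left_offset=fields["packed_reg_left"][index],
    )


# Hypothetical parsed values for two regions (i = 0, j = 1).
parsed_fields = {
    "packed_reg_width": [960, 480],
    "packed_reg_height": [540, 270],
    "packed_reg_top": [0, 540],
    "packed_reg_left": [0, 960],
}
first = region_geometry(parsed_fields, 0)
second = region_geometry(parsed_fields, 1)
```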

4. The method of claim 1, further comprising:

obtaining a projected picture width and a projected picture height from the region-oriented encapsulation box within the video file, wherein the projected picture width and the projected picture height are in the relative units.

5. The method of claim 1, wherein the container of the region-oriented encapsulation box comprises a projected omnidirectional video box.

6. The method of claim 1, wherein the media content is monoscopic.

7. The method of claim 1, wherein the media content is stereoscopic.

8. The method of claim 7, wherein the first encapsulated region corresponds to a first picture of the media content, and wherein the second encapsulated region corresponds to a second picture of the media content.

9. An apparatus for processing media content, the apparatus comprising:

a memory configured to store media content; and

one or more processors implemented in circuitry and configured to:

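obtain, from a region-oriented encapsulation box within a video file, a first set of values indicative of a first size and a first position of a first encapsulated region of the media content, and a second set of values indicative of a second size and a second position of a second encapsulated region of the media content, wherein the first set of values and the second set of values are in units relative to an upper-left corner luma sample of a decapsulated picture that includes the first encapsulated region and the second encapsulated region;

decapsulate the first encapsulated region to produce a first decapsulated region;

form a first projected region from the first decapsulated region;

decapsulate the second encapsulated region to produce a second decapsulated region; and

form a second projected region from the second decapsulated region, the second projected region being different from the first projected region.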

10. The apparatus of claim 9, wherein the first set of values comprises a first width value, a first height value, a first top value, and a first left value, and wherein the second set of values comprises a second width value, a second height value, a second top value, and a second left value, wherein the one or more processors are further configured to: