Image processing apparatus, image processing method, and program

文档序号:1836572 发布日期:2021-11-12 浏览:8次 中文

阅读说明:本技术 图像处理装置、图像处理方法和程序 (Image processing apparatus, image processing method, and program ) 是由 小仓翔 山下裕也 于 2020-03-13 设计创作,主要内容包括:本发明使得能够组合使用推荐视点和自由视点而无需通过帧等显示推荐视点的位置。通过检测单元检测视点位置改变指令。输出单元向显示单元输出图像的一部分。在输出推荐范围之后,根据视点位置改变单元指令改变输出范围(对应于当前视点的图像范围),并且输出范围基于预定条件在朝向推荐范围的方向上迁移。例如,预定条件是输出范围在推荐范围之外。此外,例如,控制转换速度,使得输出范围和推荐范围之间的位置差越大,则输出范围越快地返回到推荐范围。(The present invention enables combined use of a recommended viewpoint and a free viewpoint without displaying the position of the recommended viewpoint by a frame or the like. The viewpoint position changing instruction is detected by the detection unit. The output unit outputs a part of the image to the display unit. After the recommended range is output, the output range (the image range corresponding to the current viewpoint) is instructed to be changed according to the viewpoint position changing unit, and the output range is shifted in a direction toward the recommended range based on a predetermined condition. For example, the predetermined condition is that the output range is outside the recommended range. Further, for example, the switching speed is controlled so that the larger the positional difference between the output range and the recommended range, the faster the output range returns to the recommended range.)

1. An image processing apparatus comprising:

a detection unit detecting viewpoint shift information according to the viewpoint position change instruction;

an output range determination unit that determines an output range of an image containing the recommended viewpoint information based on the recommended viewpoint information and the viewpoint offset information; and

an output unit that outputs a part of the image as a display image to the display unit based on the determined output range,

wherein the output range determination unit determines the output range such that: in a case where the display image satisfies the predetermined condition, the display image is located within the recommended range indicated by the recommended viewpoint information.

2. The image processing apparatus according to claim 1, wherein the output range determination unit determines the output range such that the display image migrates in a direction toward the recommended range.

3. The image processing apparatus according to claim 1, wherein the predetermined condition includes a case where the display image is different from the recommended range.

4. The image processing apparatus according to claim 1, wherein the predetermined condition includes a case where the display image is different from the recommended range for a predetermined time or longer.

5. The image processing apparatus according to claim 3, wherein the predetermined condition further includes a case where the viewpoint offset information is not detected.

6. The image processing apparatus according to claim 1, wherein the predetermined condition includes a case where the display image is located within a specific range in the image.

7. The image processing apparatus according to claim 1, wherein the predetermined condition includes a case where the display image is different from the recommended range in a state where the image is included in the image of the specific frame.

8. The image processing apparatus according to claim 1, wherein the predetermined condition includes a case where the display image is different from the recommended range and an instruction to migrate to the recommended range is also issued as the viewpoint offset information.

9. The image processing apparatus according to claim 2, wherein the output range determining unit controls the speed of the transition based on a positional difference between the display image and the recommended range so that the speed of the transition becomes higher in a case where the positional difference is large than in a case where the positional difference is small.

10. The image processing apparatus according to claim 2, wherein the output range determining unit controls the speed of the transition so that the return to the recommended range is achieved within a certain time.

11. The image processing apparatus according to claim 2, wherein the output range determining unit controls the speed of the transition so that the return to the recommended range is achieved at a certain speed.

12. The image processing apparatus according to claim 2, wherein the output range determination unit interpolates the shifted route by spherical linear interpolation.

13. The image processing apparatus according to claim 2, wherein the output unit interpolates the migrated route by linear interpolation.

14. The image processing apparatus according to claim 2, wherein in a case where the display image is different from the recommended range as a result of changing the viewpoint position based on the viewpoint offset information, the output range determination unit determines the output range so that the offset is achieved along a route in a direction opposite to a viewpoint position change direction corresponding to the change of the viewpoint position at the time of the transition.

15. The image processing apparatus according to claim 2, wherein the output range determination unit determines the output range such that the shift is implemented along a shortest route from the display image to the recommended range at the time of the transition.

16. The image processing apparatus according to claim 2, wherein in a case where the image contains mask range information indicating a mask range, the output range determination unit determines the output range such that the shift is implemented along a route that does not pass through the mask range at the time of the transition.

17. The image processing apparatus according to claim 2, wherein in a case where the image contains passing point information indicating a passing point, the output range determination unit determines the output range such that the shift is achieved along a route passing the passing point at the time of the transition.

18. The image processing apparatus according to claim 1, wherein in a case where the recommended viewpoint information contains a plurality of recommended ranges, the output range determination unit determines the output range such that the shift is realized toward a recommended range located at a position closest to the display image among the plurality of recommended ranges or toward a recommended range located at a position selected by the user at the time of the transition.

19. An image processing method comprising:

a step for detecting viewpoint shift information according to the viewpoint position change instruction;

a step for determining an output range of an image containing the recommended viewpoint information based on the recommended viewpoint information and the viewpoint offset information; and

a step for outputting a part of the image as a display image to a display unit based on the determined output range,

wherein, in the step for determining the output range, the output range is determined such that: in a case where the display image satisfies the predetermined condition, the display image is located within the recommended range indicated by the recommended viewpoint information.

20. A program that causes a computer to function as:

a detection means for detecting the viewpoint shift information based on the viewpoint position change instruction;

an output range determination means for determining an output range of an image containing the recommended viewpoint information based on the recommended viewpoint information and the viewpoint offset information; and

an output means for outputting a part of the image as a display image to the display unit based on the determined output range,

wherein the output range determining means determines the output range such that: in a case where the display image satisfies a predetermined condition, the display image is located within the recommended viewpoint range indicated by the recommended viewpoint information.

Technical Field

The present technology relates to an image processing apparatus, an image processing method, and a program, and particularly relates to an image processing apparatus and the like capable of providing a user with preferable observation of wide-angle-of-view images such as VR images and panoramic images.

Background

Under the oma (omni-directional media application format) standard of MPEG-DASH (dynamic adaptive streaming over HTTP), a method for transmitting recommended viewpoint information together with wide view angle image data having a view angle of 360 ° or less is defined. Also, for example, PTL 1 contains a description about recommended viewpoint display.

[ citation list ]

[ patent document ]

[PTL 1]

JP-2016194784A

Disclosure of Invention

[ problem ] to

During the display of the recommended viewpoint, only the recommended viewpoint is visible in the wide-view-angle image. On the other hand, the recommended viewpoint position is difficult to recognize in a freely visible state. In this case, it may be considered to display the recommended viewpoint by using a frame or the like. However, the display may cause such inconvenience that the interpretation of the world in the image collapses at the corresponding portion.

The purpose of the present technology is to realize combined use of a recommended viewpoint and a free viewpoint without displaying the position of the recommended viewpoint by using a frame or the like.

[ solution of problem ]

The concept of the present technology relates to an image processing apparatus including: a detection unit detecting viewpoint shift information according to the viewpoint position change instruction; an output range determination unit that determines an output range of an image containing the recommended viewpoint information based on the recommended viewpoint information and the viewpoint offset information; and an output unit that outputs a part of the image to the display unit as a display image based on the determined output range. The output range determination unit determines the output range such that: in a case where the display image satisfies the predetermined condition, the display image is located within the recommended range indicated by the recommended viewpoint information.

According to the present technology, the detection unit detects viewpoint offset information according to a viewpoint position change instruction. The viewpoint position changing instruction is given by, for example, a gyro sensor (angular velocity sensor), a user operation, and the like. The output range determination unit determines an output range of an image containing the recommended viewpoint information based on the recommended viewpoint information and the viewpoint offset information. In addition, the output unit outputs a part of the image to the display unit as a display image based on the determined output range. For example, the image is a wide view angle image such as a VR image and a panoramic image. The output range determination unit determines the output range such that: in a case where the display image satisfies the predetermined condition, the display image is located within the recommended range indicated by the recommended viewpoint information. For example, the output range determination unit may determine the output range such that the display image migrates in a direction toward the recommended range.

Also, for example, the predetermined condition may include a case where the display image is different from the recommended range. Further, for example, the predetermined condition may include a case where the display image is different from the recommended range for a predetermined time or longer. In addition, the predetermined condition may also include a case where the viewpoint offset information is not detected, for example.

Also, for example, the predetermined condition may include a case where the display image is located within a specific range in the image. Also, for example, the predetermined condition may include a case where the display image is different from the recommended range in a state where the image is included in the image of the specific frame. In addition, for example, the predetermined condition may include a case where the display image is different from the recommended range and an instruction to migrate to the recommended range is also issued as the viewpoint offset information.

Also, for example, the output range determining unit may control the speed of the transition based on the positional difference between the display image and the recommended range so that the speed of the transition becomes higher in the case where the positional difference is large than in the case where the positional difference is small. Also, for example, the output range determination unit may control the speed of migration so that return to the recommended range is achieved within a certain time. In addition, for example, the output range determination unit may control the speed of the migration so that the return to the recommended range is achieved at a certain speed.

Also, for example, the output range determination unit may interpolate the migrated route by spherical linear interpolation. Also, for example, the output range determination unit may interpolate the migrated route by linear interpolation.

Also, for example, in a case where the display image is different from the recommended range as a result of changing the viewpoint position based on the viewpoint displacement information, the output range determination unit may determine the output range such that the current output range is displaced along a route in a direction opposite to the viewpoint position changing direction corresponding to the viewpoint position changing instruction at the time of the transition. Also, for example, the output range determination unit may determine the output range such that the shift is realized along the shortest route from the display image to the recommended range at the time of the transition.

Also, for example, in a case where the image contains the mask range information indicating the mask range, the output range determination unit may determine the output range such that the shift is realized along a route that does not pass through the mask range at the time of the transition. Also, for example, in a case where the image contains passing point information indicating a passing point, the output range determination unit may determine the output range such that the shift is achieved along a route passing the passing point at the time of the transition.

Also, for example, in a case where the recommendation point information contains a plurality of recommendation ranges, the output range determination unit may determine the output range such that the shift is realized toward a recommendation range located at a position closest to the display image among the plurality of recommendation ranges at the time of the transition or toward a recommendation range located at a position selected by the user.

As described above, the present technology determines the output range of an image containing recommended viewpoint information based on the recommended point information and viewpoint offset information. Therefore, the combined use of the recommended viewpoint and the free viewpoint can be realized without displaying the position of the recommended viewpoint by using a frame or the like.

Drawings

Fig. 1 is a block diagram showing a configuration example of an image distribution system according to an embodiment.

Fig. 2 is a diagram for explaining recommended viewpoint information and the like.

Fig. 3 is a block diagram showing a configuration example of a display device.

Fig. 4 is a diagram for explaining processing for obtaining a viewpoint position of the next frame performed by the migration calculation unit.

Fig. 5 is a diagram for explaining the transition from the current viewpoint to the recommended viewpoint.

Fig. 6 is a diagram for explaining cutting and displaying an image corresponding to a current viewpoint.

Fig. 7 is a flowchart showing an example of display processing steps performed by the display device.

Fig. 8 is a block diagram showing a configuration example of hardware of the computer.

Fig. 9 is a diagram schematically showing the general configuration of an operating room system.

Fig. 10 is a diagram illustrating an example of display of an operation screen image of the collective operation panel.

Fig. 11 is a diagram showing an example of a state of an operation to which the operating room system is applied.

Fig. 12 is a block diagram showing an example of a functional configuration of the camera head and Camera Control Unit (CCU) shown in fig. 11.

Fig. 13 is a diagram for explaining an example in which the current viewpoint is shifted to the recommended viewpoint in a direction corresponding to the shortest distance when migrating from the current viewpoint to the recommended viewpoint.

Fig. 14 is a diagram for explaining a case where a mask range exists in a direction opposite to a direction corresponding to a movement caused by a user when migrating from a current viewpoint to a recommended viewpoint.

Fig. 15 is a flowchart showing an example of display processing steps performed by the display device in a case where the route does not pass through the mask range.

Fig. 16 is a diagram for explaining a case where a passing point is set at the time of transition from the current viewpoint to the recommended viewpoint.

Fig. 17 is a flowchart showing an example of display processing steps performed by the display device in a case where the route passes through the passing point.

Fig. 18 is a diagram for explaining a case where there are a plurality of recommended viewpoints at the time of migration from the current viewpoint to the recommended viewpoint.

Fig. 19 is a flowchart showing an example of display processing steps performed by the display device in a case where the current viewpoint is shifted to a recommended viewpoint closest in position among the plurality of recommended viewpoints.

Fig. 20 is a flowchart showing an example of display processing steps performed by the display device in the case of shifting to a recommended viewpoint selected by the user among the plurality of recommended viewpoints.

Fig. 21 is a flowchart showing an example of display processing steps performed by the display device in a case based on whether or not the recommended viewpoint is shifted to the recommended viewpoint.

Fig. 22 is a flowchart showing an example of display processing steps performed by the display device in a case where no recommended viewpoint is displayed based on whether or not a predetermined time or longer has elapsed to shift to the recommended viewpoint.

Fig. 23 is a flowchart showing an example of display processing steps performed by the display device.

Detailed Description

Modes for carrying out the invention (hereinafter referred to as "embodiments") will be described below. Note that the description will be given in the following order.

1. Examples of the embodiments

2. Applications of

3. Modification example

<1. example >

[ image distribution System ]

Fig. 1 shows a configuration example of an image distribution system 10 according to an embodiment. The image distribution system 10 includes a distribution-side apparatus and a display apparatus 200 serving as a reception-side apparatus. Possible examples of the display device 200 include an HMD (head mounted display) and a tablet pc.

The distribution-side apparatus includes a multi-camera 101, a wide-angle-of-view image conversion unit 102, an encoder 103, and a distribution server 104.

The multi-camera 101 obtains image data of a spherical captured image. For example, the multi-camera 101 performs imaging by a back-to-back method by using two cameras to obtain a wide-angle front image and a wide-angle rear image, which have an angle of view of 180 ° or more, respectively, and are captured by using a fish-eye lens, as spherical captured images.

The wide angle-of-view image conversion unit 102 performs plane packing on the spherical captured images obtained by the multi-camera 101 to obtain rectangular projection images (projection pictures). The projection image is an image corresponding to a wide view angle image having a view angle of 360 ° or less. In this case, for example, an equirectangular (equirectangular) format or the like is selected as the format type of the projection image. Note that the wide view angle image conversion unit 102 scales the projection image as necessary to obtain a projection image having a predetermined resolution.

The encoder 103 performs, for example, HEVC or other encoding on the image data of the projection image received from the wide-view image conversion unit 102 to obtain encoded image data, and generates a video stream containing the encoded image data. In this stage, the encoder 103 inserts recommended viewpoint information into the video stream for each frame in the video stream.

For example, the recommended viewpoint is automatically set in a range including, for example, an actor estimated by a position sensor or image recognition, or is set in a range manually specified by, for example, a director. The recommended viewpoint set here is not limited to only one point, but may be set at a plurality of points.

Fig. 2(a) shows a spherical surface corresponding to a spherical captured image. Fig. 2(b) schematically shows a rectangular projection image in the case of adopting the equal rectangular format type. The center of the projection image is located at (0, 0).

For example, as shown in fig. 2(c), the recommended viewpoint information includes a frame number, center angle information, horizontal angle information, vertical angle information, a recommended viewpoint number, and the like. The center angle information, the horizontal angle information, and the vertical angle information are each angle information associated with a spherical captured image (see fig. 2 (a)).

Returning to fig. 1, the distribution server 104 transmits the video stream generated by the encoder 103 to the display apparatus 200 as distribution data. Note that the distribution may be arbitrarily realized by broadcasting or by communication.

Fig. 3 shows a configuration example of the display device 200. The display device 200 includes a receiving unit 201, a decoder 202, a field-of-view rendering unit 203, a gyro sensor 204, a viewpoint offset information calculation unit 205, a migration calculation unit 206, and a display unit 207. The transition calculation unit 206 here constitutes an output range determination unit, and the field-of-view rendering unit 203 constitutes an output unit.

The reception unit 201 receives a video stream transmitted from a transmission-side apparatus as distribution data. The decoder 202 decodes the video stream received by the receiving unit 201 to obtain a projection image (image data). In addition, the decoder 202 obtains recommended viewpoint information from the received video stream as information inserted into each frame of the video stream.

The gyro sensor 204 detects a change in the rotation angle of the device (i.e., the display device 200 here) that houses the gyro sensor 204. The viewpoint offset information calculation unit 205 calculates viewpoint offset information based on the detection output of the gyro sensor 204. The viewpoint offset information calculation unit 205 constitutes a detection unit that detects a viewpoint position change instruction.

In this configuration, in the case where the user wearing the HMD causes a viewpoint offset by turning the neck, rotating the tablet of the user, or other operation, the viewpoint offset information calculation unit 205 calculates viewpoint offset information indicating the viewpoint offset.

In addition, it is also conceivable that the user causes a viewpoint shift by performing a slide or the like on the touch panel. In this case, the viewpoint offset information calculation unit 205 calculates the viewpoint offset information based on the information indicating the operation performed on the touch panel.

The transition calculation unit 206 calculates the viewpoint position of the next frame for each frame based on the recommended viewpoint information obtained from the decoder 202 and the viewpoint offset information obtained from the viewpoint offset information calculation unit 205 associated with the next frame. In the case where the viewpoint shift is caused by the user operation here, the migration calculation unit 206 shifts the viewpoint position to reflect the viewpoint shift indicated by the viewpoint shift information, and specifies the shifted viewpoint position as the viewpoint position of the next frame.

On the other hand, in the case where the viewpoint shift is not caused by the user operation, the migration calculation unit 206 migrates the viewpoint position in the recommended viewpoint direction indicated by the recommended viewpoint information. In this case, the route from the current viewpoint to the recommended viewpoint is interpolated by spherical linear interpolation, and, as shown in fig. 4, the position advanced by one step is specified as the viewpoint position of the next frame. The amount of offset of a step depends on how many steps are needed to return to the recommended viewpoint for interpolation and thus becomes a parameter indicating the ease of return. Also, in this case, the speed control is performed such that the approach speed increases as the difference between the current viewpoint and the recommended viewpoint increases, and such that the approach speed decreases as the difference decreases. In other words, in the case where the positional difference is large, the migration speed is made higher than in the case where the positional difference is small.

Fig. 5(a) shows an example of the positional relationship between the recommended viewpoint and the current viewpoint (display viewpoint) in the rectangular projection image. When no user operation causes a viewpoint to be shifted in a state where the current viewpoint deviates from the recommended viewpoint as a result of the illustrated user operation, the current viewpoint moves toward the recommended viewpoint in a direction opposite to the direction of movement performed by the user, as shown in fig. 5 (b). The illustrated example indicates an example in which the current viewpoint of the t-4 frame returns to the recommended viewpoint of the t frame through 4 frames. By reducing the speed of the offset at which the current viewpoint approaches the recommended viewpoint, the current viewpoint smoothly returns to the recommended viewpoint.

The field-of-view rendering unit 203 cuts and presents the image data contained in the viewpoint position range in the next frame calculated by the migration calculation unit 206 from the projection image data obtained by the decoder 202 and associated with the next frame, and obtains display image data corresponding to the viewpoint position of the next frame. The display unit 207 displays images corresponding to the display image data of the respective frames obtained by the field-of-view drawing unit 203.

Fig. 6(a) shows an example of a rectangular projection image. Fig. 6(b) shows an example of the viewpoint position range in the projection image. In this case, as shown in fig. 6(c), an image corresponding to the viewpoint position is displayed in the display unit 207.

The action of the display apparatus 200 shown in fig. 3 will be briefly described. The distribution data transmitted from the transmission-side apparatus is received by the reception unit 201 and transmitted to the decoder 202. Decoder 202 decodes the video stream to obtain the projected image data for each frame in turn. The projection image data is transmitted to the field-of-view rendering unit 203. In addition, the decoder 202 obtains recommended viewpoint information from the video stream as information inserted for each frame of the video stream. The recommended viewpoint information is transferred to the migration calculation unit 206.

Also, the gyro sensor 204 detects a change in the rotation angle of the display device 200 (e.g., HMD or tablet) that houses the gyro sensor 204. The detection output thus obtained is transmitted to the viewpoint offset information calculation unit 205.

The viewpoint offset information calculation unit 205 calculates viewpoint offset information based on the detected output of the gyro sensor 204. In the case where the user wearing the HMD causes a viewpoint shift by turning the neck, rotating the user's tablet, or other operation herein, the viewpoint shift information calculation unit 205 calculates viewpoint shift information indicating the viewpoint shift. The viewpoint offset information is transferred to the migration calculation unit 206.

The transition calculation unit 206 calculates the viewpoint position of the next frame of each frame based on the recommended viewpoint information obtained from the decoder 202 and the viewpoint offset information obtained from the viewpoint offset information calculation unit 205, which are associated with the next frame. In the case where the viewpoint shift is caused by the user operation here, the migration calculation unit 206 shifts the viewpoint position to reflect the viewpoint shift indicated by the viewpoint shift information, and specifies the shifted viewpoint position as the viewpoint position of the next frame. Also, in the case where the viewpoint is not shifted by the user operation, the route from the current viewpoint to the recommended viewpoint is interpolated by spherical linear interpolation so that the viewpoint position shifts in the recommended viewpoint direction indicated by the recommended viewpoint information, and the stepped position is specified as the viewpoint position of the next frame.

The viewpoint position of the next frame obtained for each frame by the transition calculation unit 206 is transferred to the field-of-view rendering unit 203. The field-of-view rendering unit 203 cuts and presents image data contained in the viewpoint position range in the next frame and calculated by the migration calculation unit 206 from the projection image data obtained by the decoder 202 and associated with the next frame, and obtains display image data corresponding to the viewpoint position of the next frame of each frame.

The display image data of each frame obtained by the field-of-view drawing unit 203 is transferred to the display unit 207. The display unit 207 displays images corresponding to the display image data of the respective frames, that is, images corresponding to the current viewpoint position.

The flowchart shown in fig. 7 represents an example of display processing steps performed by the display apparatus 200. In step ST1, the display device 200 starts processing in response to a video clip reproduction start operation performed by the user, for example.

Subsequently, in step ST2, the display device 200 displays a range corresponding to the recommended viewpoint (recommended range) as an output range. Thereafter, in step ST3, the display device 200 determines whether or not a viewpoint shift is caused by a user operation.

In the case where the viewpoint shift is caused by the user operation, the display apparatus 200 shifts the viewpoint in the operation direction in step ST4, and then proceeds to the processing in step ST 5. In addition, in the case where the amount of shift of viewpoint shift caused by user operation is small and equal to a predetermined value or less, processing for determining that viewpoint shift is not caused by user operation (threshold processing) may be performed here. In this way, even in the case where, for example, the head wearing the HMD moves slightly, a stable display operation can be achieved. On the other hand, in the case where the viewpoint shift is not caused by the user, in step ST6, the display device 200 calculates a route in the recommended viewpoint direction, shifts the viewpoint in the recommended viewpoint direction along the calculated route, and then proceeds to the processing in step ST 5. In step ST5, the display device 200 displays a range corresponding to the current viewpoint as an output range.

Subsequently, in step ST7, the display device 200 determines whether video reproduction or distribution has ended. When it is determined that video reproduction or distribution has not ended, the display device 200 returns to the processing in step ST3 and reflects the processing of the next frame. On the other hand, when it is determined that video reproduction or distribution has ended, the series of processing ends at step ST 8.

Note that the series of processes performed by the display device 200 as described above may be performed by hardware, or may be performed by software. In the case where a series of processes is executed by software, a program constituting the software is installed in a computer. Examples of the computer herein include a computer integrated in dedicated hardware and a computer capable of executing various functions according to various programs installed in the computer, such as a general-purpose personal computer.

Fig. 8 is a block diagram showing an example of a hardware configuration of a computer 400 that executes the series of processing described above according to a program.

The computer 400 includes a CPU (central processing unit) 401, a ROM (read only memory) 402, and a RAM (random access memory) 403 connected to each other via a bus 404.

The input/output interface 405 is further connected to the bus 404. An input unit 406, an output unit 407, a recording unit 408, a communication unit 409, and a drive 410 are connected to the input/output interface 405.

The input unit 406 includes input switches, buttons, a microphone, an imaging element, and the like. The output unit 407 includes a display, a speaker, and the like. The recording unit 408 includes a hard disk, a nonvolatile memory, and the like. The communication unit 409 includes a network interface and the like. The drive 410 drives a removable medium 411 such as a magnetic disk, an optical disk, a magneto-optical disk, and a semiconductor memory.

According to the computer 400 configured as described above, for example, the CPU 401 loads a program stored in the recording unit 408 into the RAM 403 via the input/output interface 405 and the bus 404, and runs the loaded program to execute the above-described series of processes.

For example, a program executed by the computer 400(CPU 401) is allowed to be recorded in a removable medium 411 as a package medium or the like and provided in this form. In addition, the program is allowed to be provided through a wired or wireless transmission medium such as a local area network, the internet, and digital satellite broadcasting.

In the computer, it is allowed to install a program into the recording unit 408 via the input/output interface 405 by attaching the removable medium 411 to the drive 410. And, the program is allowed to be received by the communication unit 409 via a wired or wireless transmission medium and installed in the recording unit 408. Also, it is allowed to install a program in the ROM 402 or the recording unit 408 in advance.

Note that the program executed by the computer may be a program that performs processing in time series in the order described in this specification, or may be a program that performs processing in parallel or at necessary timing such as the occasion of calling.

As described above, the display device 200 of the image distribution system 10 shown in fig. 1 is capable of changing the output range according to the viewpoint offset caused by the user operation after designating the recommended range (range corresponding to the recommended viewpoint) as the output range, and shifting the output range in the direction of the recommended range in the case where the output range deviates from the recommended range in a state where there is no viewpoint offset caused by the user operation. Therefore, the combined use of the recommended viewpoint and the free viewpoint can be realized without displaying the position of the recommended viewpoint by using a frame or the like.

<2. applications >

The techniques according to the present disclosure are applicable to a variety of products. For example, techniques according to the present disclosure are applicable to operating room systems.

Fig. 9 is a view schematically showing the general configuration of an operating room system 5100 to which the technique according to the embodiment of the present disclosure can be applied. Referring to fig. 9, an operating room system 5100 is configured such that a set of devices installed in an operating room are connected through an Audio Visual (AV) controller 5107 and an operating room control device 5109 to cooperate with each other.

In an operating room, various devices may be installed. In fig. 9, various device groups 5101 for endoscopic surgery, a ceiling camera 5187, an operation field camera 5189, a plurality of display devices 5103A to 5103D, a recorder 5105, a patient bed 5183, and illumination 5191 are shown as an example. A ceiling camera 5187 is disposed on the ceiling of the operating room and images the surgeon's hand. The surgery field camera 5189 is provided on the ceiling of the surgery room, and images the state of the whole surgery room.

Among the mentioned devices, the device group 5101 belongs to an endoscopic surgery system 5113 described below, and includes an endoscope, a display device that displays an image picked up by the endoscope, and the like. Various devices belonging to the endoscopic surgery system 5113 are also referred to as medical equipment. Meanwhile, the display devices 5103A to 5103D, the recorder 5105, the bed 5183, and the illumination 5191 are devices provided in an operating room separately from the endoscopic surgery system 5113, for example. Devices that do not belong to the endoscopic surgical system 5113 are also referred to as non-medical devices. The audiovisual controller 5107 and/or the operating room control device 5109 control the operations of the medical device and the non-medical device in cooperation with each other.

The audiovisual controller 5107 integrally controls processing of medical devices and non-medical devices related to image display. Specifically, each of the device group 5101, the ceiling camera 5187, and the surgery field camera 5189 of the devices provided in the operating room system 5100 may be an apparatus having a function of transmitting information to be displayed during surgery (such information is hereinafter referred to as display information, and the mentioned device is hereinafter referred to as a device of a transmission source). Meanwhile, each of the display devices 5103A to 5103D may be a device to which display information is output (the device is hereinafter also referred to as a device of an output destination). The recorder 5105 may be a device that serves as both a source device and an output destination device. The viewing controller 5107 has a function of controlling the operations of the source device and the output destination device to acquire display information from the source device and transmit the display information to the output destination device for display or recording. It should be noted that the display information includes various images picked up during the operation and various information related to the operation (for example, physical information of the patient, a result of past examination, or information on the operation step), and the like.

Specifically, with the audiovisual controller 5107, information relating to an image of an operation region in a body cavity of a patient imaged by an endoscope may be transmitted from the device group 5101 as display information. Also, from the ceiling camera 5187, information about the image of the surgeon's hand picked up by the ceiling camera 5187 can be transmitted as display information. Also, from the surgical field camera 5189, information that relates to an image picked up by the surgical field camera 5189 and shows the state of the entire operating room may be transmitted as display information. It should be noted that if a different device having an image pickup function exists in the operating room system 5100, the audiovisual controller 5107 may also acquire information relating to an image picked up by the different device from the different device as display information.

Alternatively, for example, in the recorder 5105, information on such an image picked up in the past is recorded by the av controller 5107. The av controller 5107 can acquire information on images picked up in the past from the recorder 5105 as display information. It should be noted that various pieces of information relating to the operation may also be recorded in advance in the recorder 5105.

The av controller 5107 controls at least one of the display devices 5103A to 5103D, which are devices of output destinations, to display the acquired display information (i.e., images picked up during a surgery or pieces of information relating to the surgery). In the example shown, the display device 5103A is a display device mounted to be suspended from a ceiling of an operating room; the display device 5103B is a display device mounted on a wall surface of the operating room; the display device 5103C is a display device mounted on a table of an operating room; and the display device 5103D is a mobile device (e.g., a tablet Personal Computer (PC)) having a display function.

Also, although not shown in fig. 9, the operating room system 5100 may include devices external to the operating room. The device outside the operating room may be, for example, a server connected to a network built inside and outside a hospital, a PC used by medical staff, or a projector installed in a hospital conference room, or the like. In the case where such an external device is located outside the hospital, the audiovisual controller 5107 can also cause display information to be displayed on display devices of different hospitals through a teleconference system or the like to perform remote medical treatment.

The operating room control device 5109 integrally controls processing other than processing related to image display on the non-medical device. For example, the operating room control device 5109 controls driving of a bed 5183, a ceiling camera 5187, an operating field camera 5189, and lighting 5191.

In the operating room system 5100, a centralized operation panel 5111 is provided so that an instruction for image display can be issued to the viewing controller 5107 or an instruction for operation of a non-medical device can be issued to the operating room control device 5109 via the centralized operation panel 5111. The collective operation panel 5111 is configured by providing a touch panel on the display surface of the display device.

Fig. 10 is a diagram showing an example of display of an operation screen image on the collective operation panel 5111. In fig. 10, an operation screen image corresponding to a case where two display devices are set as devices of output destinations in the operating room system 5100 is shown. Referring to fig. 10, operation screen image 5193 includes a transmission source selection area 5195, a preview area 5197, and a control area 5201.

In the transmission source selection area 5195, a transmission source device provided in the operating room system 5100 and a thumbnail screen image showing display information possessed by the transmission source device are displayed in association with each other. The user can select display information to be displayed on the display device from any one of the transmission source devices displayed in the transmission source selection area 5195.

In the preview area 5197, a preview of screen images displayed on two display devices (the monitor 1 and the monitor 2) as devices of an output destination is displayed. In the illustrated example, with respect to one display device, four images are displayed by picture-in-picture (PinP) display. The four images correspond to display information transmitted from the transmission source device selected in the transmission source selection area 5195. One of the four images is displayed as a main image in a relatively large size, and the remaining three images are displayed as sub-images in a relatively small size. The user can exchange between the main image and the sub image by appropriately selecting one of the images from among the four images displayed in the area. Also, a state display area 5199 is provided below the area in which the four images are displayed, and states related to the operation (for example, elapsed time of the operation, physical information of the patient, and the like) can be appropriately displayed in the state display area 5199.

A transmission source operation area 5203 and an output destination operation area 5205 are set in the control area 5201. In the transmission source operation area 5203, a Graphical User Interface (GUI) portion for performing an operation on the device of the transmission source is displayed. In the output destination operation area 5205, a GUI part for performing an operation on the output destination device is displayed. In the illustrated example, a GUI part for performing various operations (panning, tilting, and zooming) on a camera in an apparatus having a transmission source of an image pickup function is provided in the transmission source operation area 5203. The user can control the action of the camera of the device of the transmission source by appropriately selecting any one of the GUI parts. It should be noted that, although not shown, in a case where the device of the transmission source selected in the transmission source selection area 5195 is a recorder (i.e., in a case where images recorded in the recorder in the past are displayed in the preview area 5197), a GUI part for performing operations such as reproduction of images, stop of reproduction, rewinding and fast-forwarding, etc., may be provided in the transmission source operation area 5203.

Also, in the output destination operation area 5205, a GUI part for performing various display operations (swapping, flipping, color adjustment, contrast adjustment, and switching between two-dimensional (2D) display and three-dimensional (3D) display) for display on a display device that is an apparatus of an output destination is set. The user can operate the display of the display device by appropriately selecting any one of the GUI sections.

It should be noted that the operation screen image to be displayed on the collective operation panel 5111 is not limited to the illustrated example, and the user may be able to perform an operation input through the collective operation panel 5111 to each device that can be controlled by the audiovisual controller 5107 and the operating room control device 5109 provided in the operating room system 5100.

Fig. 11 is a diagram showing an example of a state of an operation to which the above-described operating room system is applied. A ceiling camera 5187 and an operating field camera 5189 are provided on the ceiling of the operating room so that it can image the hands of a surgeon (doctor) 5181 performing treatment on the affected area of a patient 5185 on a bed 5183 and the entire operating room. The ceiling camera 5187 and the theater camera 5189 may include a magnification adjustment function, a focal length adjustment function, an imaging direction adjustment function, and the like. The illumination 5191 is disposed on the ceiling of the operating room and shines at least onto the surgeon's 5181 hand. The illumination 5191 may be configured so that the irradiation light amount, the wavelength (color) of the irradiation light, the irradiation direction of the light, and the like can be appropriately adjusted.

As shown in fig. 9, the endoscopic surgery system 5113, the patient bed 5183, the ceiling camera 5187, the theater camera 5189, and the lighting 5191 are connected to each other through the audiovisual controller 5107 and the operating room control device 5109 (not shown in fig. 11) to cooperate. The collective operation panel 5111 is provided in an operating room, and a user can appropriately operate the devices existing in the operating room through the collective operation panel 5111 as described above.

Hereinafter, the configuration of the endoscopic surgery system 5113 is described in detail. As shown, the endoscopic surgical system 5113 includes an endoscope 5115, other surgical tools 5131, a support arm device 5141 on which the endoscope 5115 is supported, and a cart 5151 on which various endoscopic surgical devices are mounted.

In endoscopic surgery, instead of incising the abdominal wall to perform laparotomy, a plurality of tubular aperture devices called trocars 5139a to 5139d are used to pierce the abdominal wall. Then, the lens barrel 5117 of the endoscope 5115 and other surgical tools 5131 are inserted into the body cavity of the patient 5185 through the trocars 5139a to 5139 d. In the example shown, as other surgical tools 5131, a pneumoperitoneum tube 5133, an energy device 5135, and forceps 5137 are inserted into a body cavity of a patient 5185. Also, the energy device 5135 is a treatment tool for performing incision and dissection of tissue, sealing of a blood vessel, or the like by high-frequency current or ultrasonic vibration. However, the illustrated surgical tool 5131 is merely an example, and as the surgical tool 5131, various surgical tools commonly used in endoscopic surgery, such as, for example, forceps or a retractor, may be used.

An image of the surgical region in the body cavity of the patient 5185 picked up by the endoscope 5115 is displayed on the display device 5155. The surgeon 5181 will use the energy device 5135 or forceps 5137 while viewing the image of the surgical region displayed on the display device 5155 on a real-time basis to perform a treatment such as, for example, ablation of the affected region. It should be noted that although not shown, the veress tube 5133, energy device 5135, and forceps 5137 are supported by the surgeon 5181 or an assistant or the like during the procedure.

(support arm device)

The support arm device 5141 includes an arm unit 5145 extending from a base unit 5143. In the illustrated example, the arm unit 5145 includes joint portions 5147a, 5147b, and 5147c and links 5149a and 5149b, and is driven under the control of the arm control device 5159. The endoscope 5115 is supported by the arm unit 5145 so that the position and posture of the endoscope 5115 are controlled. Therefore, stable fixation of the position of the endoscope 5115 can be achieved.

(endoscope)

The endoscope 5115 includes a lens barrel 5117 to be inserted into a body cavity of a patient 5185 from an area of a predetermined length of a distal end thereof, and a camera head 5119 connected to a proximal end of the lens barrel 5117. In the illustrated example, the endoscope 5115 is shown as a rigid endoscope having a hard type lens barrel 5117. However, the endoscope 5115 may be additionally configured as a flexible endoscope having a flexible type lens barrel 5117.

The lens barrel 5117 has an opening at its distal end that fits the objective lens. The light source device 5157 is connected to the endoscope 5115 such that light generated by the light source device 5157 is guided to the distal end of the lens barrel 5117 through a light guide extending inside the lens barrel 5117 and is applied toward an observation target in a body cavity of the patient 5185 through the objective lens. It should be noted that the endoscope 5115 may be a forward-looking endoscope or may be a tilted-view endoscope or a side-viewing endoscope.

The optical system and the image pickup element are disposed inside the camera head 5119 so that reflected light (observation light) from the observation target is condensed on the image pickup element through the optical system. The observation light is photoelectrically converted by the image pickup element to generate an electric signal corresponding to the observation light, that is, an image signal corresponding to an observation image. The image signal is transmitted to the CCU5153 as RAW data. It should be noted that the camera head 5119 has a function of adding thereto an optical system for appropriately driving the camera head 5119 to adjust the magnification and the focal length.

It should be noted that in order to establish compatibility with, for example, stereoscopic vision (3D display), a plurality of image pickup elements may be provided on the camera head 5119. In this case, in order to guide observation light to a plurality of corresponding image pickup elements, a plurality of relay optical systems may be provided inside the lens barrel 5117.

(various devices incorporated in the cart)

The CCU5153 includes a Central Processing Unit (CPU), a Graphics Processing Unit (GPU), or the like, and integrally controls the actions of the endoscope 5115 and the display device 5155. Specifically, the CCU5153 performs various image processes for displaying an image based on an image signal, such as, for example, a development process (demosaicing process), on the image signal received from the camera head 5119. The CCU5153 supplies the display device 5155 with an image signal on which image processing is performed. Also, the av controller 5107 shown in fig. 9 is connected to the CCU 5153. The CCU5153 also supplies the image signal on which the image processing has been performed to the audiovisual controller 5107. Also, the CCU5153 transmits a control signal to the camera head 5119 to control the driving of the camera head 5119. The control signal may include information related to an image pickup condition, such as a magnification or a focal length. Information about image pickup conditions may be input through the input device 5161, or may be input through the collective operation panel 5111 described above.

The display device 5155 displays an image based on the image signal on which the image processing is performed by the CCU5153 under the control of the CCU 5153. If the endoscope 5115 is ready to perform high-resolution imaging such as 4K (the number of horizontal pixels 3840 × the number of vertical pixels 2160) or 8K (the number of horizontal pixels 7680 × the number of vertical pixels (4320), etc.) and/or is ready for 3D display, a display device capable of performing corresponding display of high-resolution and/or 3D display may be used as the display device 5155. In the case where the device is ready for high-resolution imaging such as 4K or 8K, if the display device serving as the display device 5155 has a size equal to or not less than 55 inches, a more immersive experience can be obtained. Also, a plurality of display devices 5155 having different resolutions and/or different sizes may be provided according to purposes.

The light source device 5157 includes a light source such as, for example, a Light Emitting Diode (LED), and supplies irradiation light for imaging of the operation region to the endoscope 5115.

The arm control device 5159 includes a processor such as, for example, a CPU, and acts according to a predetermined program to control the driving of the arm unit 5145 supporting the arm device 5141 according to a predetermined control method.

The input device 5161 is an input interface for the endoscopic surgical system 5113. The user can perform input of various types of information or instructions input to the endoscopic surgical system 5113 through the input device 5161. For example, the user will input various types of information related to the surgery, such as physical information of the patient and information on the surgical steps of the surgery, etc., through the input device 5161. Also, the user inputs, for example, an instruction to drive the arm unit 5145, an instruction to change the image pickup condition (irradiation light type, magnification, focal length, or the like) of the endoscope 5115, an instruction to drive the energy device 5135, or the like, through the input device 5161.

The type of the input device 5161 is not limited and may be any of various known input devices. As the input device 5161, for example, a mouse, a keyboard, a touch panel, a switch, a foot switch 5171, a lever, or the like can be applied. In the case of using a touch panel as the input device 5161, it may be provided on the display surface of the display device 5155.

Alternatively, the input device 5161 is a device to be mounted on the user, such as, for example, a glasses-type wearable device or a head-mounted display (HMD), and performs various types of input in response to a gesture or line of sight of the user detected by any of the mentioned devices. Also, the input device 5161 includes a camera that can detect the motion of the user, and performs various types of input in response to the gesture or line of sight of the user detected from a video picked up by the camera. Also, the input device 5161 includes a microphone that can collect the voice of the user, and various types of input are performed in voice through the microphone. By configuring the input device 5161 in this way, it is possible to input various types of information in a non-contact manner, and in particular, a user (for example, surgeon 5181) belonging to a clean area can operate a device belonging to a non-clean area in a non-contact manner. Also, since the user can operate the apparatus without releasing the held surgical tool from his hand, the user's convenience is improved.

The treatment tool control device 5163 controls driving of the energy device 5135 for cauterization or incision of tissue, sealing of blood vessels, or the like. To ensure the field of view of the endoscope 5115 and to ensure the working space of the surgeon, the pneumoperitoneum device 5165 feeds gas into the body cavity of the patient 5185 through the pneumoperitoneum tube 5133 to inflate the body cavity. The recorder 5167 is a device capable of recording various types of information relating to the operation. The printer 5169 is a device capable of printing various types of information relating to the operation in various types of forms such as text, images, or graphics.

Hereinafter, in particular, the characteristic configuration of the endoscopic surgical system 5113 is described in more detail.

(support arm device)

The support arm device 5141 includes a base unit 5143 serving as a base and an arm unit 5145 extending from the base unit 5143. In the illustrated example, the arm unit 5145 includes a plurality of joint portions 5147a, 5147b, and 5147c and a plurality of links 5149a and 5149b connected to each other by the joint portions 5147 b. In fig. 11, for simplification of explanation, the configuration of the arm unit 5145 is depicted in simplified form. In fact, the shapes, the number, and the arrangement of the joint portions 5147a to 5147c and the links 5149a and 5149b, the direction of the rotational axis of the joint portions 5147a to 5147c, and the like may be appropriately set so that the arm unit 5145 has a desired degree of freedom. For example, it may be preferable to include the arm unit 5145 such that it has a degree of freedom equal to or not less than 6 degrees of freedom. This enables the endoscope 5115 to be freely moved within the movable range of the arm unit 5145. Accordingly, it becomes possible to insert the lens barrel 5117 of the endoscope 5115 into the body cavity of the patient 5185 from a desired direction.

Actuators are provided in the joint portions 5147a to 5147c, and the joint portions 5147a to 5147c are included so that they can be rotated about predetermined rotation axes thereof by the drive of the actuators. The driving of the actuator is controlled by the arm control device 5159 to control the rotation angle of each of the joint portions 5147a to 5147c, thereby controlling the driving of the arm unit 5145. Therefore, control of the position and posture of the endoscope 5115 can be realized. Accordingly, the arm control device 5159 may control the driving of the arm unit 5145 by various known control methods such as force control or position control.

For example, if the surgeon 5181 appropriately performs an operation input through the input device 5161 (including the foot switch 5171), the driving of the arm unit 5145 may be appropriately controlled by the arm control device 5159 in response to the operation input to control the position and posture of the endoscope 5115. After the endoscope 5115 at the distal end of the arm unit 5145 is moved from an arbitrary position to a different arbitrary position by the control just described, the endoscope 5115 can be fixedly supported at the moved position. It should be noted that the arm unit 5145 can be operated in a master-slave manner. In this case, the user can remotely control the arm unit 5145 through the input device 5161 located at a place remote from the operating room.

Also, in the case of applying force control, the arm control means 5159 may perform power assist control to drive the actuators of the joint portions 5147a to 5147c so that the arm unit 5145 may receive an external force from the user and move smoothly following the external force. This enables the arm unit 5145 to be moved with a relatively weak force when the user directly contacts and moves the arm unit 5145. Therefore, it becomes possible for the user to move the endoscope 5115 more intuitively by a simpler and easier operation, and the convenience of the user can be improved.

Here, generally in an endoscopic surgery, the endoscope 5115 is supported by a doctor called an optician (scope). In contrast, in the case of using the support arm device 5141, the position of the endoscope 5115 can be fixed with a higher degree of certainty without the need for hands, and therefore, an image of the operation area can be stably obtained, and the operation can be smoothly performed.

It should be noted that the arm control device 5159 may not necessarily be provided on the cart 5151. Also, the arm control device 5159 may not necessarily be a single device. For example, an arm control device 5159 may be provided in each of the joint portions 5147a to 5147c of the arm unit 5145 of the support arm device 5141 such that the plurality of arm control devices 5159 cooperate with each other to achieve drive control of the arm unit 5145.

(light source device)

The light source device 5157 provides illumination light to the endoscope 5115 when imaging the surgical field. The light source device 5157 includes a white light source including, for example, an LED, a laser light source, or a combination thereof. In this case, in the case where the white light source includes a combination of red, green, and blue (RGB) laser light sources, since the output intensity and the output timing can be controlled with high accuracy for each color (each wavelength), white balance adjustment of a picked-up image can be performed by the light source device 5157. Also in this case, if laser beams from the RGB laser light sources are applied to the observation target time-divisionally and the driving of the image pickup element of the camera head 5119 is controlled in synchronization with the irradiation timing, images respectively corresponding to R, G and B colors can be picked up time-divisionally. According to the method just described, a color image can be obtained even if the image pickup element is not provided with a color filter.

Also, the driving of the light source device 5157 may be controlled so that the intensity of light to be output is changed for each predetermined time. By controlling the driving of the image pickup element of the camera head 5119 in synchronization with the timing of the change in light intensity to time-divisionally acquire images and synthesize the images, an image of a high dynamic range without underexposure blocking shadows and overexposed highlights can be created.

Also, the light source device 5157 may be configured to supply light of a predetermined wavelength band prepared for special light observation. In the special light observation, for example, by utilizing the wavelength dependence of light absorption of the body tissue, by applying light of a wavelength band narrower than the irradiation light (i.e., white light) at the time of ordinary observation, narrow-band light observation (narrow-band imaging) of imaging a predetermined tissue (such as blood vessels of a surface portion of a mucous membrane or the like) with high contrast is performed. Alternatively, in the special light observation, fluorescence observation for obtaining an image from fluorescence generated by irradiation of excitation light may also be performed. In the fluorescence observation, an observation of fluorescence from body tissue (autofluorescence observation) can be performed by irradiating excitation light on the body tissue, or a fluorescence image can be obtained by locally injecting an agent such as indocyanine green (ICG) into the body tissue and irradiating excitation light corresponding to the fluorescence wavelength of the agent on the body tissue. The light source device 5157 may be configured to supply such narrow-band light and/or excitation light suitable for the special light observation described above.

(Camera head and CCU)

The functions of the camera head 5119 and the CCU5153 of the endoscope 5115 are described in more detail with reference to fig. 12. Fig. 12 is a block diagram showing an example of the functional configurations of the camera head 5119 and the CCU5153 shown in fig. 11.

Referring to fig. 12, the camera head 5119 has a lens unit 5121, an image pickup unit 5123, a drive unit 5125, a communication unit 5127, and a camera head control unit 5129 as its functions. Also, the CCU5153 has a communication unit 5173, an image processing unit 5175, and a control unit 5177 as its functions. The camera head 5119 and the CCU5153 are connected by a transmission cable 5179 to be able to bidirectionally communicate with each other.

First, a functional configuration of the camera head 5119 is described. The lens unit 5121 is an optical system provided at a connection position of the camera head 5119 and the lens barrel 5117. Observation light taken from the distal end of the lens barrel 5117 is introduced into the camera head 5119 and enters the lens unit 5121. The lens unit 5121 includes a combination of a plurality of lenses including a zoom lens and a focus lens. The lens unit 5121 has optical characteristics adjusted so that observation light is condensed on a light receiving surface of an image pickup element of the image pickup unit 5123. Also, the zoom lens and the focus lens include such that their positions on their optical axes are movable to adjust the magnification and focus of a picked-up image.

The image pickup unit 5123 includes an image pickup element, and is disposed at a rear stage of the lens unit 5121. Observation light having passed through the lens unit 5121 is condensed on a light receiving surface of the image pickup element, and an image signal corresponding to an observation image is generated by photoelectric conversion. The image signal generated by the image pickup unit 5123 is supplied to the communication unit 5127.

As an image pickup element included by the image pickup unit 5123, an image sensor of, for example, a Complementary Metal Oxide Semiconductor (CMOS) type having a Bayer array and capable of picking up a color image is used. It should be noted that as the image pickup element, for example, an image pickup element that is ready for imaging a high-resolution image equal to or not less than 4K may be used. If an image of the operation region is obtained at high resolution, the surgeon 5181 can understand the state of the operation region with enhanced detail and can perform the operation more smoothly.

Also, the image pickup element included by the image pickup unit 5123 is configured such that it has a pair of image pickup elements for acquiring image signals of right and left eyes compatible with 3D display. In the case of applying the 3D display, the surgeon 5181 can understand the depth of the living tissue in the operation region with higher accuracy. It should be noted that if the image pickup unit 5123 is configured as a multi-plate type image pickup unit, a plurality of systems of the lens unit 5121 are provided corresponding to the respective individual image pickup elements of the image pickup unit 5123.

The image pickup unit 5123 may not necessarily be provided on the camera head 5119. For example, the image pickup unit 5123 may be disposed just behind the objective lens inside the lens barrel 5117.

The driving unit 5125 includes an actuator, and moves the zoom lens and the focus lens of the lens unit 5121 by a predetermined distance along the optical axis under the control of the camera head control unit 5129. Accordingly, the magnification and focus of the image picked up by the image pickup unit 5123 can be appropriately adjusted.

The communication unit 5127 includes communication means for transmitting and receiving various types of information to and from the CCU 5153. The communication unit 5127 transmits the image signal acquired from the image pickup unit 5123 to the CCU5153 as RAW data via the transmission cable 5179. Therefore, in order to display the picked-up image of the operation area with low delay, it is preferable to transmit the image signal through optical communication. This is because, since the surgeon 5181 performs the operation while observing the state of the affected area through the picked-up image at the time of the operation, in order to realize the operation of higher degree of safety and certainty, it is required to display the moving image of the operation area in real time as much as possible. In the case of applying optical communication, a photoelectric conversion module for converting an electric signal into an optical signal is provided in the communication unit 5127. After the image signal is converted into an optical signal by the photoelectric conversion module, it is transmitted to the CCU5153 through the transmission cable 5179.

Also, the communication unit 5127 receives a control signal for controlling the driving of the camera head 5119 from the CCU 5153. The control signal includes information related to image pickup conditions, such as, for example, information specifying a frame rate of a picked-up image, information specifying an exposure value at the time of image pickup, and/or information specifying a magnification and a focus of the picked-up image. The communication unit 5127 supplies the received control signal to the camera head control unit 5129. It should be noted that the control signal from CCU5153 may also be transmitted through optical communication. In this case, a photoelectric conversion module for converting an optical signal into an electrical signal is provided in the communication unit 5127. After the control signal is converted into an electric signal by the photoelectric conversion module, it is supplied to the camera head control unit 5129.

It should be noted that image pickup conditions such as a frame rate, an exposure value, magnification, or focus are automatically set by the control unit 5177 of the CCU5153 based on the acquired image signal. In other words, an Auto Exposure (AE) function, an Auto Focus (AF) function, and an Auto White Balance (AWB) function are added to the endoscope 5115.

The camera head control unit 5129 controls driving of the camera head 5119 based on a control signal from the CCU5153 received through the communication unit 5127. For example, the camera head control unit 5129 controls driving of the image pickup element of the image pickup unit 5123 based on information specifying the frame rate of picked-up images and/or information specifying the exposure value at the time of image pickup. Also, for example, the camera head control unit 5129 controls the drive unit 5125 to appropriately move the zoom lens and the focus lens of the lens unit 5121 based on information specifying the magnification and focus of a picked-up image. The camera head control unit 5129 may include a function for storing information for identifying the lens barrel 5117 and/or the camera head 5119.

It should be noted that the camera head 5119 can be made to have the capability of withstanding the autoclave process by providing components such as the lens unit 5121 and the image pickup unit 5123 in a sealed structure having high airtightness and high water resistance.

Now, the functional configuration of the CCU5153 is described. The communication unit 5173 includes communication means for transmitting and receiving various types of information to and from the camera head 5119. The communication unit 5173 receives an image signal transmitted thereto from the camera head 5119 through the transmission cable 5179. Thus, the image signal can be preferably transmitted through the above-described optical communication. In this case, in order to be compatible with optical communication, the communication unit 5173 includes a photoelectric conversion module for converting an optical signal into an electrical signal. The communication unit 5173 supplies the image signal converted into the electric signal to the image processing unit 5175.

Also, the communication unit 5173 transmits a control signal for controlling the driving of the camera head 5119 to the camera head 5119. The control signal may also be transmitted via optical communication.

The image processing unit 5175 performs various image processes on the image signal in the form of RAW data transmitted thereto from the camera head 5119. The image processing includes various known signal processing such as, for example, development processing, image quality improvement processing (bandwidth enhancement processing, super-resolution processing, Noise Reduction (NR) processing, and/or image stabilization processing), and/or enlargement processing (electronic zoom processing). Also, the image processing unit 5175 performs detection processing on the image signals for performing AE, AF, and AWB.

The image processing unit 5175 includes a processor such as a CPU or a GPU, and when the processor acts according to a predetermined program, the above-described image processing and detection processing can be performed. It should be noted that in the case where the image processing unit 5175 includes a plurality of GPUs, the image processing unit 5175 divides information about image signals appropriately so that image processing is performed by the plurality of GPUs in parallel.

The control unit 5177 performs various types of control related to image pickup of the surgical field by the endoscope 5115 and display of the picked-up image. For example, the control unit 5177 generates a control signal for controlling the driving of the camera head 5119. Therefore, if the image pickup condition is input by the user, the control unit 5177 generates a control signal based on the input of the user. Alternatively, in the case where the endoscope 5115 has an AE function, an AF function, and an AWB function incorporated therein, the control unit 5177 appropriately calculates an optimum exposure value, a focal length, and a white balance in response to the result of the detection processing of the image processing unit 5175, and generates control signals.

Also, the control unit 5177 controls the display device 5155 to display an image of the surgical field based on the image signal on which the image processing unit 5175 has performed image processing. Thus, the control unit 5177 recognizes various objects in the operation area image by using various image recognition techniques. For example, the control unit 5177 may recognize a surgical tool such as forceps, a specific living body area, bleeding, fog when using the energy device 5135, and the like by detecting the shape and color of the edge of the object included in the surgical field image, and the like. The control unit 5177, when it controls the display device 5155 to display the operation area image, causes various types of operation support information to be displayed in an overlapping manner with the image of the operation area by using the recognition result. In the case where the operation support information is displayed and presented to the surgeon 5181 in an overlapping manner, the surgeon 5181 can perform the operation more safely and surely.

The transmission cable 5179 that connects the camera head 5119 and the CCU5153 to each other is an electrical signal cable ready for electrical signal communication, an optical fiber ready for optical communication, or a composite cable thereof.

Here, although communication is performed by wired communication using the transmission cable 5179 in the example shown in the diagram, communication between the camera head 5119 and the CCU5153 may be performed by wireless communication. In the case where communication between the camera head 5119 and the CCU5153 is performed by wireless communication, the cable 5179 does not need to be laid in the operating room. Thus, the situation where the transmission cable 5179 interferes with the movement of the medical staff in the operating room can be eliminated.

An example of an operating room system 5100 to which techniques according to embodiments of the present disclosure may be applied is described above. It should be noted that although a case where the medical system to which the operating room system 5100 is applied is the endoscopic surgery system 5113 has been described as an example, the configuration of the operating room system 5100 is not limited to the configuration of the above-described example. For example, instead of the endoscopic surgery system 5113, the operating room system 5100 may be applied to a soft endoscopic system for examination or a microsurgical system.

The technique according to the present disclosure is suitably applied to the display unit constructing the operating room system as a unit displaying the operating area in the above-described configuration. In this case, the operation region corresponds to the recommended range (range corresponding to the recommended viewpoint). When the technique according to the present disclosure is applied to a display unit that displays an operation region, the user is also allowed to appropriately examine the operation region without the visual field being obstructed while examining other lesion sites and surrounding conditions. Also, the technique according to the present disclosure is applicable not only to, for example, input to an HMD during operation, but also to content used after operation. For example, the technique according to the present disclosure is applicable to an operation video taken as teaching material content for medical treatment while designating the operation video as a recommended viewpoint (surgical part, surgeon viewpoint).

<3. modification example >

Although the example in which the current viewpoint is shifted in the direction opposite to the moving direction by the user to approach the recommended viewpoint is described in the above embodiment, it may be considered that the current viewpoint is shifted to the recommended viewpoint in the direction corresponding to the shortest distance. For example, in the case where the current viewpoint exists at a position deviated from the recommended viewpoint by a bent viewpoint shift caused by a user operation as shown in fig. 13(a), the current viewpoint may be linearly shifted toward the recommended viewpoint as shown in fig. 13 (b).

Also, although an example in which the current viewpoint is shifted in the direction opposite to the moving direction by the user to approach the recommended viewpoint is described in the above embodiment, in the case where the occlusion range has been set for the shifted route, it may be considered to shift the current viewpoint along a route that does not pass through the occlusion range. In this case, for example, the range information of the occlusion may be added to the video stream in advance together with the recommended viewpoint information.

Although not described above, it is also conceivable that the display device 200 temporarily accumulates the received video stream in a memory, and reads the video stream from the memory and uses the video stream at any subsequent timing. In this case, it is also possible to set the shielding range on the display apparatus 200 side by a user operation.

For example, in the case where there is a shading range in a direction opposite to the moving direction by the user as shown in fig. 14(a), the current viewpoint is not shifted in the direction approaching the recommended viewpoint. In fig. 14(a) referred to here, a broken-line arrow indicates the offset direction in the case where there is no shielded range. In this case, as shown in fig. 14(b), a route that does not pass through the occlusion range is obtained, and the current viewpoint is shifted along the obtained route to approach the recommended viewpoint. The example shown in the diagram indicates an example in which the current viewpoint of the t-2 frame returns to the recommended viewpoint of the t frame by two frames. Note that the example offset routes shown in the diagram are presented as examples only. It is also conceivable to present a route that returns in the direction opposite to the offset direction shown in the diagram while avoiding the shielded range upward or downward.

The flowchart shown in fig. 15 represents an example of display processing steps performed by the display apparatus 200 in a case where the route does not pass through the mask range.

In step ST11, the display device 200 starts processing in response to a video clip reproduction start operation performed by the user, for example. Subsequently, in step ST12, the display device 200 displays a range corresponding to the recommended viewpoint (recommended range) as an output range. Thereafter, in step ST13, the display device 200 determines whether or not a viewpoint shift is caused by a user operation.

In the case where the viewpoint shift is caused by the user, the display apparatus 200 shifts the viewpoint in the operation direction in step ST14, and then proceeds to the processing in step ST 15. On the other hand, in a case where the user has not performed the viewpoint shifting operation, the display apparatus 200 proceeds to the processing in step ST 16.

In step ST16, the display device 200 calculates a route in the recommended viewpoint direction. Subsequently, in step ST17, the display device 200 determines whether the route passes through the masking range. In the case where the route does not pass through the shielding range, the display device 200 proceeds to the process of step ST 18. On the other hand, in the case where the route passes through the mask range, in step ST19, the display device 200 calculates the route that does not pass through the mask range again, and then proceeds to the processing in step ST 18.

Note that if the display device 200 calculates a route that does not pass through the mask range in step ST16, the processing in steps ST17 and ST9 is unnecessary.

In step ST18, the display apparatus 200 shifts the viewpoint in the recommended viewpoint direction along the calculated route, and then proceeds to the processing in step S15. In step ST15, the display device 200 displays a range corresponding to the current viewpoint as an output range.

Subsequently, in step ST20, the display device 200 determines whether video reproduction or distribution has ended. When it is determined that video reproduction or distribution has not ended, the display device 200 returns to the processing in step ST13 and reflects the processing of the next frame. On the other hand, when it is determined that video reproduction or distribution has ended, the series of processing ends at step ST 21.

Although the case where the occlusion range is located in the route of the viewpoint shift caused by the user operation is not described above, for example, passing through the occlusion range without avoiding the occlusion range or calculating the route avoiding the occlusion range and shifting along the calculated route may also be considered. In addition, in the case where the viewpoint is shifted when passing through the occlusion range, for example, an image in the occlusion range may be occluded or may be displayed without being occluded.

Also, although an example in which the current viewpoint is shifted in a direction opposite to the moving direction by the user to approach the recommended viewpoint is described in the above embodiment, in the case where the passing point is set, it may be considered to shift the current viewpoint along a route that does not pass through the passing point. In this case, for example, the passing point information may be added to the video stream in advance together with the recommended viewpoint information.

Although not described above, it is also conceivable that the display device 200 temporarily accumulates the received video stream in a memory, and reads the video stream from the memory and uses the video stream at any subsequent timing. In this case, it is also conceivable to set a passing point on the display apparatus 200 side by a user operation.

For example, assume that the passing point is set as shown in fig. 16 (a). In this case, as shown in fig. 16(b), a route passing through the passing point is obtained, and the current viewpoint is shifted to approach the recommended viewpoint along the route. The example shown in the diagram is an example in which the current viewpoint of the t-4 frame returns to the recommended viewpoint of the t frame through four frames.

The flowchart shown in fig. 17 represents an example of display processing steps performed by the display apparatus 200 in the case where the route passes through the passing point.

In step ST31, the display device 200 starts processing in response to a video clip reproduction start operation performed by the user, for example. Subsequently, in step ST32, the display device 200 displays a range corresponding to the recommended viewpoint (recommended range) as an output range. Thereafter, in step ST33, the display device 200 determines whether or not a viewpoint shift is caused by a user operation.

In the case where the viewpoint shift is caused by the user, the display apparatus 200 shifts the viewpoint in the operation direction in step ST34, and then proceeds to the processing in step ST 35. On the other hand, in a case where the viewpoint shifting operation by the user is not performed, the display apparatus 200 proceeds to the processing in step ST 36.

In step ST36, the display device 200 calculates a route extending in the recommended viewpoint direction and passing through the passing point. Subsequently, in step ST37, the display device 200 shifts the viewpoint in the recommended viewpoint direction along the calculated route, and then proceeds to the processing in step ST 35. In step ST35, the display device 200 displays a range corresponding to the current viewpoint as an output range.

Subsequently, in step ST38, the display device 200 determines whether video reproduction or distribution has ended. When it is determined that video reproduction or distribution has not ended, the display device 200 returns to the processing in step ST33 and reflects the processing of the next frame. On the other hand, when it is determined that video reproduction or distribution has ended, the series of processing ends at step ST 39.

Also, although the case where there is one recommended viewpoint is described by way of example in the above embodiment, there may be a case where there are a plurality of recommended viewpoints. For example, a plurality of actors may appear in the same image, and the positions of the respective actors may be set as recommended viewpoints.

In the case where there are a plurality of recommended viewpoints as described above, it may be considered to shift to a recommended viewpoint located closest among the plurality of recommended viewpoints or to a recommended viewpoint previously selected by the user when returning to the recommended viewpoint from the current viewpoint.

Fig. 18(a) shows an example in which there are a plurality of recommended viewpoints. Fig. 18(b) shows an example in which the current viewpoint is shifted to the recommended viewpoint closest in position among the plurality of recommended viewpoints when returning from the current viewpoint to the recommended viewpoint. In this example, the recommended viewpoint RVP (3) is located closest. Therefore, the current viewpoint is shifted to the recommended viewpoint RVP (3). Fig. 18(c) shows an example in which the current viewpoint is shifted from the recommended viewpoint selected by the user among the plurality of recommended viewpoints when returning from the current viewpoint to the recommended viewpoint. In this example, the user selects the recommended viewpoint RVP (1). Therefore, the current viewpoint is shifted to the recommended viewpoint RVP (1).

The flowchart in fig. 19 gives an example of display processing steps performed by the display device 200 in a case where the current viewpoint is shifted to the recommended viewpoint positioned closest among the plurality of recommended viewpoints.

In step ST41, the display device 200 starts processing in response to a video clip reproduction start operation performed by the user, for example. Subsequently, the display device 200 in step ST42 displays a range corresponding to the recommended viewpoint (recommended range) as an output range. For example, the recommended viewpoint here is a recommended viewpoint determined in advance.

Thereafter, in step ST43, the display device 200 determines whether or not a viewpoint shift is caused by a user operation. In the case where the viewpoint shift is caused by the user, the display apparatus 200 shifts the viewpoint in the operation direction in step ST44, and then proceeds to the processing in step ST 45.

On the other hand, in a case where the user has not performed the viewpoint shifting operation, the display apparatus 200 proceeds to the processing in step ST 46. In step ST46 herein, the display device 200 calculates the distance between the current viewpoint and each of the recommended viewpoints RVP (1 to N).

Thereafter, the display apparatus 200 calculates a route in the recommended viewpoint direction that most approximates the current viewpoint in step ST47, shifts the viewpoint in the recommended viewpoint direction along the calculated route, and then proceeds to the processing in step ST 45. In step ST45, the display device 200 displays a range corresponding to the current viewpoint as an output range.

Subsequently, the display device 200 determines in step ST48 whether video reproduction or distribution is ended. When it is determined that video reproduction or distribution has not ended, the display device 200 returns to the processing in step ST43, and reflects the processing of the next frame. On the other hand, when it is determined that the video reproduction or distribution is ended, the series of processes ends in step ST 49.

The flowchart in fig. 20 shows an example of display processing steps performed by the display device 200 in the case of shifting to a recommended viewpoint selected by the user among a plurality of recommended viewpoints.

In step ST51, the display device 200 starts processing in response to a video clip reproduction start operation performed by the user, for example. Subsequently, the display device 200 sets the recommended viewpoint number (RVP number) i to an initial value such as "1" in step ST 52. Thereafter, the display device 200 displays the range (recommended range) corresponding to the recommended viewpoint rvp (i) as the output range in step ST 53.

Then, the display apparatus 200 determines in step ST54 whether the recommended viewpoint is selected by a user operation. In the case where the recommended viewpoint is not selected by the user operation, the display apparatus 200 immediately proceeds to the processing in step ST 55. On the other hand, when it is determined that the recommended viewpoint is selected by the user operation, the display apparatus 200 changes the recommended viewpoint number (RVP number) i to the number selected by the user in step ST55, and then proceeds to the processing in step ST 55.

Thereafter, the display device 200 determines in step ST55 whether a viewpoint shift is caused by a user operation. In the case where the viewpoint shift is caused by the user, the display apparatus 200 shifts the viewpoint in the operation direction in step ST56, and then proceeds to the processing in step ST 57.

On the other hand, in the case where the user has not caused the viewpoint shifting operation, the display apparatus 200 calculates a route in the direction toward the recommended viewpoint rvp (i) in step ST58, shifts the viewpoint in the recommended viewpoint direction along the calculated route, and then proceeds to the processing in step ST 57. In step ST57, the display device 200 displays a range corresponding to the current viewpoint as an output range.

Subsequently, the display device 200 determines in step ST59 whether video reproduction or distribution has ended. When it is determined that video reproduction or distribution has not ended, the display device 200 returns to the processing in step ST54 and reflects the processing of the next frame. On the other hand, when it is determined that video reproduction or distribution has ended, the series of processing ends at step ST 60.

Also, in the above embodiments, an example is described in which the current viewpoint is shifted in the recommended viewpoint direction based on a condition that the current viewpoint (output range) is deviated from the recommended viewpoint (recommended range), that is, the current viewpoint is different from the recommended viewpoint.

The flowchart in fig. 21 shows an example of display processing steps performed by the display device 200 in a case where the recommended viewpoint is shifted to the recommended viewpoint based on whether or not the recommended viewpoint is displayed. In step ST71, the display device 200 starts processing in response to a video clip reproduction start operation performed by the user, for example.

Subsequently, in step ST72, the display device 200 displays a range corresponding to the recommended viewpoint (recommended range) as an output range. Then, in step ST73, the display device 200 determines whether the current viewpoint is the recommended viewpoint.

In the case where the current viewpoint is the recommended viewpoint, the display apparatus 200 proceeds to the processing in step ST 74. On the other hand, in the case where the current viewpoint is not the recommended viewpoint, the display apparatus 200 calculates a route in the recommended viewpoint direction in step ST75, shifts the viewpoint in the recommended viewpoint direction along the calculated route, and then proceeds to the processing in step ST 74. In step ST74, the display device 200 displays a range corresponding to the current viewpoint as an output range.

Subsequently, in step ST76, the display device 200 determines whether video reproduction or distribution has ended. When it is determined that video reproduction or distribution has not ended, the display device 200 returns to the processing in step ST73 and reflects the processing of the next frame. On the other hand, when it is determined that video reproduction or distribution has ended, the series of processing ends in step ST 77.

Note that, in the flowchart of fig. 21, the recommended viewpoint is displayed in step S72 and it is determined whether the current viewpoint is the recommended viewpoint in step ST 73. These operations are performed under the assumption that it may occur that the current viewpoint is offset from the recommended viewpoint and is located at a position different from the recommended viewpoint based on the fact that the time (frame) in step ST73 includes the time (frame) advanced from step ST 72. Possible examples of the case where the current viewpoint is different from the recommended viewpoint include a case where the user has changed the viewpoint in a frame before the current frame.

Also, each of the following conditions may be regarded as a condition for shifting the current viewpoint in the recommended viewpoint direction.

For example, a condition that a predetermined time has elapsed in a state where the current viewpoint is deviated from the recommended viewpoint, that is, in a state where the current viewpoint is different from the recommended viewpoint may be considered.

The flowchart in fig. 22 shows an example of display processing steps performed by the display device 200 in a case where a predetermined time is shifted to a recommended viewpoint based on whether or not the recommended viewpoint is not displayed. In step ST81, the display device 200 starts processing in response to a video clip reproduction start operation performed by the user, for example.

Subsequently, in step ST82, the display device 200 displays a range corresponding to the recommended viewpoint (recommended range) as an output range. Then, in step ST83, the display device 200 determines whether the current viewpoint is the recommended viewpoint. In the case where the current viewpoint is the recommended viewpoint, the display apparatus 200 proceeds to the processing in step ST 84.

On the other hand, in the case where the current viewpoint is not the recommended viewpoint, the display apparatus 200 determines in step ST85 whether a predetermined time or more has elapsed in a state where the current viewpoint is not the recommended viewpoint. The predetermined time may be set in advance or may be changed to any time by the user. In a case where the predetermined time or longer has not elapsed, the display device 200 proceeds to step ST 84. On the other hand, in the case where the predetermined time or more has elapsed, the display apparatus 200 calculates the route in the recommended viewpoint direction in step ST86, shifts the viewpoint in the recommended viewpoint direction along the calculated route, and then proceeds to the processing in step ST 84.

In step ST84, the display device 200 displays a range corresponding to the current viewpoint as an output range. Subsequently, the display device 200 determines in step ST87 whether video reproduction or distribution has ended. When it is determined that video reproduction or distribution has not ended, the display device 200 returns to the processing in step ST83 and reflects the processing of the next frame. On the other hand, when it is determined that video reproduction or distribution has ended, the series of processing ends in step ST 88.

Also, for example, a case regarded as the offset condition may be a case where a viewpoint position changing instruction issued by a user operation has not been detected. Also, for example, what is regarded as the offset condition may be a case where the current viewpoint is located within a certain range in the image. The specific range here is, for example, an image area in the rear surface direction.

In addition, for example, the case of being regarded as the offset condition may be a case where the output range deviates from the recommended range in a state where an image is included in the image of the specific frame. In this case, for example, whether or not an image is contained in an image of a specific frame may be previously added to a video stream together with recommended viewpoint information. Further, for example, what is considered as the offset condition may be a case where the user issues an instruction to migrate to the recommended viewpoint in a state where the current viewpoint is deviated from the recommended viewpoint.

The flowchart shown in fig. 23 represents an example of display processing steps performed by the display apparatus 200, and in step ST101, the display apparatus 200 starts processing in response to a video clip reproduction start operation performed by a user, for example.

Subsequently, in step ST102, the display device 200 displays a range corresponding to the recommended viewpoint (recommended range) as an output range. Then, in step ST103, the display apparatus 200 determines whether the current viewpoint is the recommended viewpoint. In the case where the current viewpoint is the recommended viewpoint, the display apparatus 200 proceeds to the processing in step ST 111.

On the other hand, in the case where the current viewpoint is not the recommended viewpoint, in step ST104, the display apparatus 200 determines whether or not a predetermined time has elapsed. In the case where the predetermined time has not elapsed, in step ST105, the display apparatus 200 determines whether or not the viewpoint shift is caused by the user operation. In the case where the viewpoint shift is caused, in step ST106, the display device 200 determines whether the shifted viewpoint position is located at a specific position.

This determination is not made after the actual offset, but is used to determine whether the offset destination is located at a specific position at the time of receiving the user operation. For example, examples of the specific position include a position of a rear surface corresponding to the recommended viewpoint and a position at a distance or more from the recommended viewpoint.

In the case where the shifted viewpoint position is not the specific position, in step ST107, the display apparatus 200 determines whether the viewpoint position is contained in the specific frame. In the case where the viewpoint position is not contained in the specific frame, the display apparatus 200 shifts the viewpoint in the operation direction in step ST108, and then proceeds to the processing in step ST 109.

In the case where the predetermined time has elapsed in step ST104, in the case where no viewpoint shift has been caused in step ST105, in the case where the viewpoint position after the shift is the specific position in step ST106, or in the case where the viewpoint position is contained in the specific frame in step ST107, in step ST110, the display apparatus 200 calculates a route in the recommended viewpoint direction, shifts the viewpoint in the recommended viewpoint direction along the calculated route, and then proceeds to the processing in step ST 109. In step ST109, the display device 200 displays a range corresponding to the current viewpoint as an output range.

Subsequently, in step ST111, the display apparatus 200 determines whether video reproduction or distribution has ended. When it is determined that video reproduction or distribution has not ended, the display apparatus 200 returns to the processing in step ST103 and reflects the processing of the next frame. On the other hand, when it is determined that video reproduction or distribution has ended, the series of processing ends in step ST 112.

Also, the case described in the above-described embodiment is such that the speed control is performed such that the approach speed increases as the position difference between the current viewpoint and the recommended viewpoint increases, and the approach speed decreases as the position difference decreases. In other words, the migration speed is made higher when the positional difference is large, as compared with when the positional difference is small. However, for example, it may be considered to perform the speed control in such a manner as to return to the recommended viewpoint within a certain time. Also, for example, it is also conceivable to perform speed control so as to return to the recommended viewpoint at a certain speed.

Also, although the route for transition from the current viewpoint to the recommended viewpoint is interpolated by spherical linear interpolation in the above-described embodiment, it is conceivable to interpolate the transition route by linear interpolation.

Also, the case described in the above-described embodiment is that, at the time of transition from the current viewpoint to the recommended viewpoint, the current viewpoint is shifted so as to gradually approach the recommended viewpoint. However, it may be considered to instantaneously shift the current viewpoint to the recommended viewpoint at the time of the transition.

Also, although in the above-described embodiments, an example of processing a VR image as a wide view image is described, an example of processing a panoramic image may be similarly considered. Also, although an example of processing a two-dimensional image is described in the above-described embodiment, an example of processing a three-dimensional image may be similarly considered.

In addition, although the preferred embodiments of the present invention have been described in detail with reference to the drawings, the technical scope of the present invention is not limited to this example. It is clear to those having ordinary knowledge in the technical field of the present disclosure that various examples of modifications and corrections may occur within the scope of the technical spirit described in the claims. Needless to say, it should be understood that these modifications and corrections also fall within the technical scope of the present disclosure.

Also, the advantageous effects described in this specification are only for giving explanation or example, and are not as limiting advantageous effects. Thus, the techniques according to the present invention may provide other advantageous effects that will be apparent to those skilled in the art from this description, in addition to or in lieu of the advantageous effects described above.

In addition, the present technology may have the following configuration.

(1) An image processing apparatus comprising:

a detection unit detecting viewpoint shift information according to the viewpoint position change instruction;

an output range determination unit that determines an output range of an image containing the recommended viewpoint information based on the recommended viewpoint information and the viewpoint offset information; and

an output unit that outputs a part of the image as a display image to the display unit based on the determined output range,

wherein the output range determination unit determines the output range such that: in a case where the display image satisfies the predetermined condition, the display image is located within the recommended range indicated by the recommended viewpoint information.

(2) The image processing apparatus according to the above (1), wherein the output range determination unit determines the output range such that the display image is shifted in a direction toward the recommended range.

(3) The image processing apparatus according to the above (1) or (2), wherein the predetermined condition includes a case where the display image is different from the recommended range.

(4) The image processing apparatus according to the above (1) or (2), wherein the predetermined condition includes a case where the display image is different from the recommended range for a predetermined time or more.

(5) The image processing apparatus according to the above (3), wherein the predetermined condition further includes a case where the viewpoint shift information is not detected.

(6) The image processing apparatus according to the above (1) or (2), wherein the predetermined condition includes a case where the display image is located within a specific range in the image.

(7) The image processing apparatus according to the above (1) or (2), wherein the predetermined condition includes a case where the display image is different from the recommended range in a state where the image is included in the image of the specific frame.

(8) The image processing apparatus according to the above (1) or (2), wherein the predetermined condition includes a case where the display image is different from the recommended range and an instruction to transition to the recommended range is also issued as the viewpoint offset information.

(9) The image processing apparatus according to any one of the above (2) to (8), in which the output range determination unit controls the speed of the transition based on a positional difference between the display image and the recommended range so that the speed of the transition becomes higher in a case where the positional difference is large than in a case where the positional difference is small.

(10) The image processing apparatus according to any one of the above (2) to (8), wherein the output range determining means controls the speed of transition so that return to the recommended range is achieved within a certain time.

(11) The image processing apparatus according to any one of the above (2) to (8), wherein the output range determination means controls the speed of transition so that return to the recommended range is achieved at a certain speed.

(12) The image processing apparatus according to any one of the above (2) to (11), wherein the output range determining means interpolates the transition route by spherical linear interpolation.

(13) The image processing apparatus according to any one of the above (2) to (11), wherein the output means interpolates the transition route by linear interpolation.

(14) The image processing apparatus according to any one of the above-described (2) to (13), wherein in a case where the display image is different from the recommended range as a result of changing the viewpoint position based on the viewpoint offset information, the output range determination unit determines the output range such that the offset is achieved along a route in a direction opposite to a viewpoint position change direction corresponding to the change of the viewpoint position at the time of the transition.

(15) The image processing apparatus according to any one of the above (2) to (13), wherein the output range determination unit determines the output range such that the shift is realized along the shortest route from the display image to the recommended range at the time of the transition.

(16) The image processing apparatus according to any one of the above (2) to (15), wherein in a case where the image includes mask range information indicating a mask range, the output range determination unit determines the output range so that the shift is realized along a route that does not pass through the mask range at the time of the transition.

(17) The image processing apparatus according to any one of the above (2) to (16), wherein in a case where the image contains passing point information indicating the passing point, the output range determination unit determines the output range so that the offset is realized along a route passing the passing point at the time of the transition.

(18) The image processing apparatus according to any one of the above (1) to (17), wherein in a case where the recommended viewpoint information includes a plurality of recommended ranges, the output range determination unit determines the output range so as to realize a shift toward a recommended range located at a position closest to the display image among the plurality of recommended ranges or toward a recommended range located at a position selected by the user at the time of the transition.

(19) An image processing method comprising:

a step for detecting viewpoint shift information according to the viewpoint position change instruction;

a step for determining an output range of an image containing the recommended viewpoint information based on the recommended viewpoint information and the viewpoint offset information; and

a step for outputting a part of the image as a display image to a display unit based on the determined output range,

wherein, in the step for determining the output range, the output range is determined such that: in a case where the display image satisfies the predetermined condition, the display image is located within the recommended range indicated by the recommended viewpoint information.

(20) A program that causes a computer to function as:

a detection means for detecting the viewpoint shift information based on the viewpoint position change instruction;

an output range determination means for determining an output range of an image containing the recommended viewpoint information based on the recommended viewpoint information and the viewpoint offset information; and

an output means for outputting a part of the image as a display image to the display unit based on the determined output range,

wherein the output range determining means determines the output range such that: in a case where the display image satisfies a predetermined condition, the display image is located within the recommended viewpoint range indicated by the recommended viewpoint information.

[ list of reference numerals ]

10: image distribution system

101: multi-camera

102: wide-view image conversion unit

103: encoder for encoding a video signal

104: distribution server

200: display device

201: receiving unit

202: decoder

203: visual field drawing unit

204: gyro sensor

205: viewpoint offset information calculation unit

206: migrating a computing unit

207: display unit

47页详细技术资料下载
上一篇:一种医用注射器针头装配设备
下一篇:解码器侧运动矢量推导

网友询问留言

已有0条留言

还没有人留言评论。精彩留言会获得点赞!

精彩留言,会给你点赞!

技术分类