Three-dimensional image processing method and device based on neural network and electronic equipment

Document No.: 1954770 · Publication date: 2021-12-10

Reading note: This technology, "Three-dimensional image processing method and device based on neural network and electronic equipment", was designed and created by 罗天文, 戴磊, and 刘玉宇 on 2021-09-15. Its main content is as follows: the invention is applicable to the fields of artificial intelligence and digital healthcare, and discloses a three-dimensional image processing method, apparatus, and electronic device based on a neural network. In the method, first depth information representing a depth image of a target is acquired and input into a deep neural network model, which converts it into target three-dimensional image information of the target. The deep neural network model is trained according to sample three-dimensional image information of a three-dimensional sample and second depth information of the three-dimensional sample; after training, it can accurately convert a two-dimensional depth image into a three-dimensional image corresponding to the original three-dimensional object. Inputting the first depth information into the trained deep neural network model therefore yields target three-dimensional image information of high accuracy, improving the precision of three-dimensional image construction and, in turn, the precision of computer vision recognition.

1. A three-dimensional image processing method based on a neural network is characterized by comprising the following steps:

acquiring first depth information of a target, wherein the first depth information is used for representing a depth image of the target;

inputting the first depth information into an input layer of a deep neural network model, wherein the deep neural network model is obtained by training according to sample three-dimensional image information of a three-dimensional sample and second depth information of the three-dimensional sample, and the second depth information is used for representing a depth image of the three-dimensional sample;

inputting the first depth information into a convolution layer of the deep neural network model for convolution to obtain a first feature value;

inputting the first feature value into a feature-dimension-changing layer of the deep neural network model for feature conversion to obtain a first three-dimensional feature volume;

inputting the first three-dimensional feature volume into a three-dimensional deconvolution layer of the deep neural network model for deconvolution to obtain target three-dimensional image information of the target;

and outputting the target three-dimensional image information.

2. The three-dimensional image processing method based on the neural network according to claim 1, wherein the obtaining of the first depth information of the target includes:

acquiring two-dimensional image information of the target and depth image information corresponding to the two-dimensional image information;

and performing target detection on the two-dimensional image information to identify target image information used for representing the target from the two-dimensional image information, and obtaining corresponding first depth information from the depth image information according to the target image information.

3. The neural network-based three-dimensional image processing method according to claim 1, wherein the deep neural network model is obtained by training according to the following steps:

acquiring the sample three-dimensional image information of the three-dimensional sample;

obtaining the second depth information of the three-dimensional sample according to the sample three-dimensional image information;

inputting the second depth information into the input layer;

inputting the second depth information into the convolution layer for convolution to obtain a second feature value;

inputting the second feature value into the feature-dimension-changing layer to perform feature conversion to obtain a second three-dimensional feature volume;

inputting the second three-dimensional feature volume into the three-dimensional deconvolution layer for deconvolution to obtain training three-dimensional image information of the three-dimensional sample;

inputting the training three-dimensional image information and the sample three-dimensional image information into a loss function to calculate a loss value;

and obtaining a target weight parameter according to the loss value and adjusting the deep neural network model according to the target weight parameter.

4. The neural network-based three-dimensional image processing method according to claim 3, wherein the inputting the second depth information into the input layer includes:

performing a random first augmentation transformation on the second depth information to obtain third depth information, wherein the first augmentation transformation comprises one of adding a random Gaussian noise value, random scaling, random angle rotation, random translation, and random selection of a partial region of the depth image;

inputting the third depth information into the input layer.

5. The neural network-based three-dimensional image processing method according to claim 4, wherein the inputting the training three-dimensional image information and the sample three-dimensional image information into a loss function to calculate a loss value comprises:

converting the sample three-dimensional image information into first mesh information;

performing a second augmentation transformation corresponding to the first augmentation transformation on the first mesh information to obtain second mesh information matching the viewing angle of the third depth information, wherein the second augmentation transformation comprises performing one of scaling, angle rotation, and translation matching the transformation applied to the second depth information;

discretizing the second mesh information into sample three-dimensional voxel information;

inputting the training three-dimensional image information and the sample three-dimensional voxel information into the loss function to calculate the loss value.

6. The method of claim 3, wherein the obtaining a target weight parameter according to the loss value and adjusting the deep neural network model according to the target weight parameter comprises:

optimizing the loss value and performing back-propagation chain-rule differentiation on the optimized loss value to obtain a weight parameter gradient;

and performing gradient descent processing according to the weight parameter gradient to obtain the target weight parameter.

7. The three-dimensional image processing method based on the neural network as claimed in claim 6, wherein the performing gradient descent processing according to the weight parameter gradient to obtain the target weight parameter comprises:

and performing gradient descent processing according to the weight parameter gradient obtained by the last training to obtain the target weight parameter.

8. A three-dimensional image processing apparatus based on a neural network, comprising:

the image acquisition module is used for acquiring first depth information of a target, and the first depth information is used for representing a depth image of the target;

the processing module is connected with the image acquisition module and is used for inputting the first depth information into an input layer of a deep neural network model, wherein the deep neural network model is obtained by training according to sample three-dimensional image information of a three-dimensional sample and second depth information of the three-dimensional sample, and the second depth information is used for representing a depth image of the three-dimensional sample;

the processing module is further configured to input the first depth information into a convolution layer of the deep neural network model for convolution to obtain a first feature value, input the first feature value into a feature-dimension-changing layer of the deep neural network model for feature conversion to obtain a first three-dimensional feature volume, and input the first three-dimensional feature volume into a three-dimensional deconvolution layer of the deep neural network model for deconvolution to obtain target three-dimensional image information of the target;

the processing module is further used for outputting the target three-dimensional image information.

9. An electronic device, comprising a memory and a processor, wherein the memory stores a computer program and the processor, when executing the computer program, implements the neural network-based three-dimensional image processing method according to any one of claims 1 to 7.

10. A computer-readable storage medium, characterized in that the storage medium stores a program which, when executed by a processor, implements the neural network-based three-dimensional image processing method according to any one of claims 1 to 7.

Technical Field

The invention relates to the technical fields of artificial intelligence and digital healthcare, and in particular to a three-dimensional image processing method and device based on a neural network, and electronic equipment.

Background

In computer vision applications in the fields of artificial intelligence and digital healthcare, two-dimensional image information of an object often needs to be converted into a three-dimensional image, that is, a three-dimensional model. For example, face recognition is widely applied in many fields: a terminal acquires image information of a face, including depth image information, and constructs a three-dimensional image from it, thereby obtaining a three-dimensional image of the face, that is, a three-dimensional face model, which improves the accuracy of face recognition. However, in the related art, constructing a three-dimensional image from the depth image and two-dimensional image information acquired by a terminal device has many shortcomings. A depth image records the depth of each position on a two-dimensional plane as seen from a specific viewing angle, and part of a three-dimensional object is always occluded when viewed from any single viewing angle. Simply constructing a three-dimensional image from two-dimensional image information therefore yields an inaccurate three-dimensional image, and the accuracy of computer vision recognition cannot be improved.

Disclosure of Invention

The following is a summary of the subject matter described in detail herein. This summary is not intended to limit the scope of the claims.

The embodiment of the invention provides a three-dimensional image processing method and device based on a neural network, an electronic device, and a storage medium, which can improve the accuracy of constructing a three-dimensional image, thereby improving the accuracy of computer vision recognition.

In a first aspect, an embodiment of the present invention provides a three-dimensional image processing method based on a neural network, including:

acquiring first depth information of a target, wherein the first depth information is used for representing a depth image of the target;

inputting the first depth information into an input layer of a deep neural network model, wherein the deep neural network model is obtained by training according to sample three-dimensional image information of a three-dimensional sample and second depth information of the three-dimensional sample, and the second depth information is used for representing a depth image of the three-dimensional sample;

inputting the first depth information into a convolution layer of the deep neural network model for convolution to obtain a first feature value;

inputting the first feature value into a feature-dimension-changing layer of the deep neural network model for feature conversion to obtain a first three-dimensional feature volume;

inputting the first three-dimensional feature volume into a three-dimensional deconvolution layer of the deep neural network model for deconvolution to obtain target three-dimensional image information of the target;

and outputting the target three-dimensional image information.

In some embodiments, the obtaining the first depth information of the target includes:

acquiring two-dimensional image information of the target and depth image information corresponding to the two-dimensional image information;

and performing target detection on the two-dimensional image information to identify target image information used for representing the target from the two-dimensional image information, and obtaining corresponding first depth information from the depth image information according to the target image information.

In some embodiments, the deep neural network model is trained according to the following steps:

acquiring the sample three-dimensional image information of the three-dimensional sample;

obtaining the second depth information of the three-dimensional sample according to the sample three-dimensional image information;

inputting the second depth information into the input layer;

inputting the second depth information into the convolution layer for convolution to obtain a second feature value;

inputting the second feature value into the feature-dimension-changing layer to perform feature conversion to obtain a second three-dimensional feature volume;

inputting the second three-dimensional feature volume into the three-dimensional deconvolution layer for deconvolution to obtain training three-dimensional image information of the three-dimensional sample;

inputting the training three-dimensional image information and the sample three-dimensional image information into a loss function to calculate a loss value;

and obtaining a target weight parameter according to the loss value and adjusting the deep neural network model according to the target weight parameter.

In some embodiments, said inputting said second depth information into said input layer comprises:

performing a random first augmentation transformation on the second depth information to obtain third depth information, wherein the first augmentation transformation comprises one of adding a random Gaussian noise value, random scaling, random angle rotation, random translation, and random selection of a partial region of the depth image;

inputting the third depth information into the input layer.

In some embodiments, the inputting the training three-dimensional image information and the sample three-dimensional image information into a loss function to calculate a loss value comprises:

converting the sample three-dimensional image information into first mesh information;

performing a second augmentation transformation corresponding to the first augmentation transformation on the first mesh information to obtain second mesh information matching the viewing angle of the third depth information, wherein the second augmentation transformation comprises performing one of scaling, angle rotation, and translation matching the transformation applied to the second depth information;

discretizing the second mesh information into sample three-dimensional voxel information;

inputting the training three-dimensional image information and the sample three-dimensional voxel information into the loss function to calculate the loss value.

In some embodiments, the deriving a target weight parameter according to the loss value and adjusting the deep neural network model according to the target weight parameter includes:

optimizing the loss value and performing back-propagation chain-rule differentiation on the optimized loss value to obtain a weight parameter gradient;

and performing gradient descent processing according to the weight parameter gradient to obtain the target weight parameter.

In some embodiments, the performing a gradient descent process according to the weight parameter gradient to obtain the target weight parameter includes:

and performing gradient descent processing according to the weight parameter gradient obtained by the last training to obtain the target weight parameter.

In a second aspect, an embodiment of the present invention further provides a three-dimensional image processing apparatus based on a neural network, including:

the image acquisition module is used for acquiring first depth information of a target;

the processing module is connected with the image acquisition module and is used for inputting the first depth information into an input layer of a deep neural network model, wherein the deep neural network model is obtained by training according to sample three-dimensional image information of a three-dimensional sample and second depth information of the three-dimensional sample;

the processing module is further configured to input the first depth information into a convolution layer of the deep neural network model for convolution to obtain a first feature value, input the first feature value into a feature-dimension-changing layer of the deep neural network model for feature conversion to obtain a first three-dimensional feature volume, and input the first three-dimensional feature volume into a three-dimensional deconvolution layer of the deep neural network model for deconvolution to obtain target three-dimensional image information of the target;

the processing module is further used for outputting the target three-dimensional image information.

In a third aspect, an embodiment of the present invention further provides an electronic device, which includes a memory and a processor, where the memory stores a computer program, and the processor implements the neural network-based three-dimensional image processing method according to the first aspect when executing the computer program.

In a fourth aspect, the embodiment of the present invention further provides a computer-readable storage medium, where the storage medium stores a program, and the program is executed by a processor to implement the neural network-based three-dimensional image processing method according to the first aspect.

The embodiment of the invention at least comprises the following beneficial effects:

The embodiments of the invention disclose a three-dimensional image processing method and apparatus based on a neural network, an electronic device, and a storage medium. First depth information of a target is acquired, the first depth information being used for representing a depth image of the target, and the first depth information is input into a deep neural network model for conversion to obtain target three-dimensional image information of the target. Specifically, the first depth information is input into the input layer of the deep neural network model; the input layer passes it to the convolution layer for convolution to obtain a first feature value; the first feature value is input into the feature-dimension-changing layer for feature conversion to obtain a first three-dimensional feature volume; the three-dimensional deconvolution layer then deconvolves the first three-dimensional feature volume to obtain the target three-dimensional image information; and finally the target three-dimensional image information obtained by the deep neural network model is output. The deep neural network model is obtained by training according to sample three-dimensional image information of a three-dimensional sample and second depth information of the three-dimensional sample, and after training it can accurately convert a two-dimensional depth image into a three-dimensional image corresponding to the original three-dimensional object. Therefore, inputting the first depth information into the trained deep neural network model outputs target three-dimensional image information of high accuracy, which improves the accuracy of constructing the three-dimensional image and hence the accuracy of computer vision recognition.

Additional features and advantages of the invention will be set forth in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention. The objectives and other advantages of the invention will be realized and attained by the structure particularly pointed out in the written description and claims hereof as well as the appended drawings.

Drawings

The accompanying drawings are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification; they illustrate embodiments of the invention and, together with the description, serve to explain the principles of the invention, and are not intended to limit the invention.

FIG. 1 is a schematic flow chart of a neural network-based three-dimensional image processing method according to an embodiment of the present invention;

FIG. 2 is a schematic flow chart of a neural network-based three-dimensional image processing method according to another embodiment of the present invention;

FIG. 3 is a schematic flow chart of a neural network-based three-dimensional image processing method according to another embodiment of the present invention;

FIG. 4 is a schematic flow chart of a neural network-based three-dimensional image processing method according to another embodiment of the present invention;

FIG. 5 is a schematic flow chart of a neural network-based three-dimensional image processing method according to another embodiment of the present invention;

FIG. 6 is a schematic flow chart of a neural network-based three-dimensional image processing method according to another embodiment of the present invention;

FIG. 7 is a schematic diagram of a neural network-based three-dimensional image processing apparatus according to an embodiment of the present invention;

FIG. 8 is a schematic diagram of an electronic device according to an embodiment of the invention.

Detailed Description

In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.

It should be understood that, in the description of the embodiments of the present invention, "several" means one or more and "a plurality of" means two or more; "greater than", "less than", "exceeding", and the like are understood as excluding the stated number, while "above", "below", "within", and the like are understood as including the stated number. If "first", "second", and the like are used, they are only for the purpose of distinguishing technical features and are not to be understood as indicating or implying relative importance, implicitly indicating the number of the indicated technical features, or implicitly indicating the precedence of the indicated technical features.

Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. The terminology used herein is for the purpose of describing embodiments of the invention only and is not intended to be limiting of the invention.

First, several terms referred to in the present application are explained:

Artificial Intelligence (AI): a new technical science that studies and develops theories, methods, technologies, and application systems for simulating, extending, and expanding human intelligence. Artificial intelligence is a branch of computer science that attempts to understand the essence of intelligence and to produce a new kind of intelligent machine that can react in a manner similar to human intelligence; research in this field includes robotics, speech recognition, image recognition, natural language processing, and expert systems, among others. Artificial intelligence can simulate the information processes of human consciousness and thinking. It is also a theory, method, technology, and application system that uses a digital computer, or a machine controlled by a digital computer, to simulate, extend, and expand human intelligence, perceive the environment, acquire knowledge, and use that knowledge to obtain the best results.

The embodiments of the present application can acquire and process related data based on artificial intelligence technology as defined above.

The artificial intelligence infrastructure generally includes technologies such as sensors, dedicated artificial intelligence chips, cloud computing, distributed storage, big data processing, operation/interaction systems, and mechatronics. Artificial intelligence software technology mainly comprises computer vision, robotics, biometric recognition, speech processing, natural language processing, and machine learning/deep learning.

The gradient descent algorithm is a commonly used optimization algorithm in machine learning. Only the first derivative of the loss function needs to be computed in the solving process, so the computational cost is low. The basic idea is to pick a starting point, find the gradient direction there, and repeatedly step along the steepest descent until reaching the lowest point, that is, the convergence point where the cost function is minimal. Gradient descent has three common forms: Batch Gradient Descent, Stochastic Gradient Descent, and Mini-Batch Gradient Descent, all of which are also commonly used in deep learning for model training. A minimal sketch of the update rule is shown below.
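The following Python snippet is a minimal illustrative sketch (not part of the original disclosure) of plain gradient descent on a toy quadratic cost; the cost function and learning rate are assumptions chosen only to show the update rule.

```python
import numpy as np

# Gradient descent on the assumed toy cost J(theta) = ||theta||^2;
# only the first derivative of the cost is needed per step.
theta = np.array([3.0, -2.0])
learning_rate = 0.1
for _ in range(100):
    grad = 2.0 * theta             # first derivative of J(theta)
    theta -= learning_rate * grad  # step along the negative gradient
print(theta)                       # converges toward the minimum at the origin
```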

The relu activation function is used to add a nonlinear factor. Without an activation function, the input of each layer of nodes in a neural network model is a linear function of the output of the layer above, so no matter how many layers the network has, the output is a linear combination of the inputs, which is equivalent to having no hidden layer at all. Introducing the relu nonlinearity as the activation function improves the expressive power of the neural network, so that it is no longer a mere linear combination of its inputs. The relu activation function has no saturation region, so there is no vanishing-gradient problem; it involves no complicated exponential operations, so computation is simple, efficiency is improved, and actual convergence is fast; and it is more consistent with biological neural activation mechanisms. When a neural network model adopts the relu activation function, each sample can have its own weight coefficients, that is, a unique nonlinear transformation.
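An illustrative one-function sketch of the relu activation described above:

```python
import numpy as np

def relu(x: np.ndarray) -> np.ndarray:
    # Passes positive values through unchanged and zeroes out negatives,
    # introducing the nonlinearity discussed above.
    return np.maximum(x, 0.0)
```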

The medical cloud is a medical cloud platform created by combining cloud computing with medical technology on the basis of new technologies such as cloud computing, mobile technology, multimedia, 4G communication, big data, and the Internet of Things; it shares medical resources and expands the scope of medical services. Owing to the combination with cloud computing technology, the medical cloud improves the efficiency of medical institutions and makes it more convenient for residents to seek medical care. Appointment registration, electronic medical records, and medical insurance in existing hospitals are all products combining cloud computing with the medical field, and the medical cloud also has the advantages of data security, information sharing, dynamic expansion, and overall layout.

Based on this, embodiments of the present invention provide a three-dimensional image processing method and apparatus based on a neural network, an electronic device, and a storage medium, which can improve the accuracy of constructing a three-dimensional image, thereby improving the accuracy of computer vision recognition.

The embodiment of the invention provides a three-dimensional image processing method and apparatus based on a neural network, an electronic device, and a storage medium, which are explained through the following embodiments; the neural network-based three-dimensional image processing method in the embodiments of the disclosure is described first.

The embodiment of the invention provides a three-dimensional image processing method based on a neural network, which relates to the technical fields of artificial intelligence and digital healthcare and belongs to a branch of the artificial intelligence field. The method can be applied to a terminal, to a server, or to software running in a terminal or server. In some embodiments, the terminal may be a smartphone, tablet, laptop, desktop computer, smart watch, or the like. The server may be configured as an independent physical server, as a server cluster or distributed system composed of multiple physical servers, or as a cloud server providing basic cloud computing services such as cloud services, cloud databases, cloud computing, cloud functions, cloud storage, network services, cloud communication, middleware services, domain name services, security services, CDN (content delivery network), and big data and artificial intelligence platforms. The software may be an application implementing the neural network-based three-dimensional image processing method, but is not limited to the above forms.

Fig. 1 is an optional flowchart of a neural network-based three-dimensional image processing method provided in an embodiment of the present disclosure; the method in Fig. 1 may include, but is not limited to, steps S110 to S160.

Step S110, obtaining first depth information of the target, where the first depth information is used to represent a depth image of the target.

In some embodiments of the present invention, the neural network-based three-dimensional image processing method first acquires first depth information of a target. The target is the object for which a three-dimensional image is to be constructed in the embodiments of the present invention; in one embodiment the target may be a human face, and it may be another object provided the requirements of the embodiments of the present invention are met. It should be noted that, in an embodiment, the method may also be applied to digital healthcare or a medical cloud, where a medical three-dimensional image is constructed by recognizing the first depth information of the target. The embodiments of the present invention take a human face as an example, which is not a limitation of the invention. The first depth information is used to represent a depth image of the target and may be acquired by a depth camera; the first depth information of the target is acquired so that processing can be performed on the depth image of the target.

Step S120, inputting the first depth information into an input layer of a deep neural network model, where the deep neural network model is obtained by training according to sample three-dimensional image information of a three-dimensional sample and second depth information of the three-dimensional sample, and the second depth information is used to represent a depth image of the three-dimensional sample.

In some embodiments of the present invention, after the first depth information is obtained, it is input into a trained deep neural network model, starting with the input layer of the model. The deep neural network model in the embodiments of the present invention is trained in advance according to sample three-dimensional image information of a three-dimensional sample and second depth information of the three-dimensional sample. The three-dimensional sample is a three-dimensional object corresponding to the target; when the target is a human face, the three-dimensional samples are a batch of three-dimensional face samples prepared in advance for neural network model training. In one embodiment, the deep neural network model outputs a three-dimensional model of the sample based on the input second depth information and is optimized after this output is compared against the sample three-dimensional image information; the second depth information is used to represent the depth image of the three-dimensional sample. Because the deep neural network model is trained in advance on the sample three-dimensional image information and the second depth information of the three-dimensional sample, its accuracy is high, and it can output an accurate three-dimensional model when faced with different depth information; therefore, after the first depth information is input into the deep neural network model, an accurate three-dimensional model can be output.

Step S130, inputting the first depth information into a convolution layer of the deep neural network model for convolution to obtain a first feature value.

In some embodiments of the present invention, the convolution layer performs two-dimensional convolution on the first depth information from the input layer, and a first feature value of the target is obtained after the convolution. The deep neural network model may set a plurality of convolution layers to convolve the first depth information so as to achieve the convolution effect. In one embodiment, the first feature value is a feature map, and the width and height of the depth image input to the deep neural network model are 320x320. The deep neural network model sets seven convolution layers (a first through a seventh convolution layer), each of which performs a two-dimensional convolution (conv2d) with a 3x3 kernel. The first convolution layer has a stride of 2x2 and outputs a feature map with a width and height of 160x160 and an output channel number C of 16, after which a relu activation function is applied to the output feature map. The second convolution layer has a stride of 2x2 and outputs a feature map of 80x80 with C of 32, followed by a relu activation function. The third convolution layer has a stride of 2x2 and outputs a feature map of 40x40 with C of 64, followed by a relu activation function. The fourth convolution layer has a stride of 2x2 and outputs a feature map of 20x20 with C of 128, followed by a relu activation function. The fifth convolution layer has a stride of 2x2 and outputs a feature map of 10x10 with C of 256, followed by a relu activation function. The sixth convolution layer has a stride of 2x2 and outputs a feature map of 5x5 with C of 512, followed by a relu activation function. The seventh convolution layer has a stride of 1x1 and outputs a feature map of 5x5 with C of 1280, followed by a relu activation function; the feature map output by the seventh convolution layer is the first feature value.

It should be noted that a channel number of 1 represents a single feature map; in general, the value of the channel number indicates how many feature maps there are.
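The following PyTorch sketch mirrors the seven-layer convolution stack described above; the layer widths, strides, and channel counts follow the text, while the single input channel, the padding, and all other details are assumptions and not part of the original disclosure.

```python
import torch
import torch.nn as nn

class DepthEncoder(nn.Module):
    """Sketch of the seven 2D convolution layers (assumed 1 input channel)."""
    def __init__(self):
        super().__init__()
        chans = [1, 16, 32, 64, 128, 256, 512]
        layers = []
        for c_in, c_out in zip(chans[:-1], chans[1:]):
            # 3x3 convolution, stride 2: 320 -> 160 -> 80 -> 40 -> 20 -> 10 -> 5
            layers += [nn.Conv2d(c_in, c_out, 3, stride=2, padding=1), nn.ReLU()]
        # Seventh layer: stride 1 keeps the 5x5 size, expands channels to 1280.
        layers += [nn.Conv2d(512, 1280, 3, stride=1, padding=1), nn.ReLU()]
        self.net = nn.Sequential(*layers)

    def forward(self, depth: torch.Tensor) -> torch.Tensor:
        # depth: (N, 1, 320, 320) -> first feature value: (N, 1280, 5, 5)
        return self.net(depth)
```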

Step S140, inputting the first feature value into a feature-dimension-changing layer of the deep neural network model for feature conversion to obtain a first three-dimensional feature volume.

In some embodiments of the present invention, the feature-dimension-changing layer performs a dimensional feature transformation on the first feature value from the convolution layer; this layer may correspond to a reshape layer in the deep neural network model. A first three-dimensional feature volume of the target is obtained after the feature transformation, and the deep neural network model may set a plurality of feature-dimension-changing layers to transform the first feature value so as to achieve the transformation effect. In one embodiment, the 5x5 feature map with 1280 channels output by the seventh convolution layer is reshaped, and the first three-dimensional feature volume with an output channel number C of 256 is output.
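A sketch of this reshape step, under the assumption (consistent with 1280 = 256 x 5 and with the 5x5x5 volume expected by the first three-dimensional deconvolution layer below) that the feature map is reinterpreted as a volume of depth 5; the axis order is also an assumption.

```python
import torch

feat = torch.randn(4, 1280, 5, 5)    # first feature value from the encoder
# Reinterpret 1280 channels as 256 channels x depth 5 (axis order assumed).
volume = feat.view(4, 256, 5, 5, 5)  # (N, C=256, D=5, H=5, W=5)
```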

Step S150, inputting the first three-dimensional feature volume into a three-dimensional deconvolution layer of the deep neural network model for deconvolution to obtain target three-dimensional image information of the target.

In some embodiments of the present invention, the three-dimensional deconvolution layer performs three-dimensional deconvolution on the first three-dimensional feature volume from the feature-dimension-changing layer, and target three-dimensional image information of the target is obtained after the three-dimensional deconvolution. The deep neural network model may set a plurality of three-dimensional deconvolution layers to deconvolve the first three-dimensional feature volume so as to achieve the three-dimensional deconvolution effect. In one embodiment, six three-dimensional deconvolution layers are set. The first three-dimensional deconvolution layer outputs a three-dimensional feature volume with a width and height of 10x10, a depth D of 10, and an output channel number C of 256, after which a relu activation function is applied to the output. The second outputs 20x20 with D of 20 and C of 128, followed by a relu activation function. The third outputs 40x40 with D of 40 and C of 64, followed by a relu activation function. The fourth outputs 80x80 with D of 80 and C of 32, followed by a relu activation function. The fifth outputs 160x160 with D of 160 and C of 16, followed by a relu activation function. The sixth outputs 320x320 with D of 320 and C of 1, followed by a relu activation function. The three-dimensional feature volume output by the sixth three-dimensional deconvolution layer is the target three-dimensional image information, that is, the final model output of the deep neural network model: target three-dimensional image information with a width and height of 320x320, a depth D of 320, and a channel number C of 1.

It should be noted that a channel number of 1 represents a single three-dimensional feature volume; in general, the value of the channel number indicates how many three-dimensional feature volumes there are.
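The following PyTorch sketch mirrors the six three-dimensional deconvolution layers described above; the channel counts and the doubling of width, height, and depth per layer follow the text, while the kernel size, stride, and padding are assumptions chosen so that each layer exactly doubles every spatial dimension.

```python
import torch
import torch.nn as nn

class VoxelDecoder(nn.Module):
    """Sketch of the six 3D deconvolution (transposed convolution) layers."""
    def __init__(self):
        super().__init__()
        chans = [256, 256, 128, 64, 32, 16, 1]
        layers = []
        for c_in, c_out in zip(chans[:-1], chans[1:]):
            # Kernel 4, stride 2, padding 1 doubles D, H, and W each layer:
            # 5 -> 10 -> 20 -> 40 -> 80 -> 160 -> 320
            layers += [nn.ConvTranspose3d(c_in, c_out, 4, stride=2, padding=1),
                       nn.ReLU()]
        self.net = nn.Sequential(*layers)

    def forward(self, volume: torch.Tensor) -> torch.Tensor:
        # volume: (N, 256, 5, 5, 5) -> voxels: (N, 1, 320, 320, 320)
        return self.net(volume)
```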

Step S160, outputting the target three-dimensional image information.

In some embodiments of the present invention, the first depth information is processed by the deep neural network model to obtain the target three-dimensional image information of the target, and the target three-dimensional image information obtained by this processing is output. That is, when the two-dimensional first depth information of a face is input into the deep neural network model, the model converts it and outputs the three-dimensional model of the face.

Referring to fig. 2, step S110 in the embodiment of the present invention may further include, but is not limited to, the following step S210 and step S220.

Step S210, acquiring two-dimensional image information of the target and depth image information corresponding to the two-dimensional image information.

Step S220, performing target detection on the two-dimensional image information to identify target image information used for representing the target from the two-dimensional image information, and obtaining corresponding first depth information from the depth image information according to the target image information.

In some embodiments of the present invention, after the two-dimensional image information of the target and the corresponding depth image information are obtained, target detection is required in order to obtain more accurate first depth information. In one embodiment, an RGB-D camera (where D denotes depth information) disposed in the terminal device acquires RGB-D image data that includes the two-dimensional image information and the depth image information. The depth channel D is extracted to obtain the depth image information, and the RGB channels are extracted to obtain a color RGB image, which is the two-dimensional image information of the target object. Target detection is then performed to identify the target image information representing the target from the two-dimensional image information. When the target object is a human face, a general face detector performs face detection on the RGB image to obtain a face detection frame, which may serve as the target image information, and the depth image of this area is cropped out to obtain the first depth information.

It should be noted that, in an embodiment, the target detection may further include expanding the face detection frame upward, downward, left, and right by a distance of 20% of the corresponding side length to obtain an enlarged face frame. This is because the face detection frame has errors; the value of 20% is taken as the frame expansion range based on practical experience to ensure that the face area is completely framed, and other expansion distances may also be used provided the requirements of the embodiments of the present invention are met, without specific limitation. Using the coordinates of the enlarged face frame, a partial depth image is cropped from the rectangular area at the corresponding coordinate position on the depth image; the result is the depth image of the face area. A data normalization operation is performed on this depth image to obtain a depth image with a size of 320x320 consistent with the input of the deep neural network model. This 320x320 depth image is taken as the first depth information, used as input data, and fed into the deep neural network model for computation, yielding target three-dimensional image information with a width and height of 320x320 and a depth of 320, that is, the three-dimensional face (voxel) model inferred by the deep learning model.
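A sketch of the 20% frame expansion and depth crop described above; the function name, the (x, y, w, h) box format, and the clamping to the image border are illustrative assumptions.

```python
import numpy as np

def crop_face_depth(depth: np.ndarray, box: tuple) -> np.ndarray:
    """Expand an (x, y, w, h) face box by 20% per side and crop the depth map."""
    x, y, w, h = box
    dx, dy = int(0.2 * w), int(0.2 * h)
    x0, y0 = max(x - dx, 0), max(y - dy, 0)  # clamp to the image border
    x1 = min(x + w + dx, depth.shape[1])
    y1 = min(y + h + dy, depth.shape[0])
    return depth[y0:y1, x0:x1]
```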

Referring to fig. 3, the deep neural network model in the embodiment of the present invention is obtained through the following training steps S310 to S380.

Step S310, obtaining sample three-dimensional image information of a three-dimensional sample;

Step S320, obtaining second depth information of the three-dimensional sample according to the sample three-dimensional image information.

Step S330, inputting second depth information into the input layer.

Step S340, inputting the second depth information into the convolution layer for convolution to obtain a second feature value.

Step S350, inputting the second feature value into the feature-dimension-changing layer for feature conversion to obtain a second three-dimensional feature volume.

Step S360, inputting the second three-dimensional feature volume into the three-dimensional deconvolution layer for deconvolution to obtain training three-dimensional image information of the three-dimensional sample.

Step S370, inputting the training three-dimensional image information and the sample three-dimensional image information into a loss function to calculate a loss value.

Step S380, obtaining a target weight parameter according to the loss value and adjusting the deep neural network model according to the target weight parameter.

In some embodiments of the present invention, information of the three-dimensional sample is input into the deep neural network model for training. First, sample three-dimensional image information of the three-dimensional sample is obtained; the sample three-dimensional image information represents the three-dimensional model data of the sample. The sample three-dimensional image information is then converted into a depth image by an algorithm and software to obtain the second depth information of the three-dimensional sample; in other words, the second depth information is obtained by directly converting the three-dimensional model data of the sample. The second depth information is then input into the deep neural network model for processing, starting with the input layer of the model.

Specifically, the second depth information is input into the convolution layer for convolution to obtain a second feature value; the second feature value is input into the feature-dimension-changing layer for feature conversion to obtain a second three-dimensional feature volume; and the second three-dimensional feature volume is input into the three-dimensional deconvolution layer for deconvolution to obtain training three-dimensional image information of the three-dimensional sample, which is the three-dimensional image obtained after the second depth information passes through the deep neural network model. The training three-dimensional image information and the sample three-dimensional image information are then input into a loss function to calculate a loss value. By optimizing the loss value, all weight information of the deep neural network model can be obtained, that is, the target weight parameters are obtained through optimization, and each weight in the deep neural network model is adjusted according to the target weight parameters, so that the deep neural network model is trained.

It should be noted that the loss value calculated by the loss function is used to update the weights: the tensors of each layer of the deep neural network model and the derivative of each weight are computed through backward chain-rule propagation, and the update amount of each target weight parameter is obtained by multiplying the derivative by the learning rate. The embodiment of the present invention provides a squared training loss function as follows:

l(θ) = ‖μ − y‖²    (1)

In formula (1), θ denotes all weight parameters of the deep neural network model, μ is the output value of the deep neural network model, that is, the training three-dimensional image information, and y is the sample three-dimensional image information. The result l(θ) of the loss function is the loss value, so the target weight parameters can be obtained by optimizing according to the calculated loss value, and the deep neural network model is adjusted to complete the training.
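A sketch of formula (1) in code; the sum reduction over voxels is an assumption, since the text states only that the loss is squared.

```python
import torch

mu = torch.rand(1, 1, 320, 320, 320)  # training three-dimensional image info
y = torch.rand(1, 1, 320, 320, 320)   # sample three-dimensional voxel info
loss = ((mu - y) ** 2).sum()          # l(theta) = ||mu - y||^2
```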

Referring to fig. 4, step S330 in the embodiment of the present invention may further include, but is not limited to, the following step S410 and step S420.

Step S410, performing a random first augmentation transformation on the second depth information to obtain third depth information, where the first augmentation transformation includes one of adding a random Gaussian noise value, random scaling, random angle rotation, random translation, and random selection of a partial region of the depth image.

Step S420, inputting the third depth information into the input layer.

In some embodiments of the present invention, before the second depth information is input into the deep neural network model for processing, data augmentation needs to be performed on it, that is, the first augmentation transformation is performed to obtain the third depth information, and the augmented third depth information is then input into the input layer of the deep neural network model. The purpose of data augmentation is to increase the diversity and variability of the data: each operation step uses a random number, or is itself performed at random, so randomness is increased by applying random data augmentation to the second depth information. One or more of the following individual augmentation operations are selected, randomly ordered, and applied in sequence to obtain the final augmented third depth information: adding a random Gaussian noise value to the second depth information, random scaling, random angle rotation, random translation, and random selection of a partial region of the depth image. Randomly selecting a partial region of the depth image means choosing a random region and setting its values to zero or to the maximum value, so as to simulate missing regions (holes) in the depth image data acquired by a depth camera.
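A sketch showing two of the listed augmentation operations (Gaussian noise and random region dropout); the probabilities, noise scale, and patch size are assumptions, and the remaining operations (scaling, rotation, translation) would be composed in the same randomized fashion.

```python
import numpy as np

rng = np.random.default_rng()

def augment_depth(depth: np.ndarray) -> np.ndarray:
    out = depth.astype(np.float32).copy()
    if rng.random() < 0.5:                      # add random Gaussian noise
        out += rng.normal(0.0, 5.0, out.shape)
    if rng.random() < 0.5:                      # zero a random patch to mimic
        h, w = out.shape                        # holes in depth-camera data
        y, x = rng.integers(0, h - 32), rng.integers(0, w - 32)
        out[y:y + 32, x:x + 32] = 0.0
    return out
```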

It should be noted that, in an embodiment, after the second depth information is subjected to the random first augmentation transformation to obtain the third depth information, training data normalization is also performed so that the output image meets the input requirements of the deep neural network model. One of the following two normalization operations is applied to the output depth image to obtain a normalized image of consistent size: if the width and height of the input image are greater than or equal to 320x320, an image of size 320x320 is cropped from the center of the image; otherwise, the image is centered, its top, bottom, left, and right sides are expanded outward and filled with zero values, and the image is padded to a size of 320x320. After the training data normalization is completed, the processed third depth information is input into the input layer of the deep neural network model.
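A sketch of the center-crop-or-zero-pad normalization described above; handling each axis independently is an assumption covering the case where only one dimension is undersized.

```python
import numpy as np

def normalize_depth(img: np.ndarray, size: int = 320) -> np.ndarray:
    # Center-crop any dimension larger than the target size...
    h, w = img.shape
    y, x = max((h - size) // 2, 0), max((w - size) // 2, 0)
    img = img[y:y + size, x:x + size]
    # ...then center the image and zero-pad any dimension still smaller.
    h, w = img.shape
    out = np.zeros((size, size), dtype=img.dtype)
    y, x = (size - h) // 2, (size - w) // 2
    out[y:y + h, x:x + w] = img
    return out
```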

Referring to fig. 5, step S370 in the embodiment of the present invention may further include, but is not limited to, the following step S510, step S520, step S530, and step S540.

Step S510, converting the sample three-dimensional image information into first mesh information.

Step S520, performing a second augmentation transformation corresponding to the first augmentation transformation on the first mesh information to obtain second mesh information matching the viewing angle of the third depth information, where the second augmentation transformation includes performing one of scaling, angle rotation, and translation matching the transformation applied to the second depth information.

Step S530, discretizing the second mesh information into sample three-dimensional voxel information.

Step S540, inputting the training three-dimensional image information and the sample three-dimensional voxel information into a loss function to calculate a loss value.

In some embodiments of the present invention, data augmentation similar to that applied to the second depth information needs to be performed on the sample three-dimensional image information. The obtained sample three-dimensional image information is first converted into first mesh information, which is a triangular mesh model of the three-dimensional sample, that is, a three-dimensional model representation format; it should be noted that the second depth information is also obtained by converting the first mesh information. After the first mesh information is obtained, a second augmentation transformation corresponding to the first augmentation transformation is performed on it to obtain second mesh information matching the viewing angle of the third depth information. For example, if the second depth information was randomly scaled, the first mesh information is scaled by the same factor; if the second depth information was rotated by a random angle, the first mesh information is rotated by the same angle; and if the second depth information was randomly translated, the first mesh information is translated by the same distance and direction. The purpose is to obtain a mesh model with the same viewing angle as the augmented depth information. The second mesh information obtained by the second augmentation transformation is then discretized into sample three-dimensional voxel information, which serves as the reference to be compared with the training three-dimensional image information; therefore, the training three-dimensional image information and the sample three-dimensional voxel information are input into the loss function to calculate the loss value.
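A sketch of discretizing a mesh into voxels using the trimesh library; the patent names no tool, so the library choice, the file name, and the pitch computation are assumptions.

```python
import trimesh

mesh = trimesh.load("face_sample.obj")   # hypothetical sample mesh file
pitch = mesh.extents.max() / 320         # roughly 320 voxels along the longest side
voxels = mesh.voxelized(pitch)           # discretize the mesh surface
grid = voxels.matrix.astype("float32")   # occupancy voxel grid
```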

It should be noted that, in an embodiment, after the sample three-dimensional image information is subjected to the second augmentation transformation and related operations to obtain the second mesh information, supervision-signal data normalization is also performed. One of the following two voxel normalization operations is applied to the output sample three-dimensional voxel information to obtain normalized voxel model data consistent with the volume size of the third depth information: if the three dimensions of the input sample three-dimensional voxel information are greater than or equal to 320x320x320, the sample three-dimensional voxel information is cropped to a model of size 320x320x320; otherwise, it is centered and expanded on the top, bottom, left, right, front, and back, filled with zero values, and padded to a model of size 320x320x320. After the supervision-signal data normalization is completed, the processed sample three-dimensional voxel information is input into the loss function.

Referring to fig. 6, step S380 in the embodiment of the present invention may further include, but is not limited to, the following step S610 and step S620.

Step S610, optimizing the loss value and performing back-propagation chain-rule differentiation on the optimized loss value to obtain weight-parameter gradients.

Step S620, performing gradient descent according to the weight-parameter gradients to obtain target weight parameters.

In some embodiments of the invention, gradients are computed from the optimized loss value. The optimization goal of network training is to drive the loss value L(θ) towards 0. The gradients dθ of all weight parameters in the network are computed by back-propagation chain-rule differentiation, after which the weight parameters are updated by gradient descent to obtain the target weight parameters.
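
A minimal sketch of this backward pass and gradient-descent update, written here with PyTorch and a stand-in linear model; the actual network, the loss function, and the learning rate are not fixed by this embodiment.

```python
import torch

# Stand-in model and loss purely for illustration; the real network is the
# convolution -> feature-dimension change -> 3-D deconvolution model above,
# and the embodiment does not name the loss function or learning rate.
model = torch.nn.Linear(8, 8)
loss_fn = torch.nn.MSELoss()
optimizer = torch.optim.SGD(model.parameters(), lr=1e-3)

pred = model(torch.randn(4, 8))    # stands in for training 3-D image information
target = torch.randn(4, 8)         # stands in for sample 3-D voxel information

loss = loss_fn(pred, target)       # L(theta), to be driven towards 0
optimizer.zero_grad()
loss.backward()                    # back-propagation chain-rule differentiation -> dtheta
optimizer.step()                   # gradient-descent update of the weight parameters
```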

In some embodiments of the present invention, step S620 may further include: performing gradient descent according to the weight-parameter gradients obtained in the most recent training pass to obtain the target weight parameters. It should be noted that, in the embodiment of the present invention, the sample three-dimensional image information of all collected three-dimensional samples is trained over once in order to improve the accuracy of the deep neural network; completing such a pass yields new weight-parameter gradients, which are used to update the target weight parameters. One complete pass constitutes one epoch of training, and 200 epochs are trained in this embodiment. The weight parameters θ obtained after the final epoch are taken as the weight parameters of the finally required deep neural network model, that is, as the final target weight parameters, thereby optimizing the deep neural network model. This also realizes training from multiple different viewing angles and improves the accuracy of the deep neural network model. Provided that the requirements of the embodiment of the present invention are satisfied, a different number of training epochs may be used; the present invention is not specifically limited in this respect.
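
The 200-epoch schedule might look as follows, continuing the sketch above; `train_loader`, which would yield batches of depth images and matching voxel volumes, is an assumed stand-in for the real data pipeline.

```python
import torch

NUM_EPOCHS = 200                   # the epoch count used in this embodiment

for epoch in range(NUM_EPOCHS):
    # One epoch = one pass over all collected three-dimensional samples.
    for depth_batch, voxel_batch in train_loader:
        pred = model(depth_batch)
        loss = loss_fn(pred, voxel_batch)
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()

# The weights theta left after the final epoch are kept as the final
# target weight parameters of the deep neural network model.
torch.save(model.state_dict(), "target_weights.pt")
```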

Referring to fig. 7, an embodiment of the present invention further provides a three-dimensional image processing apparatus 100 based on a neural network, which can implement the above three-dimensional image processing method based on the neural network, and the apparatus includes:

an image obtaining module 101 configured to obtain first depth information of the target, where the first depth information is used to represent a depth image of the target.

The processing module 102 is connected to the image obtaining module 101 and is configured to input the first depth information into an input layer of a deep neural network model, where the deep neural network model is obtained by training according to sample three-dimensional image information of a three-dimensional sample and second depth information of the three-dimensional sample, the second depth information being used to represent a depth image of the three-dimensional sample.

The processing module 102 is further configured to input the first depth information into a convolution layer of the deep neural network model to perform convolution to obtain a first feature value, input the first feature value into a feature dimension change layer of the deep neural network model to perform feature conversion to obtain a first three-dimensional feature volume, and input the first three-dimensional feature volume into a three-dimensional deconvolution layer of the deep neural network model to perform deconvolution to obtain target three-dimensional image information of the target.

The processing module 102 is further configured to output the target three-dimensional image information.

It should be noted that, in the embodiment of the present invention, all of the deep neural network model processing may be performed in the processing module 102. The image obtaining module 101 may also be configured to obtain image information of a three-dimensional sample; in an embodiment, the image obtaining module 101 may perform three-dimensional scanning on the three-dimensional sample to obtain the sample three-dimensional image information, so as to facilitate training of the deep neural network model. The image obtaining module 101 may be a camera in a terminal device, and the processing module 102 may be a processor.
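
A hypothetical sketch of how the two modules could be composed in software is given below; `capture_depth()` stands in for whatever depth-camera API the terminal device provides and is not part of this disclosure.

```python
import torch

class ThreeDImageProcessingApparatus:
    """Illustrative pairing of the image obtaining module (101) with the
    processing module (102) running the trained deep neural network."""

    def __init__(self, model: torch.nn.Module, camera):
        self.model = model.eval()   # processing module 102: the trained network
        self.camera = camera        # image obtaining module 101: e.g. a terminal camera

    def process(self) -> torch.Tensor:
        depth = self.camera.capture_depth()           # first depth information
        with torch.no_grad():
            # convolution -> feature-dimension change -> 3-D deconvolution
            voxels = self.model(depth.unsqueeze(0))
        return voxels                                 # target 3-D image information
```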

Fig. 8 shows an electronic device 200 provided by an embodiment of the present invention. The electronic device 200 includes: a memory 201, a processor 202, and a computer program stored on the memory 201 and executable on the processor 202, where the computer program, when run, executes the neural network-based three-dimensional image processing method described above.

The processor 202 and the memory 201 may be connected by a bus or other means.

The memory 201, as a non-transitory computer-readable storage medium, may be used to store non-transitory software programs and non-transitory computer-executable programs, such as those implementing the neural network-based three-dimensional image processing method described in the embodiments of the present invention. The processor 202 implements the above-described neural network-based three-dimensional image processing method by executing the non-transitory software programs and instructions stored in the memory 201.

The memory 201 may include a program storage area and a data storage area, where the program storage area may store an operating system and an application program required for at least one function, and the data storage area may store data generated during execution of the neural network-based three-dimensional image processing method described above. Further, the memory 201 may include high-speed random access memory, and may also include non-transitory memory, such as at least one magnetic disk storage device, flash memory device, or other non-transitory solid-state storage device. In some embodiments, the memory 201 may optionally include memory located remotely from the processor 202, and such remote memory may be connected to the electronic device 200 via a network. Examples of such networks include, but are not limited to, the internet, intranets, local area networks, mobile communication networks, and combinations thereof.

Non-transitory software programs and instructions required to implement the neural network-based three-dimensional image processing method described above are stored in the memory 201, and when executed by the one or more processors 202, perform the neural network-based three-dimensional image processing method described above, for example, performing method steps S110 to S160 in fig. 1, method steps S210 to S220 in fig. 2, method steps S310 to S380 in fig. 3, method steps S410 to S420 in fig. 4, method steps S510 to S540 in fig. 5, and method steps S610 to S620 in fig. 6.

The embodiment of the invention also provides a computer-readable storage medium, which stores computer-executable instructions, and the computer-executable instructions are used for executing the three-dimensional image processing method based on the neural network.

In one embodiment, the computer-readable storage medium stores computer-executable instructions that are executed by one or more control processors, for example, to perform method steps S110-S160 in fig. 1, method steps S210-S220 in fig. 2, method steps S310-S380 in fig. 3, method steps S410-S420 in fig. 4, method steps S510-S540 in fig. 5, and method steps S610-S620 in fig. 6.

The above-described embodiments of the apparatus are merely illustrative; the units described as separate components may or may not be physically separate, that is, they may be located in one place or distributed over a plurality of network elements. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of this embodiment.

One of ordinary skill in the art will appreciate that all or some of the steps, systems, and methods disclosed above may be implemented as software, firmware, hardware, and suitable combinations thereof. Some or all of the physical components may be implemented as software executed by a processor, such as a central processing unit, digital signal processor, or microprocessor, as hardware, or as an integrated circuit, such as an application-specific integrated circuit. Such software may be distributed on computer-readable media, which may include computer storage media (or non-transitory media) and communication media (or transitory media). As is well known to those of ordinary skill in the art, the term computer storage media includes volatile and nonvolatile, removable and non-removable media implemented in any method or technology for storage of information such as computer-readable instructions, data structures, program modules, or other data. Computer storage media include, but are not limited to, RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, digital versatile disks (DVD) or other optical disk storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or any other medium which can be used to store the desired information and which can be accessed by a computer. In addition, communication media typically embody computer-readable instructions, data structures, program modules, or other data in a modulated data signal such as a carrier wave or other transport mechanism, and include any information delivery media, as is known to those skilled in the art.

It should also be appreciated that the various implementations provided by the embodiments of the present invention can be combined arbitrarily to achieve different technical effects.

One of ordinary skill in the art will appreciate that all or some of the steps of the methods, systems, functional modules/units in the devices disclosed above may be implemented as software, firmware, hardware, and suitable combinations thereof.

In the several embodiments provided in the present application, it should be understood that the disclosed apparatus and method may be implemented in other ways. For example, the above-described apparatus embodiments are merely illustrative: the division into units is only one logical division, and other divisions may be adopted in actual implementation; for instance, a plurality of units or components may be combined or integrated into another system, or some features may be omitted or not executed. In addition, the mutual coupling, direct coupling, or communication connection shown or discussed may be an indirect coupling or communication connection through some interfaces, devices, or units, and may be electrical, mechanical, or in another form.

In addition, the functional units in the embodiments of the present application may be integrated into one processing unit, each unit may exist alone physically, or two or more units may be integrated into one unit. The integrated unit may be implemented in the form of hardware or in the form of a software functional unit.

While the preferred embodiments of the present invention have been described in detail, it will be understood by those skilled in the art that the foregoing and various other changes, omissions and deviations in the form and detail thereof may be made without departing from the scope of this invention.
