Image processing method, image processing device, electronic equipment and storage medium

文档序号：1954556 发布日期：2021-12-10 浏览：9次中文

阅读说明：本技术 图像处理方法、装置、电子设备及存储介质 (Image processing method, image processing device, electronic equipment and storage medium ) 是由徐屹于 2020-05-21 设计创作，主要内容包括：本公开关于一种图像处理方法、装置、电子设备及存储介质,属于计算机视觉领域。该方法包括：获取待处理的人体图像,对人体图像的人体区域进行分区处理,得到N个子图像区域；分别对N个子图像区域中的每个子图像区域进行边缘提取和色块提取,得到边缘提取结果和N个色块,其中,色块提取是指将每个子图像区域包括的各个像素点的颜色值统一配置为相同数值,相同数值是根据各个像素点的原始颜色值确定的；将边缘提取结果叠加到N个色块上,得到融合图像；对人体图像的人体区域进行人体关键点预测,得到人体关键点预测结果；基于人体关键点预测结果和设置的褶皱出现规则,在融合图像上绘制褶皱得到目标图像。本公开丰富了图像处理方式且处理效果佳。(The disclosure relates to an image processing method, an image processing device, electronic equipment and a storage medium, and belongs to the field of computer vision. The method comprises the following steps: acquiring a human body image to be processed, and carrying out partition processing on a human body area of the human body image to obtain N sub-image areas; respectively carrying out edge extraction and color lump extraction on each sub-image region in the N sub-image regions to obtain an edge extraction result and N color lumps, wherein the color lump extraction refers to uniformly configuring color values of all pixel points included in each sub-image region into the same value, and the same value is determined according to original color values of all the pixel points; superposing the edge extraction result to the N color blocks to obtain a fused image; predicting human body key points in a human body region of a human body image to obtain a human body key point prediction result; and drawing folds on the fusion image to obtain a target image based on the human body key point prediction result and the set fold occurrence rule. The method and the device enrich the image processing modes and have good processing effect.)

1. An image processing method, characterized in that the method comprises:

acquiring a human body image to be processed, and carrying out partition processing on a human body area of the human body image to obtain N sub-image areas, wherein the value of N is a positive integer;

respectively carrying out edge extraction and color block extraction on each sub-image region in the N sub-image regions to obtain an edge extraction result and N color blocks, wherein the color block extraction refers to uniformly configuring color values of all pixel points included in each sub-image region into the same value, and the same value is determined according to original color values of all the pixel points;

superposing the edge extraction result to the N color blocks to obtain a fused image;

predicting human body key points of the human body region of the human body image to obtain a human body key point prediction result;

and drawing folds on the fusion image based on the human body key point prediction result and the set fold occurrence rule to obtain a target image.

2. The image processing method according to claim 1, wherein the partitioning the human body region of the human body image includes:

partitioning the human body area according to the human body parts and the clothes included in the human body area to obtain masks for indicating the N sub-image areas, wherein the masks corresponding to each sub-image area are respectively represented by different colors;

wherein, a sub-image area corresponds to a color block, and each sub-image area comprises a human body part or a dress.

3. The image processing method according to claim 1, wherein the performing edge extraction on each of the N sub-image regions respectively comprises:

for each sub-image area, carrying out filtering processing on the sub-image area to obtain a filtering image;

calculating gradient data of each pixel point in the filtering image to obtain a gradient image;

according to the gradient data of each pixel point, filtering the pixel points included in the gradient image to obtain the residual pixel points which are not filtered;

based on the gradient strength of the residual pixel points and the two set thresholds, screening the residual pixel points to obtain screened pixel points;

and connecting the screened pixel points to obtain the edge extraction result.

4. The method according to claim 1, wherein said performing color block extraction on each sub-image region of the N sub-image regions comprises:

for each sub-image area, acquiring the color average value of all pixel points in the sub-image area;

and configuring the color value of each pixel point in the sub-image area as the color average value to obtain a color block corresponding to the sub-image area.

5. The image processing method of claim 4, wherein the obtaining the color average of all the pixels in the sub-image region comprises:

respectively acquiring a first color average value of all pixel points in the sub-image region in an R channel, a second color average value in a G channel and a third color average value in a B channel;

the configuring the color value of each pixel point in the sub-image region as the color average value includes:

and configuring the color value of each pixel point in the sub-image area in the R channel as the first color average value, configuring the color value in the G channel as the second color average value, and configuring the color value in the B channel as the third color average value.

6. The image processing method according to claim 1, wherein the step of drawing a wrinkle on the fused image based on the human body key point prediction result and the set wrinkle occurrence rule to obtain a target image comprises:

generating a plurality of selectable items based on the human keypoint prediction result and the wrinkle occurrence rule; each selectable item corresponds to two human key points in the human key point prediction result;

displaying the plurality of selectable items;

determining M target selectable items selected by a user in the plurality of selectable items, wherein the value of M is a positive integer;

for each target selectable item, taking two human body key points corresponding to the target selectable item as a starting point and an end point of a fold to be drawn respectively;

and connecting the determined starting point and the corresponding end point to obtain a fold drawn on the fused image.

7. The image processing method according to claim 6, wherein connecting the determined start point and the corresponding end point to obtain a wrinkle rendered on the fused image comprises:

and connecting the determined starting point and the corresponding end point by adopting a Bezier curve to obtain a fold drawn on the fused image.

8. An image processing apparatus, characterized in that the apparatus comprises:

an acquisition module configured to acquire a human body image to be processed;

the first processing module is configured to perform partition processing on a human body region of the human body image to obtain N sub-image regions, wherein the value of N is a positive integer;

the extraction module is configured to perform edge extraction and color block extraction on each sub-image region in the N sub-image regions respectively to obtain an edge extraction result and N color blocks, wherein the color block extraction refers to uniformly configuring color values of all pixel points included in each sub-image region to be the same value, and the same value is determined according to original color values of all the pixel points;

the fusion module is configured to overlay the edge extraction result onto the N color blocks to obtain a fusion image;

the prediction module is configured to predict human key points of the human body region of the human body image to obtain a human key point prediction result;

and the second processing module is configured to draw folds on the fusion image based on the human body key point prediction result and the set fold occurrence rule to obtain a target image.

9. An electronic device, comprising:

a processor;

a memory for storing the processor-executable instructions;

wherein the processor is configured to execute the instructions to implement the image processing method of any one of claims 1 to 7.

10. A computer-readable storage medium, wherein instructions in the storage medium, when executed by a processor of an electronic device, enable the electronic device to perform the image processing method of any of claims 1 to 7.

Technical Field

The present disclosure relates to the field of computer vision technologies, and in particular, to an image processing method and apparatus, an electronic device, and a storage medium.

Background

Stylizing images in an artistic manner has been a popular research direction in the field of computer vision. The stylization is a specific application in an image processing algorithm, and aims to convert the style of an image into a certain style type, and keep other elements in the image unchanged, so as to realize a specific visual effect desired by a user.

Taking a quadratic element style as an example, the technology converts an image into a cartoon type style of quadratic element, namely, realizes cartoon of the image. The better the image processing effect, the more satisfactory cartoon effect can be generated, and therefore, how to convert the real image into a high-quality painting-style image becomes a problem to be solved by those skilled in the art.

Disclosure of Invention

The present disclosure provides an image processing method, an image processing apparatus, an electronic device, and a storage medium, which not only enrich image processing modes, but also have a good image processing effect. The technical scheme of the disclosure is as follows:

according to a first aspect of embodiments of the present disclosure, there is provided an image processing method, the method including:

superposing the edge extraction result to the N color blocks to obtain a fused image;

predicting human body key points of the human body region of the human body image to obtain a human body key point prediction result;

and drawing folds on the fusion image based on the human body key point prediction result and the set fold occurrence rule to obtain a target image.

In a possible implementation manner, the partitioning the human body region of the human body image includes:

wherein, a sub-image area corresponds to a color block, and each sub-image area comprises a human body part or a dress.

In a possible implementation manner, the performing edge extraction on each of the N sub-image regions respectively includes:

for each sub-image area, carrying out filtering processing on the sub-image area to obtain a filtering image;

calculating gradient data of each pixel point in the filtering image to obtain a gradient image;

according to the gradient data of each pixel point, filtering the pixel points included in the gradient image to obtain the residual pixel points which are not filtered;

based on the gradient strength of the residual pixel points and the two set thresholds, screening the residual pixel points to obtain screened pixel points;

and connecting the screened pixel points to obtain the edge extraction result.

In a possible implementation manner, the gradient data includes a gradient strength and a gradient direction, and the filtering processing of the pixel points included in the gradient image according to the gradient data of each pixel point includes:

for each pixel point in the gradient image, comparing the gradient strength of the pixel point with the gradient strength of two adjacent pixel points;

if the gradient intensity of the pixel point is greater than the gradient intensities of the two pixel points, the pixel point is reserved;

if the gradient intensity of the pixel point is minimum or smaller than the gradient intensity of any one of the two pixel points, filtering the pixel point;

and the two adjacent phase pixel points are positioned in the gradient direction of the pixel point and positioned at two sides of the pixel point.

In one possible implementation, the two thresholds include a first threshold and a second threshold, the first threshold being greater than the second threshold; the screening processing is carried out on the residual pixel points based on the gradient strength of the residual pixel points and two set thresholds, and the screening processing comprises the following steps:

for each pixel point in the residual pixel points, if the gradient intensity of the pixel point is greater than the set first threshold value, the pixel point is reserved and marked as a first-class pixel point; or the like, or, alternatively,

if the gradient strength of the pixel point is smaller than the first threshold value and larger than the second threshold value, and the pixel point adjacent to the pixel point comprises the first type pixel point, the pixel point is reserved; or the like, or, alternatively,

if the gradient strength of the pixel point is smaller than the first threshold value and larger than the second threshold value, and the pixel point adjacent to the pixel point does not comprise the first type of pixel point, filtering the pixel point; or the like, or, alternatively,

and if the gradient strength of the pixel point is smaller than the second threshold value, filtering the pixel point.

In a possible implementation manner, the performing color block extraction on each sub-image region of the N sub-image regions includes:

for each sub-image area, acquiring the color average value of all pixel points in the sub-image area;

and configuring the color value of each pixel point in the sub-image area as the color average value to obtain a color block corresponding to the sub-image area.

In a possible implementation manner, the obtaining the color average value of all pixel points in the sub-image region includes:

the configuring the color value of each pixel point in the sub-image region as the color average value includes:

In a possible implementation manner, the predicting the human body key points in the human body region of the human body image, where the number of key points included in the human body key point prediction result is greater than a target threshold, includes:

predicting key points of the human body image on the basis of a key point prediction model;

the key point prediction model is obtained by training a deep neural network based on a specified training data set, each sample human body image in the specified training data set corresponds to label information, and the label information marks corresponding mapping points when the marking points in the sample human body images are mapped to corresponding three-dimensional human body models;

the generation process of the label information comprises the following steps: firstly, segmenting the human body part of the sample human body image, and then sampling each segmented human body part based on marker points with approximate equal distances to obtain a plurality of marker points for marking the human body part; and positioning a mapping point corresponding to each marking point on the three-dimensional human body model.

In a possible implementation manner, the drawing a wrinkle on the fused image based on the human body key point prediction result and the set wrinkle occurrence rule to obtain a target image includes:

displaying the plurality of selectable items;

determining M target selectable items selected by a user in the plurality of selectable items, wherein the value of M is a positive integer;

for each target selectable item, taking two human body key points corresponding to the target selectable item as a starting point and an end point of a fold to be drawn respectively;

and connecting the determined starting point and the corresponding end point to obtain a fold drawn on the fused image.

In one possible implementation, the generating a plurality of selectable items based on the human keypoint prediction result and the wrinkle occurrence rule includes:

determining a wrinkle occurrence region based on the wrinkle occurrence rule; wherein the wrinkle occurrence region refers to a region on a human body where wrinkles exist;

screening human key points from the human key point prediction result according to the determined fold occurrence area;

generating the plurality of selectable items according to the screened human key points; wherein each selectable item corresponds to two of the screened human key points.

In one possible implementation, connecting the determined start point and the corresponding end point to obtain a wrinkle drawn on the fused image includes:

and connecting the determined starting point and the corresponding end point by adopting a Bezier curve to obtain a fold drawn on the fused image.

In one possible implementation, the connection rule of the bezier curve includes:

randomly generating a first included angle value and a second included angle value within a specified included angle value interval;

taking the first included angle value as the tangential direction of the starting point, and taking the second included angle value as the tangential direction of the end point; generating the wrinkle based on a tangential direction of the starting point and a tangential direction of the ending point;

the first included angle value is formed by tangency of a first tangent line passing through the starting point and a specified straight line, and the second included angle value is formed by tangency of a second tangent line passing through the end point and the specified straight line; the first tangent line and the second tangent line are located on the same side of the designated straight line, and the designated straight line passes through the starting point and the ending point.

According to a second aspect of the embodiments of the present disclosure, there is provided an image processing apparatus, the apparatus including:

an acquisition module configured to acquire a human body image to be processed;

the first processing module is configured to perform partition processing on a human body region of the human body image to obtain N sub-image regions, wherein the value of N is a positive integer;

the fusion module is configured to overlay the edge extraction result onto the N color blocks to obtain a fusion image;

the prediction module is configured to predict human key points of the human body region of the human body image to obtain a human key point prediction result;

and the second processing module is configured to draw folds on the fusion image based on the human body key point prediction result and the set fold occurrence rule to obtain a target image.

In a possible implementation manner, the first processing module is configured to perform partition processing on the human body region according to a human body part and clothing included in the human body region, to obtain masks for indicating the N sub-image regions, where the masks corresponding to each sub-image region are respectively represented by different colors; wherein, a sub-image area corresponds to a color block, and each sub-image area comprises a human body part or a dress.

In one possible implementation manner, the extraction module includes:

the first processing unit is configured to carry out filtering processing on each sub-image area to obtain a filtering image;

the calculation unit is configured to calculate gradient data of each pixel point in the filtering image to obtain a gradient image;

the second processing unit is configured to filter the pixel points included in the gradient image according to the gradient data of each pixel point to obtain the residual pixel points which are not filtered;

the third processing unit is configured to perform screening processing on the residual pixel points based on the gradient strength of the residual pixel points and the two set thresholds to obtain screened pixel points;

and the connecting unit is configured to connect the screened pixel points to obtain the edge extraction result.

In a possible implementation manner, the second processing unit is configured to, for each pixel point in the gradient image, compare the gradient strength of the pixel point with the gradient strength of two adjacent pixel points; if the gradient intensity of the pixel point is greater than the gradient intensities of the two pixel points, the pixel point is reserved; if the gradient intensity of the pixel point is minimum or smaller than the gradient intensity of any one of the two pixel points, filtering the pixel point; and the two adjacent phase pixel points are positioned in the gradient direction of the pixel point and positioned at two sides of the pixel point.

In a possible implementation manner, the third processing unit is configured to, for each of the remaining pixels, if the gradient strength of the pixel is greater than the set first threshold, retain the pixel, and mark the pixel as a first-class pixel; or if the gradient strength of the pixel point is smaller than the first threshold and larger than the second threshold, and the neighborhood pixel points of the pixel point comprise the first type pixel points, keeping the pixel point; or, if the gradient strength of the pixel point is smaller than the first threshold and larger than the second threshold, and the neighborhood pixel points of the pixel point do not include the first type pixel point, filtering the pixel point; or, if the gradient strength of the pixel point is smaller than the second threshold, filtering the pixel point.

In one possible implementation manner, the extraction module further includes:

the acquisition unit is configured to acquire the color average value of all pixel points in each sub-image area;

and the extraction unit is configured to configure the color value of each pixel point in the sub-image area as the color average value to obtain a color block corresponding to the sub-image area.

In a possible implementation manner, the obtaining unit is configured to obtain a first color average value of all pixel points in the sub-image region in an R channel, a second color average value in a G channel, and a third color average value in a B channel, respectively;

the extracting unit is configured to configure the color value of each pixel point in the sub-image region in the R channel as the first color average value, the color value in the G channel as the second color average value, and the color value in the B channel as the third color average value.

In one possible implementation manner, the second processing module includes:

a determination unit configured to generate a plurality of selectable items based on the human body key point prediction result and the wrinkle occurrence rule; each selectable item corresponds to two human key points in the human key point prediction result; displaying the plurality of selectable items; determining M target selectable items selected by a user in the plurality of selectable items, wherein the value of M is a positive integer; for each target selectable item, taking two human body key points corresponding to the target selectable item as a starting point and an end point of a fold to be drawn respectively;

and the drawing unit is configured to connect the determined starting point and the corresponding end point to obtain a fold drawn on the fused image.

In a possible implementation manner, the rendering unit is configured to connect the determined start point and the corresponding end point by using a bezier curve to obtain a wrinkle rendered on the fused image.

According to a third aspect of the embodiments of the present disclosure, there is provided an electronic apparatus including:

a processor;

a memory for storing the processor-executable instructions;

wherein the processor is configured to execute the instructions to implement the image processing method of the first aspect.

According to a fourth aspect of embodiments of the present disclosure, there is provided a computer-readable storage medium, wherein instructions, when executed by a processor of an electronic device, enable the electronic device to perform the image processing method of the first aspect.

According to a fifth aspect of embodiments of the present disclosure, there is provided a computer program product, wherein instructions of the computer program product, when executed by a processor of an electronic device, enable the electronic device to perform the image processing method of the first aspect.

The technical scheme provided by the embodiment of the disclosure at least has the following beneficial effects:

after the human body image to be processed is sequentially subjected to the processing of human body partition, edge extraction, color block extraction and the like, the edge extraction result is superposed on the extracted color block to form a fused image; then, predicting the human body key points of the human body region of the human body image to obtain a human body key point prediction result; and finally, drawing folds on the fusion image to obtain a target image based on the human body key point prediction result and the set fold occurrence rule. Namely, the embodiment of the disclosure can add the wrinkle to the fused image based on the predicted human key points and wrinkle appearance rules, so that the human image to be processed is converted into an image with a certain painting style, and the image processing mode is enriched. For example, when the wrinkle occurrence rule is a cartoon style for two-dimensional, a cartoon type image having a two-dimensional style can be obtained. In addition, the edge extraction and the color block extraction are carried out after the human body is partitioned, so that the edge extraction and the color block extraction are guaranteed to have semantic selectivity and are not disordered and random, and then the wrinkles are drawn based on the predicted human body key points, so that the edge extraction effect when the human body boundary and the background color are close is guaranteed, more accurate wrinkles can be obtained, the image is not disordered, the wrinkles are guaranteed to be only shown at the required positions, and the image processing effect is better.

It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the disclosure.

Drawings

The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the present disclosure and, together with the description, serve to explain the principles of the disclosure and are not to be construed as limiting the disclosure.

Fig. 1 is a schematic diagram illustrating an implementation environment involved with an image processing method according to an example embodiment.

FIG. 2 is a flow diagram illustrating an image processing method according to an exemplary embodiment.

FIG. 3 is a flow diagram illustrating an image processing method according to an exemplary embodiment.

FIG. 4 is a diagram illustrating an image processing effect according to an exemplary embodiment.

FIG. 5 is a diagram illustrating an image processing effect according to an exemplary embodiment.

FIG. 6 is a diagram illustrating an image processing effect according to an exemplary embodiment.

FIG. 7 is a diagram illustrating an image processing effect according to an exemplary embodiment.

Fig. 8 is a block diagram illustrating an image processing apparatus according to an exemplary embodiment.

FIG. 9 is a block diagram illustrating an electronic device in accordance with an example embodiment.

Detailed Description

In order to make the technical solutions of the present disclosure better understood by those of ordinary skill in the art, the technical solutions in the embodiments of the present disclosure will be clearly and completely described below with reference to the accompanying drawings.

It should be noted that the terms "first," "second," and the like in the description and claims of the present disclosure and in the above-described drawings are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It is to be understood that the data so used is interchangeable under appropriate circumstances such that the embodiments of the disclosure described herein are capable of operation in sequences other than those illustrated or otherwise described herein. The implementations described in the exemplary embodiments below are not intended to represent all implementations consistent with the present disclosure. Rather, they are merely examples of apparatus and methods consistent with certain aspects of the present disclosure, as detailed in the appended claims.

The user information to which the present disclosure relates may be information authorized by the user or sufficiently authorized by each party.

Before explaining the embodiments of the present disclosure in detail, some terms related to the embodiments of the present disclosure are explained.

Quadratic element: the early japanese animation and game works are both formed by two-dimensional images, and the picture is a plane, so the picture is called a "two-dimensional world" and is called a "two-dimensional" for short, and the picture is opposite to the picture, namely a "three-dimensional" which is the existing one, namely the real world. Quadratic element means the wonderful world that human beings imagine, shows the visual experience of abusing viewers with various longitudes, and in essence is also the blurred longing of dream life and the expectation of a nice future in human mind in the cubic world.

Cartoon: is an artistic form, which is a picture for depicting life or current affairs by a simple and exaggerated method.

Dense human body key points: refers to a sufficiently dense number of human keypoints, i.e. the human keypoint divisions are no longer simply such as head, neck, shoulder, elbow, hand, hip, knee, foot, etc., but are distributed sufficiently dense, e.g. the number of keypoints can be tens. In one possible implementation, the dense human key points include, but are not limited to: forehead, left eye, right eye, left ear, right ear, mouth, left shoulder, right shoulder, left elbow, right elbow, left wrist, right wrist, left palm, right palm, chest, left hip, right hip, left knee, right knee, left ankle, right wrist, left foot palm, right foot palm, etc.

An implementation environment related to an image processing method provided by the embodiment of the present disclosure is described below.

The image processing method can be applied to interactive scenes, such as a video call process, a video live broadcast process and the like; the method and the device for processing the human body image or video may also be applied in a non-interactive scene, for example, in the process of taking an image or video by a user person, or may also perform image processing on a human body image or video stored locally by the user, which is not specifically limited in this embodiment of the disclosure.

Taking an application in a non-interactive scenario as an example, referring to fig. 1, the implementation environment includes a user 101 and an electronic device 102, where the electronic device 102 generally refers to a mobile computer device such as a tablet computer, a smart phone, and the like. The electronic device 102 is configured to execute the image processing method.

In addition, if the application is applied in an interactive scenario, the implementation environment shown in fig. 1 further includes a server in data communication with the electronic device 102 and at least one other electronic device in data communication with the server.

Based on the implementation environment, the embodiment of the disclosure provides an image processing method, in which after a human body image to be processed is sequentially subjected to human body partitioning, edge extraction, color block extraction and the like, an edge extraction result is superimposed on an extracted color block to form a fused image; then, predicting the human body key points of the human body region of the human body image to obtain a human body key point prediction result; and finally, drawing folds on the fusion image to obtain a target image based on the human body key point prediction result and the set fold occurrence rule. Namely, the embodiment of the disclosure can add the wrinkle to the fused image based on the predicted human key points and wrinkle appearance rules, so that the human image to be processed is converted into an image with a certain painting style, and the image processing mode is enriched. For example, when the wrinkle occurrence rule is a cartoon style for two-dimensional, a cartoon type image having a two-dimensional style can be obtained. Namely, abstract, concise and clear folds can be added, and the clear, planar and abstract quadratic element style conversion of the human body image is realized. Illustratively, the disclosed embodiments can convert the human body image to be processed into a cartoon type image with a two-dimensional style (such as flattening, abstracting and having a hook edge effect).

In addition, the edge extraction and the color block extraction are carried out after the human body is partitioned, so that the edge extraction and the color block extraction are guaranteed to be semantically selective and are not disordered and random, and then the folds are drawn based on the predicted human body key points, so that the edge extraction effect when the human body boundary and the background color are close is guaranteed, more accurate folds can be obtained, and the pictures are not in disorder, for example, the folds mainly appear on limb boundaries, joints, clothing boundaries, pockets, trouser legs, skirts and the like and have definite meanings, and the image processing effect is better.

Fig. 2 is a flowchart illustrating an image processing method, as shown in fig. 2, for use in the electronic device shown in fig. 1, according to an exemplary embodiment, including the following steps.

In step 201, a human body image to be processed is acquired, and a human body region of the human body image is subjected to partition processing to obtain N sub-image regions, where a value of N is a positive integer.

In step 202, edge extraction and color lump extraction are respectively performed on each sub-image region in the N sub-image regions to obtain an edge extraction result and N color lumps, where color lump extraction refers to uniformly configuring color values of each pixel point included in each sub-image region to be the same value, and the same value is determined according to an original color value of each pixel point.

In step 203, the edge extraction result is superimposed on the N color patches to obtain a fused image.

In step 204, the human body region of the human body image is subjected to human body key point prediction to obtain a human body key point prediction result.

In step 205, based on the human body key point prediction result and the set wrinkle occurrence rule, a wrinkle is drawn on the fusion image, and a target image is obtained.

According to the method provided by the embodiment of the disclosure, after the human body image to be processed is sequentially subjected to the processing of human body partition, edge extraction, color block extraction and the like, the edge extraction result is superposed on the extracted color block to form a fused image; then, predicting the human body key points of the human body region of the human body image to obtain a human body key point prediction result; and finally, drawing folds on the fusion image to obtain a target image based on the human body key point prediction result and the set fold occurrence rule. Namely, the embodiment of the disclosure can add the wrinkle to the fused image based on the predicted human key points and wrinkle appearance rules, so that the human image to be processed is converted into an image with a certain painting style, and the image processing mode is enriched. For example, when the wrinkle occurrence rule is a cartoon style for two-dimensional, a cartoon type image having a two-dimensional style can be obtained. In addition, the edge extraction and the color block extraction are carried out after the human body is partitioned, so that the edge extraction and the color block extraction are guaranteed to have semantic selectivity and are not disordered and random, and then the wrinkles are drawn based on the predicted human body key points, so that the edge extraction effect when the human body boundary and the background color are close is guaranteed, more accurate wrinkles can be obtained, the image is not disordered, the wrinkles are guaranteed to be only shown at the required positions, and the image processing effect is better.