Model online incremental training method, device and medium based on active learning

文档序号:20851 发布日期:2021-09-21 浏览:16次 中文

阅读说明:本技术 基于主动学习的模型在线增量训练方法、设备及介质 (Model online incremental training method, device and medium based on active learning ) 是由 冯建设 花霖 陈军 刘桂芬 姚琪 周雷 欧馨 王宗强 赵一波 许琦枫 于 2021-08-25 设计创作,主要内容包括:本申请公开了一种基于主动学习的模型在线增量训练方法、设备及介质,所述基于主动学习的模型在线增量训练方法包括:获取待预测无标签样本,通过故障检测分类模型对所述待预测无标签样本进行预测,获得预测分类结果,并确定所述预测分类结果对应的置信度,其中,所述故障检测分类模型为基于有标签样本集进行迭代训练得到的,若所述置信度满足置信度阈值条件,则将所述待预测无标签样本构建成增量训练样本,基于所述增量训练样本,对所述故障检测分类模型进行在线更新训练,获得更新后的故障检测分类模型。本申请解决因人工标注无标签样本工作量大,导致模型更新训练效率低的技术问题。(The application discloses a model online increment training method, equipment and a medium based on active learning, wherein the model online increment training method based on active learning comprises the following steps: the method comprises the steps of obtaining a to-be-predicted label-free sample, predicting the to-be-predicted label-free sample through a fault detection classification model to obtain a prediction classification result, and determining a confidence coefficient corresponding to the prediction classification result, wherein the fault detection classification model is obtained through iterative training based on a labeled sample set, if the confidence coefficient meets a confidence coefficient threshold condition, constructing the to-be-predicted label-free sample into an incremental training sample, and performing online updating training on the fault detection classification model based on the incremental training sample to obtain an updated fault detection classification model. The method and the device solve the technical problem that the efficiency of model updating training is low due to the fact that workload of manual labeling of label-free samples is large.)

1. An active learning-based model online incremental training method is characterized by comprising the following steps:

obtaining a label-free sample to be predicted;

predicting the unlabeled sample to be predicted through a fault detection classification model to obtain a prediction classification result, and determining a confidence coefficient corresponding to the prediction classification result, wherein the fault detection classification model is obtained by performing iterative training based on a labeled sample set;

if the confidence coefficient meets the confidence coefficient threshold condition, constructing the unlabeled sample to be predicted as an incremental training sample;

and performing online updating training on the fault detection classification model based on the incremental training sample to obtain an updated fault detection classification model.

2. The active learning-based online model incremental training method of claim 1, wherein if the confidence level satisfies a confidence level threshold condition, the step of constructing the unlabeled sample to be predicted as an incremental training sample comprises:

if the confidence coefficient is larger than the confidence coefficient threshold value, setting the unlabeled sample to be predicted as a key sample to be labeled;

and manually labeling the key samples to be labeled to obtain the incremental training samples.

3. The active learning-based online model incremental training method of claim 2, wherein the step of setting the unlabeled sample to be predicted as a key sample to be labeled if the confidence level is greater than the confidence level threshold value comprises:

obtaining a prediction classification result corresponding to each predicted sample within a preset time window length;

respectively calculating the mean value and the standard deviation of the prediction classification result corresponding to each predicted sample;

and based on the mean value and the standard deviation, if the confidence coefficient is greater than the operation result between the mean value and the standard deviation, setting the unlabeled sample to be predicted as the key sample to be labeled.

4. The online incremental training method for models based on active learning according to claim 1, wherein the step of performing online update training on the fault detection classification model based on the incremental training samples to obtain an updated fault detection classification model comprises:

based on the incremental training samples, recursively calculating the posterior probability distribution of the parameters corresponding to the fault detection classification model by using a Bayesian algorithm;

and performing online incremental updating on the fault detection classification model based on the parameter posterior probability distribution to obtain the updated fault detection classification model.

5. The active learning-based model online incremental training method of claim 1, wherein before the step of predicting the unlabeled sample set to be predicted by the fault detection classification model to obtain a prediction classification result and determining the confidence degree corresponding to each unlabeled sample to be predicted in the unlabeled sample set to be predicted, the fault detection classification model is obtained by iterative training based on the labeled sample set, the active learning-based model online incremental training method further comprises:

obtaining a classification model to be trained;

and performing iterative training optimization on the classification model to be trained through the labeled sample set to obtain the fault detection classification model.

6. The active learning-based model online incremental training method of claim 5, wherein the step of performing iterative training optimization on the classification model to be trained through the labeled sample set to obtain the fault detection classification model comprises:

inputting the labeled sample set into the classification model to be trained, and outputting classification results corresponding to the labeled samples in the labeled sample set;

calculating model loss corresponding to the classification model to be trained based on the labels and the classification results respectively corresponding to the labeled samples;

and performing iterative training on the classification model to be trained based on the model loss to obtain the fault detection classification model.

7. An active learning-based model online incremental training device, comprising:

the acquisition module is used for acquiring a label-free sample to be predicted;

the prediction module is used for predicting the unlabeled sample to be predicted through a fault detection classification model to obtain a prediction classification result and determining the confidence coefficient corresponding to the unlabeled sample to be predicted, wherein the fault detection classification model is obtained by performing iterative training based on a labeled sample set;

the construction module is used for constructing the unlabeled sample to be predicted into an incremental training sample if the confidence coefficient meets a confidence coefficient threshold condition;

and the online updating module is used for performing online updating training on the fault detection classification model based on the incremental training samples to obtain an updated fault detection classification model.

8. An active learning based model online incremental training device, comprising: a memory, a processor, and an active learning based model online incremental training program stored on the memory,

the active learning based model online incremental training program is executed by the processor to implement the active learning based model online incremental training method according to any one of claims 1 to 6.

9. A medium, which is a readable storage medium, wherein the readable storage medium has stored thereon an active learning-based model online incremental training program, which is executed by a processor to implement the steps of the active learning-based model online incremental training method according to any one of claims 1 to 6.

10. A computer program product comprising a computer program, wherein the computer program, when being executed by a processor, carries out the steps of the active learning based model online incremental training method according to any one of claims 1 to 6.

Technical Field

The application relates to the technical field of machine learning, in particular to a model online incremental training method, equipment and medium based on active learning.

Background

With the development of the industrial manufacturing big data technology, a large amount of label-free data exists in the industrial manufacturing process, and the training of a data-driven PHM (fault prediction and Health Management) model requires sufficiently abundant labeled samples, further, after the model is deployed and brought on line, the model is updated often by using the newly collected labeled data to perform periodic offline update training on the model, so that a large amount of label-free samples cannot be effectively utilized, and the label-free samples can be manually marked to perform update training on the model.

Disclosure of Invention

The application mainly aims to provide a model online incremental training method, equipment and medium based on active learning, and aims to solve the technical problem that in the prior art, the model training updating efficiency is low due to the fact that workload of manually labeled non-label samples is large.

In order to achieve the above object, the present application provides an active learning-based model online incremental training method, which includes:

obtaining a label-free sample to be predicted;

predicting the unlabeled sample to be predicted through a fault detection classification model to obtain a prediction classification result, and determining the confidence coefficient corresponding to the unlabeled sample to be predicted, wherein the fault detection classification model is obtained by performing iterative training based on a labeled sample set;

if the confidence coefficient meets the confidence coefficient threshold condition, constructing the unlabeled sample to be predicted as an incremental training sample;

and performing online updating training on the fault detection classification model based on the incremental training sample to obtain an updated fault detection classification model.

Optionally, if the confidence meets the confidence threshold condition, constructing the unlabeled sample to be predicted as an incremental training sample includes:

if the confidence coefficient is larger than the confidence coefficient threshold value, setting the unlabeled sample to be predicted as a key sample to be labeled;

and manually labeling the key samples to be labeled to obtain the incremental training samples.

Optionally, if the confidence is greater than the confidence threshold, the step of setting the unlabeled sample to be predicted as a key sample to be labeled includes:

obtaining a prediction classification result corresponding to each predicted sample within a preset time window length;

respectively calculating the mean value and the standard deviation of the prediction classification result corresponding to each predicted sample;

and based on the mean value and the standard deviation, if the confidence coefficient is greater than the operation result between the mean value and the standard deviation, setting the unlabeled sample to be predicted as the key sample to be labeled.

Optionally, the step of performing online update training on the fault detection classification model based on the incremental training samples to obtain an updated fault detection classification model includes:

based on the incremental training samples, recursively calculating the posterior probability distribution of the parameters corresponding to the fault detection classification model by using a Bayesian algorithm;

and performing online incremental updating on the fault detection classification model based on the parameter posterior probability distribution to obtain the updated fault detection classification model.

Optionally, before the step of predicting the unlabeled sample set to be predicted by using the fault detection classification model to obtain a prediction classification result and determining the confidence level corresponding to each unlabeled sample to be predicted in the unlabeled sample set to be predicted, where the fault detection classification model is obtained by performing iterative training based on a labeled sample set, the active learning-based model online incremental training method further includes:

obtaining a classification model to be trained;

and performing iterative training optimization on the classification model to be trained through the labeled sample set to obtain the fault detection classification model.

Optionally, the step of performing iterative training optimization on the classification model to be trained through the labeled sample set to obtain the fault detection classification model includes:

inputting the labeled sample set into the classification model to be trained, and outputting classification results corresponding to the labeled samples in the labeled sample set;

calculating model loss corresponding to the classification model to be trained based on the labels and the classification results respectively corresponding to the labeled samples;

and performing iterative training on the classification model to be trained based on the model loss to obtain the fault detection classification model.

The present application further provides an online incremental training device of model based on active learning, the online incremental training device of model based on active learning is a virtual device, the online incremental training device of model based on active learning includes:

the acquisition module is used for acquiring a label-free sample to be predicted;

the prediction module is used for predicting the unlabeled sample to be predicted through a fault detection classification model to obtain a prediction classification result and determining the confidence coefficient corresponding to the unlabeled sample to be predicted, wherein the fault detection classification model is obtained by performing iterative training based on a labeled sample set;

the construction module is used for constructing the unlabeled sample to be predicted into an incremental training sample if the confidence coefficient meets a confidence coefficient threshold condition;

and the online updating module is used for performing online updating training on the fault detection classification model based on the incremental training samples to obtain an updated fault detection classification model.

The application also provides an online incremental training device of model based on active learning, the online incremental training device of model based on active learning is entity equipment, the online incremental training device of model based on active learning includes: the online incremental training system comprises a memory, a processor and an online incremental training program of the model based on active learning stored on the memory, wherein the online incremental training program of the model based on active learning is executed by the processor to realize the steps of the online incremental training method of the model based on active learning.

The present application further provides a medium, which is a readable storage medium, on which an active learning-based model online incremental training program is stored, and the active learning-based model online incremental training program is executed by a processor to implement the steps of the active learning-based model online incremental training method as described above.

The present application further provides a computer program product comprising a computer program which, when executed by a processor, implements the steps of the active learning based model online incremental training method as described above.

Compared with the technical means of carrying out artificial labeling on a large number of unlabelled samples to be predicted to update and train the model adopted by the prior art, the method comprises the steps of firstly obtaining the unlabelled samples to be predicted, then predicting the unlabelled samples to be predicted through a fault detection classification model to obtain a prediction classification result and determining the confidence coefficient corresponding to the prediction classification result, wherein the fault detection classification model is obtained by carrying out iterative training on the basis of a labeled sample set, further, if the confidence coefficient meets the confidence coefficient threshold condition, constructing the unlabelled samples to be predicted into incremental training samples, and further realizing the purpose of selectively selecting the unlabelled samples to be predicted with high confidence coefficient to update and train the model on the basis of the confidence coefficient, furthermore, on the basis of the incremental training samples, the fault detection classification model is subjected to online updating training to obtain an updated fault detection classification model, so that the on-line updating of the fault detection classification model is realized by actively learning the unlabeled sample to be predicted through the model and further constructing the online unlabeled sample to be predicted into the incremental training sample, the model can be updated in real time, the life cycle of the model is prolonged, the technical defects that the model cannot be effectively utilized by a large amount of unlabeled samples in the prior art and the model updating training efficiency is low due to the fact that the workload of a method for manually marking the unlabeled sample to update and train the model is large, the time cost is long and further the model updating and training efficiency is low are overcome, therefore, the efficiency of model updating training is improved.

Drawings

The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the present application and together with the description, serve to explain the principles of the application.

In order to more clearly illustrate the embodiments of the present application or the technical solutions in the prior art, the drawings needed to be used in the description of the embodiments or the prior art will be briefly described below, and it is obvious to those skilled in the art that other drawings can be obtained according to the drawings without inventive exercise.

FIG. 1 is a schematic flowchart of a first embodiment of an active learning-based model online incremental training method according to the present application;

FIG. 2 is a schematic flowchart of a second embodiment of the active learning-based model online incremental training method according to the present application;

FIG. 3 is a schematic flowchart of a third embodiment of the active learning-based model online incremental training method according to the present application;

fig. 4 is a schematic structural diagram of active learning-based model online incremental training equipment in a hardware operating environment related to the active learning-based model online incremental training method in the embodiment of the present application.

The implementation, functional features and advantages of the objects of the present invention will be further explained with reference to the accompanying drawings.

Detailed Description

It should be understood that the specific embodiments described herein are merely illustrative of the present application and are not intended to limit the present application.

In a first embodiment of the active learning-based model online incremental training method, referring to fig. 1, the active learning-based model online incremental training method includes:

step S10, obtaining a non-label sample to be predicted;

in this embodiment, it should be noted that after the model is deployed online, online streaming data on the system can be detected in real time, that is, the unlabeled sample to be predicted can be obtained in real time.

And acquiring a to-be-predicted unlabeled sample, specifically, taking the data detected in real time as the to-be-predicted unlabeled sample, and predicting the to-be-predicted unlabeled sample acquired in real time through the fault detection classification model.

Step S20, predicting the unlabeled sample to be predicted through a fault detection classification model to obtain a prediction classification result, and determining the confidence corresponding to the unlabeled sample to be predicted, wherein the fault detection classification model is obtained by iterative training based on a labeled sample set;

in this embodiment, it should be noted that the fault detection classification model is obtained by training a labeled sample set collected in advance in an offline state, and after the training is completed, the fault detection classification model is deployed for online operation.

Predicting the unlabelled sample to be predicted through a fault detection classification model to obtain a prediction classification result, and determining a confidence coefficient corresponding to the prediction classification result, wherein the fault detection classification model is obtained by iterative training based on a labeled sample set, specifically, firstly, collecting the labeled sample set in advance, inputting the labeled sample set into the classification model to be trained to optimize the classification model to be trained, and judging whether the optimized classification model to be trained meets a preset training end condition, wherein the preset training end condition comprises conditions of loss function convergence, maximum iteration threshold reaching and the like, and if not, returning to the execution step: inputting the labeled sample set into a classification model to be trained to optimize the classification model to be trained, if so, obtaining the fault detection classification model, deploying the fault detection classification model on line, further, predicting a non-labeled sample to be predicted monitored in real time through the fault detection classification model which is operated on line, outputting the predicted classification result, and further determining the confidence corresponding to the predicted classification result according to the predicted classification result to select the next training sample according to the confidence.

Before the step of predicting the unlabeled sample set to be predicted by the fault detection classification model to obtain the prediction classification result corresponding to each sample in the unlabeled sample set to be predicted, where the fault detection classification model is obtained by performing iterative training based on the labeled sample set, the active learning-based model online incremental training method further includes:

step A10, obtaining a classification model to be trained;

step A20, performing iterative training optimization on the classification model to be trained through the labeled sample set to obtain the fault detection classification model.

In this embodiment, the to-be-trained classification model is iteratively trained and optimized through the labeled sample set to obtain the fault detection classification model, specifically, the labeled sample set is input into the to-be-trained classification model, model prediction is performed to obtain a model output label, a difference between the model output label and a label corresponding to each sample of the labeled sample set is further calculated to obtain a model loss, the to-be-trained classification model is optimized based on the model loss, and whether the optimized to-be-trained classification model meets a preset training end condition is determined, where the preset training end condition includes conditions such as loss function convergence and reaching a maximum iteration threshold, and if not, the execution step is returned: and inputting the labeled sample set into a classification model to be trained to optimize the classification model to be trained, if so, obtaining the fault detection classification model, and deploying the fault detection classification model on line.

The step of performing iterative training optimization on the classification model to be trained through the labeled sample set to obtain the fault detection classification model comprises the following steps:

step A21, inputting the labeled sample set into the classification model to be trained, and outputting classification results corresponding to the labeled samples in the labeled sample set;

in this embodiment, the labeled sample set is input into the to-be-trained classification model, and classification results corresponding to the labeled samples in the labeled sample set are output, specifically, classification prediction is performed on the labeled samples in the labeled sample set through the to-be-trained classification model, so as to obtain the classification results corresponding to the labeled samples.

Step A22, calculating model loss corresponding to the classification model to be trained based on the label corresponding to each labeled sample and the classification result;

in this embodiment, the model loss corresponding to the classification model to be trained is calculated based on the labels and the classification results respectively corresponding to the labeled samples, and specifically, the model loss is calculated through a preset loss function based on the difference between the labels and the classification results respectively corresponding to the labeled samples.

Step A23, performing iterative training on the classification model to be trained based on the model loss to obtain the fault detection classification model.

In this embodiment, the classification model to be trained is iteratively trained based on the model loss to obtain the fault detection classification model, and specifically, the gradient of the model loss is calculated based on the model loss, and then the classification model to be trained is iteratively trained to obtain the fault detection classification model.

Step S30, if the confidence coefficient meets the confidence coefficient threshold condition, constructing the unlabeled sample to be predicted into an incremental training sample;

in this embodiment, it should be noted that the confidence threshold is an operation result between a mean value and a standard deviation corresponding to a prediction classification result of each predicted sample before a time corresponding to the non-labeled sample to be predicted and within a preset time window, and since a large number of non-labeled samples to be predicted are generated in an industrial manufacturing process and a labeling workload of all the non-labeled samples to be predicted is too large, a confidence corresponding to each non-labeled sample to be predicted needs to be determined, so that the non-labeled samples to be predicted with a satisfied confidence are screened, and not only a large number of non-labeled samples to be predicted are fully utilized, but also the workload of subsequent labeling is reduced.

If the confidence coefficient meets the confidence coefficient threshold condition, constructing the unlabeled sample to be predicted as an incremental training sample, specifically, screening the unlabeled sample to be predicted corresponding to the confidence coefficient threshold value based on the confidence coefficient, if the confidence coefficient is greater than the confidence coefficient threshold value, indicating that a model cannot accurately predict and output the unlabeled sample to be predicted, further using the unlabeled sample to be predicted as a key unlabeled sample to label the key unlabeled sample, using the labeled sample as the incremental training sample, further, if the confidence coefficient does not meet the confidence coefficient threshold condition, rejecting the unlabeled sample to be predicted, further obtaining a new unlabeled sample to be predicted, and performing a new round of prediction on the new unlabeled sample to be predicted through the fault detection classification model, for example, the label of the sample x after labeling is y, and the two (x, y) form the incremental training sample to be used for realizing the incremental updating of the model in the next step.

And step S40, performing online updating training on the fault detection classification model based on the incremental training samples to obtain an updated fault detection classification model.

In this embodiment, on the basis of the incremental training samples, the fault detection classification model is subjected to online update training to obtain an updated fault detection classification model, specifically, on the basis of the incremental training samples, the incremental training samples are input into the fault detection classification model, a posterior probability distribution of parameters corresponding to the fault detection classification model is recursively calculated by using a bayesian algorithm, and then, the parameters corresponding to the fault detection classification model are subjected to online incremental updating to obtain the updated fault detection classification model, so that the fault detection classification model is updated online, and then, the updated fault detection classification model is used to perform next round of prediction classification on the unlabeled sample to be predicted in real time.

Compared with the technical means of carrying out updating training on the model by manually labeling a large number of unlabeled samples to be predicted in the prior art, the embodiment of the application firstly obtains the unlabeled samples to be predicted, then predicts the unlabeled samples to be predicted through a fault detection classification model, obtains a prediction classification result and determines the confidence coefficient corresponding to the prediction classification result, wherein the fault detection classification model is obtained by carrying out iterative training on the basis of a labeled sample set, further, if the confidence coefficient meets the confidence coefficient threshold condition, the unlabeled samples to be predicted are constructed into the incremental training samples, and then the aim of selectively selecting the unlabeled samples to be predicted with high confidence coefficient to update and train the model on the basis of the confidence coefficient is fulfilled, and then on-line updating training is carried out on the fault detection classification model based on the incremental training samples to obtain an updated fault detection classification model, so that active learning of unlabeled samples to be predicted through the model is realized, the on-line unlabeled samples to be predicted are further constructed into the incremental training samples, and the fault detection classification model is updated on line, the model can be updated and trained in real time through the on-line unlabeled samples while a large amount of unlabeled samples are fully utilized, the life cycle of the model is prolonged, the technical defects that the model cannot be effectively utilized due to a large amount of unlabeled samples in the prior art is large in workload, long time cost is needed, and the efficiency of model updating and training is low due to the fact that the unlabeled samples are manually marked in order to update and train the model are overcome, therefore, the efficiency of model updating training is improved.

Further, referring to fig. 2, based on the first embodiment of the present application, in another embodiment of the present application, if the confidence level is satisfied, the step of constructing the unlabeled sample to be predicted corresponding to the confidence level as the incremental training sample includes:

step B10, if the confidence is larger than the confidence threshold, setting the unlabeled sample to be predicted as a key sample to be labeled;

in this embodiment, if the confidence is greater than the confidence threshold, setting the unlabeled sample to be predicted as a key sample to be labeled, specifically, obtaining a prediction classification result corresponding to each predicted sample within a preset time window length, further calculating a mean value of the prediction classification results corresponding to each predicted sample, and calculating a standard deviation of the prediction classification results corresponding to each predicted sample, further, comparing the confidence with an operation result between the mean value and the standard deviation, if the confidence is greater than the operation result, outputting the unlabeled sample to be predicted corresponding to the confidence, and further setting the unlabeled sample to be predicted as a key sample to be labeled, so as to manually label the key sample to be labeled.

If the confidence is greater than the confidence threshold, the step of setting the unlabeled sample to be predicted as a key sample to be labeled comprises the following steps:

step B11, obtaining the prediction classification result corresponding to each predicted sample within the preset time window length;

in this embodiment, it should be noted that each predicted sample is a sample that is before the time corresponding to the prediction of the unlabeled sample to be predicted and within the preset time window length, and the preset time window length is a preset time length.

The method includes the steps of obtaining a prediction classification result corresponding to each predicted sample within a preset time window length, specifically, obtaining a prediction classification result corresponding to each predicted sample within the preset time window length by setting a preset time window length and taking a time corresponding to prediction of the unlabeled sample to be predicted as a reference, where for example, if the time corresponding to the unlabeled sample to be predicted is T, and the preset time window length is T, obtaining a prediction classification result corresponding to each sample before the time T and within the preset time window length T.

Step B12, respectively calculating the mean value and standard deviation of the prediction classification result corresponding to each predicted sample;

in this embodiment, the mean and the standard deviation of the prediction classification result corresponding to each of the predicted samples are calculated, specifically, the mean and the standard deviation of the prediction classification result corresponding to each of the predicted samples are calculated based on the prediction classification result corresponding to each of the predicted samples.

And step B13, based on the mean value and the standard deviation, if the confidence is greater than the operation result between the mean value and the standard deviation, setting the unlabeled sample to be predicted as the key sample to be labeled.

In this embodiment, based on the mean and the standard deviation, if the confidence is greater than the mean and the standard deviationSetting the unlabeled sample to be predicted as the key unlabeled sample based on the operation result between the standard deviations, specifically, comparing the confidence degree corresponding to the unlabeled sample to be predicted with the operation result between the average value and the standard deviation based on the average value and the standard deviation, and setting the unlabeled sample to be predicted as the key unlabeled sample if the confidence degree is greater than the operation result, for example, if the operation result at the current time t is the operation result between the unlabeled sample to be predicted and the standard deviationCorresponding confidence coefficient isThe preset time window length is T, and each predicted sample is as follows:

the prediction classification result corresponding to each predicted sample is as follows:

and further comparing the confidence with the operation result between the mean and the standard deviation, namely:

and further, when the confidence coefficient is greater than the operation result, setting the unlabeled sample to be predicted as the key sample to be labeled.

And step B20, manually labeling the key samples to be labeled to obtain the incremental training samples.

In this embodiment, it should be noted that the incremental training samples are samples used for performing incremental updating on the fault detection classification model.

And manually labeling the key sample to be labeled to obtain the incremental training sample, specifically, manually labeling the key sample to be labeled through experience of a service expert to obtain a label corresponding to the key sample to be labeled, and further constructing the key sample to be labeled and the label corresponding to the key sample to be labeled into the incremental training sample, for example, the label of a sample x after being standardized is y, and the incremental training sample is (x, y), so as to realize incremental updating of a model in the next step.

The embodiment of the application provides an online model increment training method based on active learning, namely, if the confidence coefficient is greater than the confidence coefficient threshold value, setting the unlabeled sample to be predicted as a key unlabeled sample, further manually labeling the key unlabeled sample to obtain the incremental training sample, realizing that a trained fault detection classification model is deployed on line, acquiring the confidence coefficient corresponding to the unlabeled sample to be predicted on line in real time, selecting the unlabeled sample to be predicted meeting the conditions to manually label, fully and actively learning a large number of unlabeled samples to be predicted to update the fault detection classification model, overcoming the problem that a large number of unlabeled samples to be predicted in the prior art cannot be effectively utilized, and having a large workload of updating training on the model by manually labeling the unlabeled sample to be predicted, the technical defect that the model updating training efficiency is low due to the fact that long time cost is needed lays a foundation.

Further, referring to fig. 3, based on the first embodiment of the present application, in another embodiment of the present application, the performing online update training on the fault detection classification model based on the incremental training samples, and the step of obtaining an updated fault detection classification model includes:

step C10, based on the incremental training samples, calculating the posterior probability distribution of the parameters corresponding to the fault detection classification model by using a Bayesian algorithm recursion;

in this embodiment, based on the incremental training samples, a bayesian algorithm is used to recursively calculate a posterior probability distribution of parameters corresponding to the fault detection classification model, specifically, the bayesian algorithm regression form:

wherein theta is the current parameter of the model M,for noise, the posterior probability distribution of the model can be decomposed into:

wherein the content of the first and second substances,is a pre-determined probability distribution of the parameters,

inputting the incremental training samples into a model for a likelihood function, further based on the incremental training samples, the model posteriori having the form of a recursive update:

the posterior probability distribution of the parameters corresponding to the fault detection classification model can be updated.

And step C20, performing online incremental updating on the fault detection classification model based on the parameter posterior probability distribution to obtain the updated fault detection classification model.

In this embodiment, the fault detection classification model is updated on an online basis based on the parameter posterior probability distribution to obtain the updated fault detection classification model, and specifically, the fault detection classification model is updated on an online basis based on the parameter posterior probability distribution to obtain the updated and optimized fault detection classification model.

The embodiment of the application provides a model online incremental training method based on active learning, namely, based on the incremental training samples, a Bayesian algorithm is used for recursively calculating the posterior probability distribution of parameters corresponding to the fault detection classification model, and then based on the posterior probability distribution of parameters, the fault detection classification model is updated online incrementally, the updated fault detection classification model is obtained, the incremental training samples constructed based on the unlabeled samples to be predicted acquired online are realized, the fault detection classification model is updated, so that the life cycle of the model is prolonged, the problems that a large number of unlabeled samples to be predicted in the prior art cannot be effectively utilized and the method for updating and training the model by artificially labeling the unlabeled samples to be predicted has large workload and needs long time cost are solved, and further, the technical defect of low efficiency of model updating training is laid a foundation.

Referring to fig. 4, fig. 4 is a schematic structural diagram of an active learning-based model online incremental training device of a hardware operating environment according to an embodiment of the present application.

As shown in fig. 4, the active learning-based model online incremental training device may include: a processor 1001, such as a CPU, a memory 1005, and a communication bus 1002. The communication bus 1002 is used for realizing connection communication between the processor 1001 and the memory 1005. The memory 1005 may be a high-speed RAM memory or a non-volatile memory (e.g., a magnetic disk memory). The memory 1005 may alternatively be a memory device separate from the processor 1001 described above.

Optionally, the active learning-based model online incremental training device may further include a rectangular user interface, a network interface, a camera, RF (Radio Frequency) circuitry, a sensor, audio circuitry, a WiFi module, and so on. The rectangular user interface may comprise a Display screen (Display), an input sub-module such as a Keyboard (Keyboard), and the optional rectangular user interface may also comprise a standard wired interface, a wireless interface. The network interface may optionally include a standard wired interface, a wireless interface (e.g., WIFI interface).

Those skilled in the art will appreciate that the active learning based model online incremental training device architecture illustrated in FIG. 4 does not constitute a limitation of active learning based model online incremental training devices, and may include more or fewer components than illustrated, or some components in combination, or a different arrangement of components.

As shown in fig. 4, a memory 1005, which is a type of computer storage medium, may include an operating system, a network communication module, and an active learning based model online incremental training program. The operating system is a program for managing and controlling hardware and software resources of the active learning-based model online incremental training device, and supports the running of the active learning-based model online incremental training program and other software and/or programs. The network communication module is used for realizing communication among components in the memory 1005 and communication with other hardware and software in the online incremental training system based on the actively learned model.

In the active learning based model online incremental training apparatus shown in fig. 4, the processor 1001 is configured to execute an active learning based model online incremental training program stored in the memory 1005, and implement any of the steps of the active learning based model online incremental training method described above.

The specific implementation manner of the active learning-based model online increment training device is basically the same as that of each embodiment of the active learning-based model online increment training method, and is not described herein again.

The present application further provides an active learning-based online incremental training device for models, which includes:

the acquisition module is used for acquiring a label-free sample to be predicted;

the prediction module is used for predicting the unlabeled sample to be predicted through a fault detection classification model to obtain a prediction classification result and determining the confidence coefficient corresponding to the unlabeled sample to be predicted, wherein the fault detection classification model is obtained by performing iterative training based on a labeled sample set;

the construction module is used for constructing the unlabeled sample to be predicted into an incremental training sample if the confidence coefficient meets a confidence coefficient threshold condition;

and the online updating module is used for performing online updating training on the fault detection classification model based on the incremental training samples to obtain an updated fault detection classification model.

Optionally, the building module is further configured to:

if the confidence coefficient is larger than the confidence coefficient threshold value, setting the unlabeled sample to be predicted as a key sample to be labeled;

and manually labeling the key samples to be labeled to obtain the incremental training samples.

Optionally, the building module is further configured to:

obtaining a prediction classification result corresponding to each predicted sample within a preset time window length;

respectively calculating the mean value and the standard deviation of the prediction classification result corresponding to each predicted sample;

and based on the mean value and the standard deviation, if the confidence coefficient is greater than the operation result between the mean value and the standard deviation, setting the unlabeled sample to be predicted as the key sample to be labeled.

Optionally, the online update module is further configured to:

based on the incremental training samples, recursively calculating the posterior probability distribution of the parameters corresponding to the fault detection classification model by using a Bayesian algorithm;

and performing online incremental updating on the fault detection classification model based on the parameter posterior probability distribution to obtain the updated fault detection classification model.

Optionally, the active learning-based model online incremental training device is further configured to:

obtaining a classification model to be trained;

and performing iterative training optimization on the classification model to be trained through the labeled sample set to obtain the fault detection classification model.

Optionally, the active learning-based model online incremental training device is further configured to:

inputting the labeled sample set into the classification model to be trained, and outputting classification results corresponding to the labeled samples in the labeled sample set;

calculating model loss corresponding to the classification model to be trained based on the labels and the classification results respectively corresponding to the labeled samples;

and performing iterative training on the classification model to be trained based on the model loss to obtain the fault detection classification model.

The specific implementation of the active learning-based model online increment training device is basically the same as that of each embodiment of the active learning-based model online increment training method, and is not described herein again.

The present application provides a medium, which is a readable storage medium, and the readable storage medium stores one or more programs, which can be further executed by one or more processors for implementing the steps of any one of the above active learning-based model online incremental training methods.

The specific implementation manner of the readable storage medium of the present application is substantially the same as that of each embodiment of the above active learning-based model online incremental training method, and is not described herein again.

The present application provides a computer program product, and the computer program product includes one or more computer programs, which can also be executed by one or more processors for implementing the steps of any one of the above active learning-based model online incremental training methods.

The specific implementation of the computer program product of the present application is substantially the same as the embodiments of the active learning-based model online incremental training method, and is not described herein again.

The above description is only a preferred embodiment of the present application, and not intended to limit the scope of the present application, and all modifications of equivalent structures and equivalent processes, which are made by the contents of the specification and the drawings, or which are directly or indirectly applied to other related technical fields, are included in the scope of the present application.

16页详细技术资料下载
上一篇:一种医用注射器针头装配设备
下一篇:一种基于RFID标签的工具管理系统及方法

网友询问留言

已有0条留言

还没有人留言评论。精彩留言会获得点赞!

精彩留言,会给你点赞!