Wind turbine generator state monitoring method based on OC-RKELM

文档序号:732048 发布日期:2021-04-20 浏览:5次 中文

阅读说明:本技术 一种基于oc-rkelm的风电机组状态监测方法 (Wind turbine generator state monitoring method based on OC-RKELM ) 是由 金晓航 泮恒拓 于 2020-12-21 设计创作,主要内容包括:一种基于OC-RKELM的风电机组状态监测方法,包括如下步骤:步骤1:采集风电机组的运行数据、环境参数数据;步骤2:对采集到的风电机组数据进行清洗;建立风电机组正常运行数据的健康数据集;步骤3:基于风电机组的健康数据集,利用OC-RKELM模型建立针对健康数据集的数学描述;步骤4:基于历史正常数据的分析来确定健康阈值,以判断运行数据是否隶属于风电机组健康数据集;步骤5:对风电机组实时运行数据进行相应的处理,基于OC-RKELM模型判断其是否为正常运行数据,当满足预警条件时,对风电机组进行预警、停机检查。本发明人为干预少、实施简单,能有效地检测出风电机组发电机运行状态的异常,可以为风电场运维提供技术支持。(A wind turbine state monitoring method based on OC-RKELM comprises the following steps: step 1: collecting operating data and environmental parameter data of a wind turbine generator; step 2: cleaning the collected wind turbine generator data; establishing a health data set of normal operation data of the wind turbine generator; and step 3: based on the health data set of the wind turbine generator, establishing mathematical description aiming at the health data set by utilizing an OC-RKELM model; and 4, step 4: determining a health threshold value based on analysis of historical normal data to judge whether the operating data belongs to a health data set of the wind turbine generator; and 5: and correspondingly processing the real-time operation data of the wind turbine generator, judging whether the real-time operation data is normal operation data or not based on the OC-RKELM model, and performing early warning and shutdown inspection on the wind turbine generator when early warning conditions are met. The method has less human intervention and simple implementation, can effectively detect the abnormity of the running state of the generator of the wind turbine generator, and can provide technical support for the operation and maintenance of the wind power plant.)

1. A wind turbine generator state monitoring method based on OC-RKELM is characterized by comprising the following steps:

step 1: collecting operating data and environmental parameter data of a wind turbine generator, wherein the operating data of the wind turbine generator comprises a pitch angle, a generator rotating speed and active power, and the environmental parameters comprise a wind speed, a wind direction and an environmental temperature;

step 2: cleaning the collected wind turbine generator data, including processing missing values and abnormal values; clearing standby data, shutdown data and outlier data; establishing a health data set of normal operation data of the wind turbine generator;

and step 3: based on a health data set of the wind turbine generator, establishing mathematical description aiming at the health data set by utilizing an OC-RKELM model, wherein a one-dimensional index output by the OC-RKELM model represents the similarity degree of input real-time operation data and the health data set of the wind turbine generator;

and 4, step 4: determining a health threshold value based on analysis of historical normal data to judge whether the operating data belongs to a health data set of the wind turbine generator;

and 5: and correspondingly processing the real-time operation data of the wind turbine generator, judging whether the real-time operation data is normal operation data or not based on the OC-RKELM model, and performing early warning and shutdown inspection on the wind turbine generator when early warning conditions are met.

2. The method for monitoring the state of the wind turbine generator based on the OC-RKELM as claimed in claim 1, wherein the process of the step 2 is as follows:

step 21: clearing missing data in the acquired data; clearing data beyond the given range of the parameters;

step 22: in order to obtain pure normal operation data of the wind turbine generator, firstly, data of the generator in standby and shutdown states are removed; secondly, removing outlier data which are seriously deviated from the healthy data cluster through a local abnormal factor outlier detection algorithm;

step 23: and correspondingly normalizing the data after data cleaning in order to eliminate dimension influence.

3. The method for monitoring the condition of the wind turbine generator based on the OC-RKELM according to claim 2, wherein in the step 2, in order to eliminate the dimensional influence, the wind turbine generator data is normalized accordingly, the normalization is a linear transformation of the original data, so that the result of the transformation falls into the [0,1] interval, and the transformation function is as follows:

the outlier data is cleaned by adopting a local abnormal factor algorithm, wherein the local abnormal factor algorithm is defined by the following parameters:

k-distance: for a certain object p in the dataset D, defining a distance k-dist (p) between p and the object o e D (p, o), the object o satisfies the following two conditions: at least k objects o 'belong to D \ p }, and D (p, o') is less than or equal to D (p, o); at most k-1 objects o 'belong to D \ p }, and D (p, o') < D (p, o) is satisfied;

k-distance neighborhood: k-distance neighborhood N of object pk(p), defined as all objects in D that are not more than k-distance from p:

Nk(p)={q∈D\{p}|d(p,q)≤k-dist(p)} (2)

the reachable distance is: object o to object p reachable distance reach distk(p, o) is defined as:

reach-distk(p,o)=max{k-dist(o),d(p,o)} (3)

i.e. the larger value in the k-distance of object p and object o;

local accessible density: local reachable density lrd of object pk(p) is defined as:

representing the ratio of objects within the k-distance neighborhood of object p to the reachable distance of object p;

local abnormality factor: the local anomaly factor for object p is defined as:

representing an average of a ratio of a local achievable density of objects within a k-distance neighborhood of p to a local achievable density of p, a local anomaly factor may describe an outlier of object p, when the local anomaly factor is less than or equal to 1, it is indicated that p is located within a data-dense region; when the local anomaly factor is greater than 1, it indicates that p may be far away from the data-dense region, with the larger the value the higher the degree of outlier of p.

4. The method for monitoring the state of the wind turbine generator based on the OC-RKELM as claimed in any one of claims 1 to 3, wherein the process of the step 3 is as follows: selecting a support vector: in order to select representative wind turbine generator operation data, a system sampling method is adopted for sampling, and the operation is as follows: parameters such as power, time and the like are segmented, corresponding sub-sample sets are divided, and samples are extracted from each sub-sample set to serve as support vectors.

5. The method for monitoring the state of the wind turbine generator based on the OC-RKELM as claimed in claim 4, wherein in the step 3, a state monitoring model of the OC-RKELM is required to be constructed. Extracting a number of health samples, X ═ X1,x2,…,xN]TThe target output vector is R ═ 1,1, …,1]T. The output is in the form of:

in the formula (I), the compound is shown in the specification,for the model output parameter, betajFor the output weights, K (-) is the kernel function,the method is a set of support vectors, selects a radial basis function as a kernel mapping function, and has the following form:

K(x,xj)=exp(-||x-xj||2/σ) (7)

where σ is a hyperparameter, xjIs a support vector;

the single class problem solved by the OC-RKELM model is described as:

in the formula, KN×L=K(X,XL) For simplified kernel matrices, training errors and output weights are balanced by introducing L2 regularization, equation (8) is rewritten as:

in the formula, ξ is a training error, λ is a regularization coefficient, and a lagrange function is introduced to solve the formula (9):

in the formula, a is a lagrange multiplier vector, and according to the carlo-kuen-tak condition, the solution of the output weight is obtained as follows:

in the formula, I is an identity matrix;

the distance of the sample from the normal class is defined as:

the greater the distance of the sample from the normal class, the more the sample deviates from the normal class.

6. The method for monitoring the state of the wind turbine generator based on the OC-RKELM as claimed in any one of claims 1 to 3, wherein the process of the step 4 is as follows: determining a health threshold by a kernel density estimation method, comprising: calculating a one-dimensional output index corresponding to normal operation data based on an OC-RKELM model, estimating a density function of the one-dimensional output index by using a nuclear density estimation method, and determining a health threshold value according to a certain confidence coefficient.

7. The method for monitoring the state of the wind turbine generator based on the OC-RKELM as claimed in claim 6, wherein in the step 4, the health threshold is determined by using a kernel density estimation method, and the decision function is defined as:

wherein Sign (·) is a Sign function, thres is a threshold, and when d (x) is less than or equal to the threshold, the monitoring data is considered to belong to normal data; when the value is larger than the threshold value, the monitoring data is considered as abnormal data.

8. The method for monitoring the state of the wind turbine generator based on the OC-RKELM as claimed in any one of claims 1 to 3, wherein the process of the step 5 is as follows:

step 51: inputting real-time operation data into the constructed OC-RKELM model, and judging the real-time operation data as abnormal operation data when the output index exceeds a health threshold value;

step 52: and carrying out statistical analysis on the output result of the OC-RKELM model, setting a proper early warning condition, and prompting the abnormality of the wind turbine generator when the early warning condition is met.

Technical Field

The invention relates to the technical field of wind power, in particular to a wind turbine generator state monitoring method based on OC-RKELM.

Background

In order to solve the problems of environmental pollution, energy crisis and the like, wind power industry is being vigorously developed in various countries in the world, and the accumulated installed capacity of wind generation sets is continuously increased. By the end of 2019, the installed capacity of global wind turbines reaches 651GW, and hundreds of thousands of wind turbines are running in a grid-connected mode. The design life of a wind turbine generator is usually 20 years, and in the service life cycle of the wind turbine generator, due to the adverse effects of various factors such as environment, load, materials and the like, the components of the wind turbine generator inevitably generate accumulated damage, resistance attenuation and function degradation, and even serious faults. The state monitoring technology can find abnormality before equipment fails, so that technical support is provided for optimizing operation and maintenance strategies and reducing operation and maintenance cost. Therefore, developing a condition monitoring technology of the wind turbine is one of the keys for improving the reliability and the economy of the wind power plant.

The existing state monitoring technology of a data acquisition and monitoring control (SCADA) system is realized only by setting a threshold value for a one-dimensional characteristic parameter, and only extreme abnormal data can be found; the existing wind power data visualization analysis technology needs professional analysts to analyze charts such as power curves of wind turbine generators or supervise wind power plants by experienced personnel, so that the labor cost is high and the real-time performance is poor; the existing state monitoring technology based on neural network modeling has the advantages that the parameters have strong randomness, and the generalization and stability of the model are difficult to guarantee.

Disclosure of Invention

In order to overcome the defects of the prior art, the invention provides a wind turbine state monitoring method based on a single-Class simplified Kernel Extreme Learning Machine (OC-RKELM).

The technical scheme adopted by the invention for solving the technical problems is as follows:

a wind turbine state monitoring method based on OC-RKELM comprises the following steps:

step 1: collecting operating data and environmental parameter data of a wind turbine generator, wherein the operating data of the wind turbine generator comprises a pitch angle, a generator rotating speed and active power, and the environmental parameters comprise a wind speed, a wind direction and an environmental temperature;

step 2: cleaning the collected wind turbine generator data, including processing missing values and abnormal values; clearing standby data, shutdown data and outlier data; establishing a health data set of normal operation data of the wind turbine generator;

and step 3: based on a health data set of the wind turbine generator, establishing mathematical description aiming at the health data set by utilizing an OC-RKELM model, wherein a one-dimensional index output by the OC-RKELM model represents the similarity degree of input real-time operation data and the health data set of the wind turbine generator;

and 4, step 4: determining a health threshold value based on analysis of historical normal data to judge whether the operating data belongs to a health data set of the wind turbine generator;

and 5: and correspondingly processing the real-time operation data of the wind turbine generator, judging whether the real-time operation data is normal operation data or not based on the OC-RKELM model, and performing early warning and shutdown inspection on the wind turbine generator when early warning conditions are met.

Further, the process of step 2 is as follows:

step 21: clearing missing data in the acquired data; clearing data beyond the given range of the parameters;

step 22: in order to obtain pure normal operation data of the wind turbine generator, firstly, data of the generator in standby and shutdown states are removed; secondly, removing outlier data which are seriously deviated from the healthy data cluster through a local abnormal factor outlier detection algorithm;

step 23: and correspondingly normalizing the data after data cleaning in order to eliminate dimension influence.

In the step 2, corresponding normalization is performed on the wind turbine generator data in order to eliminate dimensional influence. Normalization is a linear transformation of the original data, so that the result of the transformation falls in the [0,1] interval, and the conversion function is as follows:

the outlier data is cleaned by adopting a local abnormal factor algorithm, wherein the local abnormal factor algorithm is defined by the following parameters:

k-distance: for a certain object p in the dataset D, defining a distance k-dist (p) between p and the object o e D (p, o), the object o satisfies the following two conditions: at least k objects o 'belong to D \ p }, and D (p, o') is less than or equal to D (p, o); at most k-1 objects o 'belong to D \ p }, and D (p, o') < D (p, o) is satisfied;

k-distance neighborhood: k-distance neighborhood N of object pk(p), defined as all objects in D that are not more than k-distance from p:

Nk(p)={q∈D\{p}|d(p,q)≤k-dist(p)} (2)

the reachable distance is: object o to object p reachable distance reach distk(p, o) is defined as:

reach-distk(p,o)=max{k-dist(o),d(p,o)} (3)

i.e. the larger value in the k-distance of object p and object o;

local accessible density: local reachable density lrd of object pk(p) is defined as:

representing the ratio of objects within the k-distance neighborhood of object p to the reachable distance of object p;

local abnormality factor: the local anomaly factor for object p is defined as:

represents the average of the ratio of the local achievable density of objects within the k-distance neighborhood of p to the local achievable density of p. The local anomaly factor can describe the outlier degree of the object p, and when the local anomaly factor is less than or equal to 1, the object p is positioned in the data dense area; when the local anomaly factor is greater than 1, it indicates that p may be far away from the data-dense region, with the larger the value the higher the degree of outlier of p.

Still further, the process of step 3 is: selecting a support vector: in order to select representative wind turbine generator operation data, a system sampling method is adopted for sampling, and the operation is as follows: parameters such as power, time and the like are segmented, corresponding sub-sample sets are divided, and samples are extracted from each sub-sample set to serve as support vectors.

In the step 3, an OC-RKELM state monitoring model needs to be constructed. Extracting a number of health samples, X ═ X1,x2,…,xN]TThe target output vector is R ═ 1,1, …,1]T. The output is in the form of:

in the formula (I), the compound is shown in the specification,for the model output parameter, betajFor the output weights, K (-) is the kernel function,the method is a set of support vectors, selects a radial basis function as a kernel mapping function, and has the following form:

K(x,xj)=exp(-||x-xj||2/σ) (7)

where σ is a hyperparameter, xjIs a support vector;

the single class problem solved by the OC-RKELM model is described as:

in the formula, KN×L=K(X,XL) To simplify the kernel matrix, the training errors and output weights are balanced by introducing L2 regularization, and equation (8) can be rewritten as:

where ξ is the training error and λ is the regularization coefficient. And introducing a Lagrangian function to solve the formula (9):

wherein A is a Lagrange multiplier vector. According to the Karush-Kuhn-Tucker (KKT) condition, the solution of the output weight is obtained as follows:

in the formula, I is an identity matrix.

The distance of a sample from the normal class can be defined as:

the greater the distance of the sample from the normal class, the more the sample deviates from the normal class.

Further, the process of step 4 is: determining a health threshold by a kernel density estimation method, comprising: calculating a one-dimensional output index corresponding to normal operation data based on an OC-RKELM model, estimating a density function of the one-dimensional output index by using a nuclear density estimation method, and determining a health threshold value according to a certain confidence coefficient.

In step 4, a health threshold is determined by using a kernel density estimation method, and a decision function is defined as:

where Sign (. cndot.) is a Sign function and thres is a threshold. When d (x) is less than or equal to the threshold value, the monitoring data is considered to belong to normal data; when the value is larger than the threshold value, the monitoring data is considered as abnormal data.

The process of the step 5 is as follows:

step 51: inputting real-time operation data into the constructed OC-RKELM model, and judging the real-time operation data as abnormal operation data when the output index exceeds a health threshold value;

step 52: and carrying out statistical analysis on the output result of the OC-RKELM model, setting a proper early warning condition, and prompting the abnormality of the wind turbine generator when the early warning condition is met.

In the invention, the OC-RKELM method is an improved single classification method on the basis of a single classification Extreme Learning Machine (OC-ELM), and not only is the generalization capability of the model improved, but also the lower computation time complexity is kept by combining a kernel method and randomly extracting a support vector. On the basis, the method combines the characteristics of the wind turbine generator data, adopts a system sampling and hierarchical sampling method to extract a more representative support vector, and further improves the generalization capability of the model.

The invention has the following beneficial effects: according to the method, the health state of the wind turbine generator can be automatically monitored in real time only by analyzing the health data set of the wind turbine generator, and a nuclear method is introduced to effectively improve the generalization and stability of the state monitoring model.

Drawings

FIG. 1 is a flow chart of the present invention;

FIG. 2 is a schematic diagram of a shutdown data, standby data cleaning, wherein (a) indicates before cleaning and (b) indicates after cleaning;

FIG. 3 is a schematic diagram of outlier data cleaning, wherein (a) indicates before cleaning and (b) indicates after cleaning;

fig. 4 is a schematic diagram of the visualization analysis of the generator fault, in which (a) shows a distribution diagram after the data is reduced to the 8 days before the fault, (b) shows a distribution diagram after the data is reduced to the 7 days before the fault, (c) shows a distribution diagram after the data is reduced to the 6 days before the fault, and (d) shows a distribution diagram after the data is reduced to the day of the fault.

Detailed Description

The invention is further described below with reference to the accompanying drawings.

Referring to fig. 1 to 4, a wind turbine generator state monitoring method based on OC-RKELM includes the following steps:

step 1: collecting wind turbine generator operating data and environmental parameter data, wherein the wind turbine generator operating data comprises a pitch angle, a generator rotating speed, active power and the like, and the environmental parameters comprise a wind speed, a wind direction, an environmental temperature and the like;

step 2: cleaning the collected wind turbine generator data, including processing missing values and abnormal values; clearing standby data, shutdown data, outlier data and the like, wherein the main purpose of the clearing is to establish a healthy data set of normal operation data of the wind turbine;

and step 3: based on a health data set of the wind turbine generator, establishing mathematical description aiming at the health data set by utilizing an OC-RKELM model, wherein a one-dimensional index output by the OC-RKELM model represents the similarity degree of input real-time operation data and the health data set of the wind turbine generator;

and 4, step 4: determining a health threshold value based on analysis of historical normal data to judge whether the operating data belongs to a health data set of the wind turbine generator;

and 5: and correspondingly processing the real-time operation data of the wind turbine generator, and judging whether the real-time operation data is normal operation data or not based on the OC-RKELM model. And when the early warning condition is met, the wind turbine generator is subjected to early warning and shutdown inspection.

In the step 2, corresponding normalization is performed on the wind turbine generator data in order to eliminate dimensional influence. Normalization is a linear transformation of the original data, so that the result of the transformation falls in the [0,1] interval, and the conversion function is as follows:

the outlier data is cleaned by adopting a local abnormal factor algorithm, wherein the local abnormal factor algorithm is defined by the following parameters:

k-distance: for a certain object p in the dataset D, defining a distance k-dist (p) between p and the object o e D (p, o), the object o satisfies the following two conditions: at least k objects o 'belong to D \ p }, and D (p, o') is less than or equal to D (p, o); at most k-1 objects o 'belong to D \ p }, and D (p, o') < D (p, o) is satisfied;

k-distance neighborhood: k-distance neighborhood N of object pk(p), defined as all objects in D that are not more than k-distance from p:

Nk(p)={q∈D\{p}|d(p,q)≤k-dist(p)} (2)

the reachable distance is: object o to object p reachable distance reach distk(p, o) is defined as:

reach-distk(p,o)=max{k-dist(o),d(p,o)} (3)

i.e. the larger value in the k-distance of object p and object o;

local accessible density: local reachable density lrd of object pk(p) is defined as:

representing the ratio of objects within the k-distance neighborhood of object p to the reachable distance of object p;

local abnormality factor: the local anomaly factor for object p is defined as:

represents the average of the ratio of the local achievable density of objects within the k-distance neighborhood of p to the local achievable density of p. The local anomaly factor can describe the outlier degree of the object p, and when the local anomaly factor is less than or equal to 1, the object p is positioned in the data dense area; when the local anomaly factor is greater than 1, it indicates that p may be far away from the data-dense region, with the larger the value the higher the degree of outlier of p.

In the step 3, an OC-RKELM state monitoring model needs to be constructed. Extracting a number of health samples, X ═ X1,x2,…,xN]TThe target output vector is R ═ 1,1, …,1]T. The output is in the form of:

in the formula (I), the compound is shown in the specification,for the model output parameter, betajFor the output weights, K (-) is the kernel function,is a set of support vectors. The radial basis function is selected as a kernel mapping function, and the form of the kernel mapping function is as follows:

K(x,xj)=exp(-||x-xj||2/σ) (7)

where σ is a hyperparameter, xjIs a support vector.

The single classification problem solved by the OC-RKELM model can be described as:

in the formula, KN×L=K(X,XL) To simplify the kernel matrix. By balancing the training errors and output weights by introducing L2 regularization, equation (8) can be rewritten as:

where ξ is the training error and λ is the regularization coefficient. And introducing a Lagrangian function to solve the formula (9):

wherein A is a Lagrange multiplier vector. According to the Karush-Kuhn-Tucker (KKT) condition, the solution of the output weight is obtained as follows:

in the formula, I is an identity matrix.

The distance of a sample from the normal class can be defined as:

the greater the distance of the sample from the normal class, the more the sample deviates from the normal class.

In step 4, a health threshold is determined by using a kernel density estimation method, and a decision function is defined as:

where Sign (. cndot.) is a Sign function and thres is a threshold. When d (x) is less than or equal to the threshold value, the monitoring data is considered to belong to normal data; when the value is larger than the threshold value, the monitoring data is considered as abnormal data.

In order to further improve the generalization capability of the model, the characteristics of the wind turbine data are combined, and the wind turbine data with more representativeness is extracted as the support vector by adopting a system sampling and hierarchical sampling method. According to the idea of system sampling, wind turbine generator data are arranged according to a time sequence and segmented according to a certain interval: determining a segmented interval k, segmenting data, and when N/N (N is the number of the sub data sets) is an integer, taking k as N/N to obtain N sub data sets; according to the idea of hierarchical sampling, the data in the segmented sub data set is layered: dividing the data into l levels according to a certain power interval, and randomly extracting samples from the l levels. After sampling, L-n × L samples are obtained, and the samples have good representativeness and are used as support vectors in the OC-RKELM model.

In this embodiment, the condition monitoring of the generator of the wind turbine is explained, and the condition monitoring method of the wind turbine based on OC-RKELM is verified by using SCADA data, and the process is as follows:

step 1: collecting data such as operating data, environmental parameters and the like of the wind turbine generator; extracting characteristic parameters which are closely related to the health state of the generator, such as: wind speed, generator speed, winding temperature, etc.

Step 2: processing the missing value and the abnormal value; as shown in fig. 2, the standby and shutdown data are cleared; as shown in fig. 3, the outlier data is cleaned; and establishing a health data set of the normal operation data of the generator of the wind turbine generator.

And step 3: and establishing mathematical description aiming at the generator health data set by utilizing an OC-RKELM model based on the health data set of the generator of the wind turbine generator.

And 4, step 4: and determining a health threshold value based on the analysis of the historical normal data to judge whether the operation data belongs to the health data set of the wind turbine generator.

And 5: and correspondingly processing the real-time operation data of the wind turbine generator, and judging whether the real-time operation data is normal operation data or not based on the OC-RKELM model. The method successfully carries out early warning on the faults of the embodiment and carries out visual analysis on abnormal data as shown in figure 4.

The embodiments described in this specification are merely illustrative of implementations of the inventive concepts, which are intended for purposes of illustration only. The scope of the present invention should not be construed as being limited to the particular forms set forth in the examples, but rather as being defined by the claims and the equivalents thereof which can occur to those skilled in the art upon consideration of the present inventive concept.

13页详细技术资料下载
上一篇:一种医用注射器针头装配设备
下一篇:一种风电机组高度测量方法

网友询问留言

已有0条留言

还没有人留言评论。精彩留言会获得点赞!

精彩留言,会给你点赞!