Cardiovascular disease identification method, device and medium of two-channel hybrid network model

Document No.: 1837428. Publication date: 2021-11-16.

Reading note: this invention, "Cardiovascular disease identification method, device and medium of two-channel hybrid network model", was created by Si Yujuan (司玉娟), Yang Weiyi (杨维熠), Zhang Gong (张弓), Feng Wenxuan (冯文轩), Fan Wei (范伟) and Sun Meiqi (孙美琪) on 2021-07-07. Abstract: The invention relates to a cardiovascular disease identification method, device and medium of a two-channel hybrid network model, comprising: performing band-aligned segmentation on the ECG signals of a dual-lead electrocardiogram to obtain dual-lead heart beats; extracting fusion features of the dual-lead heart beats through a first hybrid convolutional network; extracting two single-lead specific features of the dual-lead heart beats through a second hybrid convolutional network; processing the fusion features and the single-lead specific features through a linear support vector machine to obtain three corresponding sets of decision values; mapping the three sets of decision values into three sets of decision probabilities; and fusing the three sets of decision probabilities using a D-S model to obtain a classification result of the dual-lead electrocardiogram. Beneficial effects: the invention addresses the poor classification performance of the prior art on inter-patient data sets, imbalanced data sets and noisy data sets, and achieves better classification results.

1. A cardiovascular disease identification method of a two-channel hybrid network model, characterized by comprising the following steps:

performing band-aligned segmentation on the ECG signals of a dual-lead electrocardiogram to obtain dual-lead heart beats;

extracting fusion features of the dual-lead heart beats through a first hybrid convolutional network;

extracting two single-lead specific features of the dual-lead heart beats through a second hybrid convolutional network;

processing the fusion features and the single-lead specific features through a linear support vector machine to obtain three corresponding sets of decision values;

mapping the three sets of decision values into three sets of decision probabilities;

and fusing the three sets of decision probabilities using a D-S model to obtain a classification result of the dual-lead electrocardiogram.

2. The cardiovascular disease identification method of the two-channel hybrid network model according to claim 1, wherein performing the band-aligned segmentation on the ECG signals of the dual-lead electrocardiogram comprises:

taking an R peak of the ECG signal of the dual-lead electrocardiogram as a reference, and taking the preceding R peak and the following R peak as the R1 peak and the R2 peak, respectively;

denoting the sample 0.10 s after the R1 peak and the sample 0.10 s before the R2 peak as points A1 and A2, respectively, and denoting the samples 0.06 s before and after the R peak as points B1 and B2, respectively;

denoting the sampling points between A1 and B1, between B1 and B2, and between B2 and A2 as band X, band Y and band Z, respectively, and resampling each band to obtain a plurality of sampling points for that band;

concatenating the resampled bands X, Y and Z to obtain a heart beat whose number of sampling points is the sum of those of bands X, Y and Z;

normalizing the amplitude of each heart beat to the interval [0,1] using min-max (dispersion) normalization.
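The segmentation steps of claim 2 can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the function names `resample` and `segment_beat`, the use of linear interpolation for resampling, and the per-band point counts `n_x`, `n_y` and `n_z` are all hypothetical, since the text only fixes the 0.10 s and 0.06 s offsets and the [0,1] normalization.

```python
def resample(segment, n_out):
    """Linearly interpolate `segment` onto n_out evenly spaced points (assumed method)."""
    if len(segment) == 1:
        return [float(segment[0])] * n_out
    if n_out == 1:
        return [float(segment[0])]
    step = (len(segment) - 1) / (n_out - 1)
    out = []
    for k in range(n_out):
        pos = k * step
        lo = min(int(pos), len(segment) - 2)
        frac = pos - lo
        out.append(segment[lo] * (1 - frac) + segment[lo + 1] * frac)
    return out


def segment_beat(signal, fs, r_prev, r, r_next, n_x=50, n_y=50, n_z=100):
    """Cut one heart beat around R peak index `r` and normalize it to [0, 1].

    r_prev and r_next are the sample indices of the R1 and R2 peaks.
    n_x, n_y, n_z are illustrative band lengths, not values from the source.
    """
    a1 = r_prev + round(0.10 * fs)   # point A1: 0.10 s after R1
    a2 = r_next - round(0.10 * fs)   # point A2: 0.10 s before R2
    b1 = r - round(0.06 * fs)        # point B1: 0.06 s before R
    b2 = r + round(0.06 * fs)        # point B2: 0.06 s after R
    if not (0 <= a1 < b1 < b2 < a2 < len(signal)):
        raise ValueError("R peaks too close for band-aligned segmentation")
    band_x = signal[a1:b1 + 1]
    band_y = signal[b1:b2 + 1]
    band_z = signal[b2:a2 + 1]
    # Resample each band to a fixed length, then concatenate X, Y, Z.
    beat = resample(band_x, n_x) + resample(band_y, n_y) + resample(band_z, n_z)
    # Min-max normalization to [0, 1].
    lo, hi = min(beat), max(beat)
    if hi == lo:
        return [0.0] * len(beat)
    return [(v - lo) / (hi - lo) for v in beat]
```

Because every band is resampled to a fixed length, beats with different R-R intervals end up with the same number of samples and with the region around the R peak at the same position, which is the point of aligning the bands before feature extraction.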

3. The cardiovascular disease identification method of the two-channel hybrid network model according to claim 1, wherein the first hybrid convolutional network and the second hybrid convolutional network are a CCA-PCA convolutional network and an ICA-PCA convolutional network, respectively, wherein CCA is the canonical correlation analysis algorithm, PCA is the principal component analysis algorithm, and ICA is the independent component analysis algorithm.

4. The cardiovascular disease identification method of the two-channel hybrid network model according to claim 1, wherein extracting the fusion features of the dual-lead heart beats through the first hybrid convolutional network comprises heart beat preprocessing, CCA-PCA convolutional network construction and CCA-PCA convolutional network prediction;

the heart beat preprocessing comprises: reshaping the dual-lead heart beats to obtain electrocardiogram matrices; sampling each electrocardiogram matrix with a window of a set size to obtain a plurality of sample blocks, and vectorizing the sample blocks; removing the mean of each vector to obtain centralized vectors, and combining the centralized vectors into an initial-order matrix to be processed;

the CCA-PCA convolutional network construction comprises: constructing the canonical correlation convolution kernels of the convolution layer of the CCA-PCA convolutional network from the first eigenvector and the second eigenvector; performing two-dimensional convolution of the canonical correlation convolution kernels with the electrocardiogram matrices to obtain a set of initial-order feature blocks;

the CCA-PCA convolutional network prediction comprises: extracting principal component convolution kernels from the set of initial-order feature blocks according to the first eigenvector and the second eigenvector; calculating the corresponding secondary-order feature matrices according to the principal component convolution kernels; binarizing the secondary-order feature matrices, mapping each value to 0 or 1; calculating the corresponding decimal matrices from the secondary-order feature matrices and the mapped results; dividing each decimal matrix into a plurality of sample blocks according to a set size and a set overlap rate, processing the values of all sample blocks by histogram statistics, and converting the results into the feature vector corresponding to a single electrocardiogram matrix.

5. The cardiovascular disease identification method of the two-channel hybrid network model according to claim 4, wherein the heartbeat preprocessing comprises:

reshaping each heart beat into an electrocardiogram matrix of size m×n;

denoting separately the electrocardiogram matrices obtained from each of the two ECG leads;

extracting, with a window of size k₁×k₂ centered on each element of an electrocardiogram matrix, a series of sample blocks, and reshaping each sample block into a vector;

removing the mean of each vector to obtain a centralized vector;

combining all the centralized vectors derived from a single ECG lead into an initial-order matrix to be processed, giving X¹ and X² for the two leads.
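The preprocessing of claim 5 can be sketched as below. This is only an illustrative sketch: the helper names `centered_patches` and `build_pending_matrix` are hypothetical, and zero padding at the matrix border is an assumption, since the text does not say how windows centered on edge elements are handled.

```python
def centered_patches(matrix, k1, k2):
    """Return one mean-removed k1*k2 vector per element of `matrix`.

    Windows that extend past the border are zero-padded (an assumption).
    """
    m, n = len(matrix), len(matrix[0])
    p1, p2 = k1 // 2, k2 // 2
    columns = []
    for r in range(m):
        for c in range(n):
            patch = []
            for dr in range(-p1, k1 - p1):
                for dc in range(-p2, k2 - p2):
                    rr, cc = r + dr, c + dc
                    inside = 0 <= rr < m and 0 <= cc < n
                    patch.append(float(matrix[rr][cc]) if inside else 0.0)
            mean = sum(patch) / len(patch)
            columns.append([v - mean for v in patch])  # centralized vector
    return columns


def build_pending_matrix(matrices, k1, k2):
    """Collect the centralized vectors of all matrices of one lead.

    The result is a list of N*m*n column vectors, each of length k1*k2.
    """
    columns = []
    for matrix in matrices:
        columns.extend(centered_patches(matrix, k1, k2))
    return columns
```

Calling `build_pending_matrix` once per ECG lead yields the two initial-order matrices that the claims call X¹ and X², stored here column by column.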

6. The method for cardiovascular disease identification of the two-channel hybrid network model of claim 5, wherein the CCA-PCA convolutional network construction comprises:

calculating the vectors a₁ and b₁ according to equation (1) to construct the canonical correlation convolution kernels,

where S₁₂ is the cross-covariance matrix of X¹ and X², and S₁₁ and S₂₂ are the autocovariance matrices of X¹ and X², respectively;

optimizing equation (1) by the Lagrange multiplier method according to equation (2),

where a₁ and b₁ are obtained by maximizing J(a₁, b₁), and λ and ν are the Lagrange multipliers;

calculating the partial derivatives of J(a₁, b₁) according to equation (3),

where a₁ and b₁ are obtained as eigenvectors of the corresponding matrices;

calculating the first vector set, namely a₁ and b₁, according to equation (4);

after L₁ vector sets have been acquired, obtaining the canonical correlation convolution kernels of the convolution layer according to equation (5),

where a_l and b_l are converted into the matrices W_l^1 and W_l^2, which serve as the l-th convolution kernels for X¹ and X², respectively;

calculating 2×L₁ initial-order feature blocks according to the formula, where ∗ denotes two-dimensional convolution;

for each initial-order feature block of the c-th lead, computing all centralized sample blocks, indexed by j;

vectorizing all sample blocks and combining them into one matrix per initial-order feature block;

processing all initial-order feature blocks in this way to obtain the matrix Y^c for each lead c.
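For orientation, the optimization described in claim 6 matches the textbook form of canonical correlation analysis. The block below is a sketch under that assumption: it writes out the standard CCA objective, Lagrangian and stationarity conditions using the symbols defined in the claim (a₁, b₁, S₁₁, S₂₂, S₁₂, λ, ν), with S₂₁ = S₁₂ᵀ introduced here for convenience. The 1/2 scaling of the multipliers is a common convention and may differ from equations (1) to (3) of the original filing.

```latex
% Assumed standard CCA objective (cf. equation (1)):
\max_{a_1, b_1} \; a_1^{T} S_{12} b_1
\quad \text{s.t.} \quad a_1^{T} S_{11} a_1 = 1, \;\; b_1^{T} S_{22} b_1 = 1

% Lagrangian (cf. equation (2)):
J(a_1, b_1) = a_1^{T} S_{12} b_1
  - \frac{\lambda}{2}\left(a_1^{T} S_{11} a_1 - 1\right)
  - \frac{\nu}{2}\left(b_1^{T} S_{22} b_1 - 1\right)

% Stationarity conditions (cf. equation (3)), with S_{21} = S_{12}^{T}:
\frac{\partial J}{\partial a_1} = S_{12} b_1 - \lambda S_{11} a_1 = 0,
\qquad
\frac{\partial J}{\partial b_1} = S_{21} a_1 - \nu S_{22} b_1 = 0

% Left-multiplying by a_1^T and b_1^T gives \lambda = \nu = a_1^T S_{12} b_1,
% and eliminating b_1 (resp. a_1) yields the eigenproblems:
S_{11}^{-1} S_{12} S_{22}^{-1} S_{21} \, a_1 = \lambda^{2} a_1,
\qquad
S_{22}^{-1} S_{21} S_{11}^{-1} S_{12} \, b_1 = \lambda^{2} b_1
```

Under this reading, the L₁ vector sets are the eigenvectors belonging to the L₁ largest eigenvalues, and each a_l and b_l is reshaped into a k₁×k₂ kernel, which agrees with the mapping to W_l^1 and W_l^2 described in the claim.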

7. The method of claim 6, wherein the CCA-PCA convolutional network prediction comprises:

extracting L₂ principal component convolution kernels by equation (6),

where the eigenvectors of the covariance matrix (Y^c)(Y^c)^T are mapped to the principal component convolution kernels;

calculating the secondary-order feature blocks from the principal component convolution kernels, giving 2×L₁×L₂ secondary-order feature blocks, and concatenating the secondary-order feature blocks derived from the two ECG leads to obtain the secondary-order feature matrices;

binarizing all the secondary-order feature matrices O_{i,l} with a function H(·), where H(·) maps values greater than 0 to 1 and all other values to 0;

obtaining L₁ decimal matrices from the binarized matrices according to the formula;

partitioning each decimal matrix into B sample blocks of size u₁×u₂ with an overlap rate of R;

processing all values in each sample block using histogram statistics;

and obtaining the feature vector f_{i,1} of a single electrocardiogram matrix from the resulting histograms.
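The output stage of claim 7 (binarization with H(·), conversion to decimal matrices, and block-wise histograms) can be illustrated as follows. This is a sketch under stated assumptions: the function names are hypothetical, the bit weights 2^k (with k starting at 0) follow the usual binary-hashing convention and are not confirmed by the text, and the stride derived from the overlap rate is one plausible reading of "overlap rate R".

```python
def heaviside(x):
    """H(.): map values greater than 0 to 1 and all other values to 0."""
    return 1 if x > 0 else 0


def decimal_matrix(feature_maps):
    """Combine binarized feature maps into one decimal matrix.

    feature_maps is a list of equally sized 2-D lists; map k contributes
    bit k, i.e. weight 2**k (assumed weighting).
    """
    rows, cols = len(feature_maps[0]), len(feature_maps[0][0])
    out = [[0] * cols for _ in range(rows)]
    for k, fmap in enumerate(feature_maps):
        for r in range(rows):
            for c in range(cols):
                out[r][c] += (2 ** k) * heaviside(fmap[r][c])
    return out


def block_histograms(dec, u1, u2, overlap, n_bins):
    """Concatenate the histograms of u1 x u2 blocks taken with the given overlap rate."""
    stride1 = max(1, round(u1 * (1 - overlap)))
    stride2 = max(1, round(u2 * (1 - overlap)))
    rows, cols = len(dec), len(dec[0])
    feature = []
    for r0 in range(0, rows - u1 + 1, stride1):
        for c0 in range(0, cols - u2 + 1, stride2):
            hist = [0] * n_bins
            for r in range(r0, r0 + u1):
                for c in range(c0, c0 + u2):
                    hist[dec[r][c]] += 1
            feature.extend(hist)
    return feature
```

With L₂ binarized maps per decimal matrix, `n_bins` would be 2 to the power L₂ under the assumed weighting, and concatenating the block histograms of all L₁ decimal matrices would give a vector playing the role of f_{i,1}.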

8. The cardiovascular disease identification method of the two-channel hybrid network model according to claim 5, wherein the extracting two single-lead specific features of the two-lead heart beat through the second hybrid convolutional network comprises:

reshaping each heart beat into an electrocardiogram matrix of size n×m;

deriving N electrocardiogram matrices from the c-th ECG lead;

processing all the electrocardiogram matrices through the initial stage of the first convolution layer of the canonical correlation analysis-principal component analysis convolutional network to obtain the initial-order matrix to be processed X^c;

performing blind source separation on X according to the formula S = BVX;

where B, V and the independent component convolution kernels are obtained according to the following steps:

processing X^c with the principal component analysis algorithm to obtain a whitening matrix V^c;

the whitening matrix comprising the eigenvectors corresponding to the L₁ largest eigenvalues of the covariance matrix (X^c)(X^c)^T;

obtaining the matrix Z^c according to equation (7),

Z^c = V^c X^c (7)

processing the matrix Z^c with a Gaussian nonlinear function to obtain an orthogonal matrix B^c;

obtaining, according to D^c = B^c V^c, the unmixing matrix D^c comprising L₁ column vectors;

calculating the independent component convolution kernels according to equation (8),

where the vector d_l is mapped to the matrix W_l^c;

calculating the initial-order feature matrices from the independent component convolution kernels;

processing the initial-order feature matrices in the same way as in the principal component analysis convolution layer of the canonical correlation analysis-principal component analysis convolutional network to obtain the principal component convolution kernels;

calculating the secondary-order feature matrices from the principal component convolution kernels;

and calculating the feature vector f_i of the i-th electrocardiogram matrix according to the formulas, where the function H(·) and the function Bhist(·) with overlap rate R are the same as in the output layer of the canonical correlation analysis-principal component analysis network, and two sets of features, namely f_{i,2}, i = 1, 2, …, N and f_{i,3}, i = 1, 2, …, N, are obtained after the data of the two ECG leads are each processed through the above operations.

9. The method of claim 8, wherein processing the fusion features and the single-lead specific features through a linear support vector machine to obtain three corresponding sets of decision values, and mapping the three sets of decision values into three sets of decision probabilities, comprises:

for each pair of a feature vector f_{i,q} and a label h_i, performing the calculation according to equation (9) using a linear support vector machine with L2 regularization and an L2 loss function,

where h_i is the original label of the i-th electrocardiogram matrix, and C is the penalty coefficient, which is set to 1;

collecting all the decision values corresponding to the labels h_i and the feature vectors f_{i,q};

converting, according to equation (10), all the decision values into decision probabilities using the forward-propagation part of the softmax function,

where each decision probability corresponds to one decision value.
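The mapping from decision values to decision probabilities in claim 9 can be illustrated with a plain softmax. The sketch below assumes the standard softmax definition; the function name and the max-subtraction for numerical stability are choices made here, not details taken from equation (10).

```python
import math


def softmax(decision_values):
    """Map one sample's per-class SVM decision values to probabilities."""
    shift = max(decision_values)  # subtract the maximum to avoid overflow
    exps = [math.exp(v - shift) for v in decision_values]
    total = sum(exps)
    return [e / total for e in exps]
```

Applied separately to the decision values obtained from f_{i,1}, f_{i,2} and f_{i,3}, this would produce the three sets of decision probabilities that are later fused.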

10. The method for cardiovascular disease recognition based on the two-channel hybrid network model of claim 9, wherein the fusing the three sets of decision probabilities using the D-S model to obtain the classification result of the dual-lead electrocardiogram comprises:

the D-S combination rule is performed according to equation (11) to obtain a final classification result,

where I_i is the i-th electrocardiogram matrix, and the label h that maximizes m_a(I_i) is taken as the final predicted label.
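The fusion step of claim 10 can be sketched with Dempster's rule of combination. The sketch rests on an assumption that is not stated in the text: each set of decision probabilities assigns mass only to single class labels (no mass on sets of labels), in which case the rule reduces to a normalized element-wise product. The function names `ds_combine` and `predict_label` are hypothetical.

```python
def ds_combine(prob_sets):
    """Fuse several probability vectors over the same labels with Dempster's rule.

    Assumes every focal element is a single label, so the combined mass of a
    label is the product of its probabilities divided by the total agreement.
    """
    n_classes = len(prob_sets[0])
    joint = [1.0] * n_classes
    for probs in prob_sets:
        joint = [j * p for j, p in zip(joint, probs)]
    agreement = sum(joint)  # equals 1 - K, where K is the conflict mass
    if agreement == 0:
        raise ValueError("sources are in total conflict")
    return [j / agreement for j in joint]


def predict_label(prob_sets):
    """Return the index of the label with the largest fused mass."""
    masses = ds_combine(prob_sets)
    return max(range(len(masses)), key=masses.__getitem__)
```

For example, with three sources voting `[0.6, 0.4]`, `[0.7, 0.3]` and `[0.4, 0.6]`, the unnormalized products are 0.168 and 0.072, so the first label wins even though one source disagrees.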

11. A cardiovascular disease identification apparatus of a two-channel hybrid network model, comprising a memory, a processor and a computer program stored in the memory and executable on the processor, characterized in that the processor implements the method steps of any of claims 1 to 10 when executing the computer program.

12. A computer-readable storage medium, on which a computer program is stored which, when being executed by a processor, carries out the method of any one of claims 1 to 10.

Technical Field

The invention relates to the field of computer machine learning and biomedical signal processing, in particular to a cardiovascular disease identification method, a cardiovascular disease identification device and a cardiovascular disease identification medium of a two-channel hybrid network model.

Background

The number of patients experiencing chest pain increases each year, and one of the major causes is cardiovascular disease. In recent years, cardiovascular disease has become a fatal epidemic, accounting for one third of deaths worldwide and 48% of deaths from non-communicable diseases. According to a World Health Organization report, cardiovascular disease will cause 23 million deaths in 2030. Therefore, it is important to identify and treat potential cardiovascular disease patients at an early stage.

Currently, most doctors diagnose coronary heart disease and heart failure by observing the electrocardiogram waveform of patients. However, manual detection of these diseases is difficult, time consuming and laborious due to possible visual fatigue and minor changes in the long-term electrocardiogram. Therefore, to solve these problems, an intelligent identification system based on electrocardiogram will play an important role in the automatic diagnosis of coronary heart disease and heart failure.

Currently, most relevant studies report good performance in the classification of coronary heart disease and heart failure. For example, in 2019, Acharya et al. proposed a convolutional neural network model for the classification and feature extraction of ECG signals and obtained a heart failure recognition accuracy of 95.98%; in 2020, Lih et al. proposed a hybrid network model based on a convolutional neural network and a long short-term memory neural network for ECG signal classification, and obtained an overall accuracy of 98.5% in identifying normal, coronary heart disease, heart failure and myocardial infarction recordings.

However, these studies still leave several problems unsolved. First, most researchers have performed intra-patient experiments, which may lead to overfitting of the proposed method. In particular, an intra-patient experiment allows training and testing data to be drawn from the same person. Since the differences between heart beats in the electrocardiograms of the same person are usually small, an overfitted algorithm may perform significantly worse on the electrocardiograms of new individuals whose heart beats differ significantly from those in the training data. Therefore, in order to verify the performance of a proposed method, inter-patient experiments have to be performed. Second, most researchers have not evaluated the noise robustness of the proposed method. Electrocardiograms collected in real scenarios generally contain a certain degree of noise, which distorts the ECG waveform and makes it difficult to distinguish. However, most of the noise has already been removed from the electrocardiograms in commonly used databases. The electrocardiograms in these databases therefore have clear waveform characteristics and are used directly by most researchers, so the ability of the proposed method to handle noisy data is overlooked. To evaluate noise robustness effectively, the proposed method should be tested on multi-level noisy data with different signal-to-noise ratios. Third, these studies lack a quantitative and standardized assessment of the ability of the proposed method to process imbalanced data. In these studies, researchers randomly selected a proportion of abnormal electrocardiograms as experimental data. However, the proportion of abnormal electrocardiograms collected in a real situation is variable and unpredictable, which makes it difficult to guarantee the performance of these methods and their ability to process skewed data. Furthermore, in a real environment, normal electrocardiograms are usually far more numerous than abnormal ones. Therefore, it is extremely important to perform experiments on multi-level imbalanced data sets with fewer abnormal electrocardiograms.

Disclosure of Invention

The invention aims to at least solve one of the technical problems in the prior art, provides a cardiovascular disease identification method, a cardiovascular disease identification device and a cardiovascular disease identification medium of a dual-channel hybrid network model, solves the problem of poor classification effect of inter-patient data sets, unbalanced data sets and noisy data sets in the prior art, and has a good classification result.

The technical scheme of the invention comprises a cardiovascular disease identification method of a two-channel hybrid network model, characterized by comprising the following steps: performing band-aligned segmentation on the ECG signals of a dual-lead electrocardiogram to obtain dual-lead heart beats; extracting fusion features of the dual-lead heart beats through a first hybrid convolutional network; extracting two single-lead specific features of the dual-lead heart beats through a second hybrid convolutional network; processing the fusion features and the single-lead specific features through a linear support vector machine to obtain three corresponding sets of decision values; mapping the three sets of decision values into three sets of decision probabilities; and fusing the three sets of decision probabilities using a D-S model to obtain a classification result of the dual-lead electrocardiogram.

According to the cardiovascular disease identification method of the two-channel hybrid network model, performing the band-aligned segmentation on the ECG signals of the dual-lead electrocardiogram comprises: taking an R peak of the ECG signal of the dual-lead electrocardiogram as a reference, and taking the preceding R peak and the following R peak as the R1 peak and the R2 peak, respectively; denoting the sample 0.10 s after the R1 peak and the sample 0.10 s before the R2 peak as points A1 and A2, respectively, and denoting the samples 0.06 s before and after the R peak as points B1 and B2, respectively; denoting the sampling points between A1 and B1, between B1 and B2, and between B2 and A2 as band X, band Y and band Z, respectively, and resampling each band to obtain a plurality of sampling points for that band; concatenating the resampled bands X, Y and Z to obtain a heart beat whose number of sampling points is the sum of those of bands X, Y and Z; and normalizing the amplitude of each heart beat to the interval [0,1] using min-max (dispersion) normalization.

According to the cardiovascular disease identification method of the two-channel hybrid network model, the first hybrid convolutional network and the second hybrid convolutional network are a CCA-PCA convolutional network and an ICA-PCA convolutional network, respectively, wherein CCA is the canonical correlation analysis algorithm, PCA is the principal component analysis algorithm, and ICA is the independent component analysis algorithm.

According to the cardiovascular disease identification method of the two-channel hybrid network model, extracting the fusion features of the dual-lead heart beats through the first hybrid convolutional network comprises heart beat preprocessing, CCA-PCA convolutional network construction and CCA-PCA convolutional network prediction;

the CCA-PCA convolutional network construction comprises: constructing the canonical correlation convolution kernels of the convolution layer of the CCA-PCA convolutional network from the first eigenvector and the second eigenvector; performing two-dimensional convolution of the canonical correlation convolution kernels with all the electrocardiogram matrices to obtain a set of initial-order feature blocks;

the CCA-PCA convolutional network prediction comprises: extracting principal component convolution kernels from the set of initial-order feature blocks according to the first eigenvector and the second eigenvector; calculating the corresponding secondary-order feature matrices according to the principal component convolution kernels; binarizing the secondary-order feature matrices, mapping each value to 0 or 1; calculating the corresponding decimal matrices from the secondary-order feature matrices and the mapped results; dividing each decimal matrix into a plurality of sample blocks according to a set size and a set overlap rate, processing the values of all sample blocks by histogram statistics, and converting the results into the feature vector corresponding to a single electrocardiogram matrix.

According to the cardiovascular disease identification method of the two-channel hybrid network model, the heart beat preprocessing comprises the following steps:

reshaping each heart beat into an electrocardiogram matrix of size m×n;

denoting separately the electrocardiogram matrices obtained from each of the two ECG leads;

extracting, with a window of size k₁×k₂ centered on each element of an electrocardiogram matrix, a series of sample blocks, and reshaping each sample block into a vector;

removing the mean of each vector to obtain a centralized vector;

combining all the centralized vectors derived from a single ECG lead into an initial-order matrix to be processed, giving X¹ and X² for the two leads.

According to the cardiovascular disease identification method of the two-channel hybrid network model, the CCA-PCA convolutional network construction comprises the following steps: