Spherical array sound source direction of arrival estimation method based on atomic norm

文档序号:1002360 发布日期:2020-10-23 浏览:18次 中文

阅读说明:本技术 基于原子范数的球面阵列声源波达方向估计方法 (Spherical array sound source direction of arrival estimation method based on atomic norm ) 是由 褚志刚 杨洋 于 2020-06-16 设计创作,主要内容包括:本发明公开了基于原子范数的球面阵列声源波达方向估计方法,步骤为:1)搭建由Q个传声器(2)构成的球形传声器(2)阵列;2)声源(1)向球形传声器(2)阵列辐射声波;3)建立声源波达方向测量模型,并构建传声器(2)测量得到的声压信号矩阵P<Sup>★</Sup>;4)建立协方差矩阵<Image he="80" wi="260" file="DDA0002540654490000011.GIF" imgContent="drawing" imgFormat="GIF" orientation="portrait" inline="no"></Image>5)利用球面ESPRIT算法对协方差矩阵<Image he="81" wi="230" file="DDA0002540654490000012.GIF" imgContent="drawing" imgFormat="GIF" orientation="portrait" inline="no"></Image>进行解算,确定声源波达方向。本发明能克服球面ESPRIT在高频、相干声源或少数据快拍工况下失效的缺陷,并显著提高低SNR工况下的声源DOA估计精度,即使在存在混响的普通测试环境中,本发明仍然有效。(The invention discloses a spherical array sound source direction of arrival estimation method based on atomic norm, which comprises the following steps: 1) building a spherical microphone (2) array consisting of Q microphones (2); 2) a sound source (1) radiates sound waves to a spherical microphone (2) array; 3) establishing a sound source direction of arrival measurement model and constructing a sound pressure signal matrix P obtained by measuring a microphone (2) ★ (ii) a 4) Establishing a covariance matrix 5) Covariance matrix pair using spherical ESPRIT algorithm And resolving is carried out, and the arrival direction of the sound source is determined. The method can overcome the defect that the spherical ESPRIT fails under the working conditions of high frequency, coherent sound source or few data snapshots, obviously improve the DOA estimation precision of the sound source under the working condition of low SNR, and is still effective even in the common test environment with reverberation.)

1. The method for estimating the direction of arrival of the sound source of the spherical array based on the atomic norm is characterized by comprising the following steps of:

1) building a spherical microphone (2) array formed by the Q microphones (2); the center of the array of the spherical microphone (2) is marked as a coordinate origin; wherein the position of the q-th microphone (2) is recorded as (a, omega)Mq) (ii) a a is the array radius, Q is 1,2, …, Q; the omega is (theta, phi) represents any direction in the three-dimensional space where the spherical microphone (2) array is located; theta is equal to 0 DEG and 180 DEG]Is the elevation angle, phi belongs to [0 DEG, 360 DEG ] is the azimuth angle;

2) the sound source (1) radiates sound waves to the array of spherical microphones (2).

3) Establishing a sound source direction of arrival measurement model and constructing a sound pressure signal matrix P obtained by measuring a microphone (2)

4) Establishing a covariance matrix

5) Covariance matrix pair using spherical ESPRIT algorithmAnd resolving is carried out, and the arrival direction of the sound source is determined.

2. The method for estimating the direction of arrival of a sound source of a spherical array based on an atomic norm as claimed in claim 1 or 2, wherein: the method for establishing the sound source direction of arrival measurement model comprises the following steps:

1) calculating the transfer function t ((ka, omega) from the ith sound source to the qth microphone (2)Mq)|ΩSi) Namely:

Figure FDA0002540654460000013

in the formula, n and m are the order and the order of the spherical harmonic function respectively; bn(ka) is the modal intensity;

Figure FDA0002540654460000014

modal intensity bn(ka) is as follows:

Figure FDA0002540654460000015

in the formula, jn(ka) is an n-th order Bessel function of the first kind,is a second class ball Hankel function of order n; j'n(ka) and

Figure FDA0002540654460000017

2) calculating spherical harmonic function in omega directionNamely:

in the formula (I), the compound is shown in the specification,is a function of legendre; beta is an,m,κIs the associated Legendre function coefficient; spherical harmonic coefficient corresponding to nth order m-order item of spherical harmonic function

Figure FDA0002540654460000021

3) Establishing a vector of the directions of the Q microphones (2)And corresponding spherical harmonic vector

Figure FDA0002540654460000023

4) Establishing a transfer function matrix from each sound source to all microphones (2) and recording

Figure FDA0002540654460000026

wherein, the spherical harmonic function matrix corresponding to the direction of the microphoneAs follows:

in the formula, N0Infinity represents the highest order of the spherical harmonic;

spherical harmonic function matrix corresponding to direction of sound sourceAs follows:

Figure FDA00025406544600000210

array modal intensity matrixAs follows:

Figure FDA00025406544600000212

5) establishing a sound pressure signal matrix measured by each microphone (2)

Figure FDA00025406544600000213

in the formula (I), the compound is shown in the specification,

Figure FDA00025406544600000215

6) to spherical harmonic function matrixSpherical harmonic function matrixAnd modal intensity matrix

Figure FDA00025406544600000219

updating the sound pressure signal matrix P based on the highest order NThe following were used:

Figure FDA0002540654460000031

3. the method for estimating the direction of arrival of a sound source in a spherical array based on atomic norms as claimed in claim 1 or 2, wherein a covariance matrix is established

Figure FDA0002540654460000032

1) establishing an atomic norm minimization model, comprising the following steps:

1.1) establishing a joint Legendre function expression, namely:

Figure FDA0002540654460000033

wherein x is a function input;

wherein, the function term (x) is connected with Legendre2-1)nAs follows:

in the formula, polynomial coefficient

1.2) substituting the formula (11) into the formula (9) to obtain a legendre function expression when the legendre function power m is more than or equal to 0, namely:

1.3) determining the sine function sin θ ═ (e) for the elevation angle θ-e-jθ) /(2j) and cosine function cos θ ═ e+e-jθ)/2;

1.4) based on step 1.2) and step 1.3), the associated legendre function expression is updated as follows:

Figure FDA0002540654460000038

1.5) associated Legendre function term (e)-e-jθ)mAssociated Legendre function term (e)+e-jθ-2)o-mRespectively as follows:

1.6) based on the formula (14) and the formula (15), updating the associated Legendre function expression as follows:

Figure FDA00025406544600000311

1.7) given the function order n and the number m of stages, let the index o increase from m to n, the index u increase from 0 to m, the index v increase from 0 to o-m, the index w increase from 0 to o-m-v;

determining, for each group (o, u, v, w), an index k of 2u + v-m-w in the trigonometric polynomial expansion associated with the legendre function and coefficients in the trigonometric polynomial of the associated legendre function according to equation (16); after all groups (o, u, v, w) are calculated, the coefficients corresponding to the same kappa value are added to obtain betan,m,κ

1.8) combining equation (9) with Legendre function coefficient beta when m is greater than or equal to 0n,m,κDetermining the associated Legendre function coefficient beta when m < 0n,m,κ

1.9) determining the associated Legendre function coefficient betan,m,κThen, spherical harmonic function in omega direction is constructed

Figure FDA0002540654460000041

1.10) constructing the matrixSum matrix

In the formula, the elements in the matrix D dθSi)、dφSi) Is the basis vector that constitutes matrix D;

memory vectorThe (N + kappa) (2N +1) + N-m +1 element of (A) isn,mβn,m,κAnd k is-n, …,0, …, n, and let the other elements be 0, then the transposed conjugate of the spherical harmonic matrix is obtainedThe sound pressure signal matrix P measured by each microphone (2)≈YMNBNGDS+N;

1.11) to establish an input matrix X, namely:

in the formula (I), the compound is shown in the specification,

Figure FDA00025406544600000411

the atomic norm of equation (18) is as follows:

in the formula, "inf" represents an infimum limit;

Figure FDA00025406544600000414

collection of atomsAs follows:

Figure FDA00025406544600000416

1.12) establishing an atomic norm minimization model, namely:

Figure FDA00025406544600000417

wherein, the noise control parameter is the noise control parameter;

Figure FDA0002540654460000051

2) establishing an equivalent semi-positive planning model, comprising the following steps:

2.1) converting equation (21) into the following semi-definite programming model:

Figure FDA0002540654460000053

wherein u and E are auxiliary amounts; n is a radical ofuIs the number of elements in the auxiliary quantity u; t isb(. h) is the dual Toeplitz operator; for any given vector

Figure FDA0002540654460000054

2.2) Using the Dual Toeplitz operator Tb(u) mapping u to a block Toeplitz type Hermitian matrix of (2N +1) × (2N +1) dimensions, i.e.:

Figure FDA0002540654460000056

in the formula, each matrix is partitioned into blocks TκToeplitz matrices, both of (2N +1) × (2N +1) dimensions: kappa is more than or equal to 0 and less than or equal to 2N;

matrix block TκAs follows:

23) building a matrix

Figure FDA0002540654460000058

the van dermonde decomposition formula is shown below:

where the matrix V ═ d (Ω)S1),d(ΩS2),…,d(ΩSr)](ii) a The diagonal matrix Σ ═ diag ([ σ ])12,…,σr]);i is 1,2, …, r; r is a matrix

Figure FDA00025406544600000511

3) solving the semi-definite planning model by using an alternating direction multiplier method, comprising the following steps of:

3.1) updating a semi-definite programming model by using an alternative direction multiplier method to obtain:

in the formula, Z is an auxiliary matrix, and tau is a regularization parameter;

3.2) establishing an augmented Lagrangian function expression of formula (26), namely:

Figure FDA0002540654460000062

in the formula (I), the compound is shown in the specification,is a Hermitian Lagrange multiplier matrix; rho > 0 is a penalty parameter; "<·,·>"represents the inner product;

3.3) solving the formula (26) by using an alternative direction multiplier method, and initializing an auxiliary matrix Z0=Λ0The variables at γ +1 iterations are updated to be 0:

Figure FDA0002540654460000064

Figure FDA0002540654460000065

3.4) dividing the Hermitian Lagrange multiplier matrix and the auxiliary matrix to obtain:

Figure FDA0002540654460000066

3.5) based on step 3.3) and step 3.4), update equation (28) as follows:

Figure FDA0002540654460000068

Figure FDA0002540654460000071

in the formula I1And I2Respectively L X L and (2N +1)2×(2N+1)2A dimension unit matrix;is TbThe companion operator of (·). For any given matrix

Figure FDA00025406544600000719

3.6) based on step 3.3), update equation (29) as follows:

equation (36) represents the Hermitian matrix

Figure FDA0002540654460000079

4) based on the sum of the covariance matrix

Figure FDA00025406544600000710

4. The method for estimating the direction of arrival of a sound source of a spherical array based on atomic norms according to claim 1, wherein determining the direction of arrival of the sound source comprises the following steps:

1) eigenvalue decompositionAnd the eigenvectors are arranged in descending order according to the magnitude of the eigenvalue; writing the first s eigenvectors into the matrix USPerforming the following steps;

2) based on the spherical harmonic recursive relation, a matrix U is establishedSA related linear equation system, and a least square method is adopted to pair the matrix USSolving a related linear equation set to obtain a conversion matrix containing the direction of arrival of the sound source;

the system of linear equations is shown below:

wherein the content of the first and second substances,

Figure FDA00025406544600000714

wherein the coefficient matrix

Figure FDA00025406544600000717

3) and performing eigenvalue decomposition on the matrix containing the sound source direction of arrival by adopting a generalized eigenvalue decomposition method to determine the sound source direction of arrival.

Technical Field

The invention relates to the field of sound source identification, in particular to a spherical array sound source direction of arrival estimation method based on atomic norm.

Background

The problem Of Direction-Of-Arrival (DOA) estimation generally exists in the fields Of noise source identification, target detection, fault diagnosis and the like. The rotation invariant Signal parameter Estimation (ESPRIT) technology based on spherical microphone array measurement, referred to as spherical ESPRIT for short, is focused by the advantages of panoramic estimation of the sound source direction of arrival, low computational complexity and the like. The spherical ESPRIT takes a covariance matrix of measurement signals of an array microphone as input, and converts a sound source DOA estimation problem into a least square solution and eigenvalue decomposition problem based on a recursion relation of a spherical harmonic function. The existing spherical ESPRIT is only suitable for the situation that the spherical harmonic function sampled by the microphone satisfies orthogonality, and the discreteness of the microphone makes the order of the spherical harmonic function satisfying the characteristics not high, which limits the available upper limit frequency. Secondly, as a subspace approach, the spherical ESPRIT inevitably suffers from the inherent defect that the subspace approach fails for coherent sound sources, small number of data snapshots or low Signal-to-Noise Ratio (SNR) conditions. These problems and deficiencies constitute key obstacles that prevent the success of spherical ESPRIT in solving the various acoustic source DOA estimation problems. For the defect of limited upper limit frequency, no relevant report for solving the method is found at present.

Aiming at the defects of coherent sound source, snapshot of a few data or failure of low SNR working condition, the prior art adopts a forward and reverse average method and a frequency smoothing technology to remove the source correlation. However, the forward and backward averaging method is only suitable for the case where the number of coherent sound sources is 2, and the frequency smoothing technique is only suitable for the case where the sound sources radiate a broadband signal.

In summary, spherical ESPRIT is still open to the failure of high frequency sound sources, coherent sound sources, snapshot of few data, or low SNR conditions.

Disclosure of Invention

The present invention is directed to solving the problems of the prior art.

The technical scheme adopted for achieving the purpose of the invention is that the method for estimating the direction of arrival of the spherical array sound source based on the atomic norm comprises the following steps:

1) and building a spherical microphone array consisting of Q microphones. The spherical microphone array center is noted as the origin of coordinates. Wherein the qth microphone position is noteda is the array radius, Q is 1,2, …, Q. Omega-theta, phi represents any direction in the three-dimensional space of the spherical microphone array。θ∈[0°,180°]For elevation, φ ∈ [0 °,360 °) is the azimuth.

2) The sound source radiates sound waves towards the spherical microphone array.

3) Establishing a sound source direction of arrival measurement model and constructing a sound pressure signal matrix P obtained by microphone measurement

The method for establishing the sound source direction of arrival measurement model comprises the following steps:

3.1) calculating the transfer function from the ith sound source to the qth microphone

Figure BDA0002540654470000012

Namely:

in the formula, n and m are the order and the order of the spherical harmonic function, respectively. bn(ka) is the modal intensity.

Figure BDA0002540654470000022

Is a spherical harmonic function in the omega direction. OmegaSiIndicating the direction of arrival of the ith sound source. I is 1,2, …, I. I is the total number of sound sources. k is the wave number of the sound wave radiated by the sound source. Superscript denotes conjugation.

Modal intensity bn(ka) is as follows:

Figure BDA0002540654470000023

in the formula, jn(ka) is an n-th order Bessel function of the first kind,is a second class of spherical hank functions of order n. j'n(ka) andare respectively n-order Bessel functions j of the first kindn(ka) and n-th order second class spherical Hank functionThe first derivative of (a). The open sphere means that the microphone is arranged on the surface of the open sphere. A rigid sphere means that the microphone is arranged on the surface of a rigid sphere.

3.2) calculating spherical harmonics in the omega directionNamely:

Figure BDA0002540654470000028

in the formula (I), the compound is shown in the specification,is a function of legendre. Beta is an,m,κAlong with legendre function coefficients. Spherical harmonic coefficient corresponding to nth order m-order item of spherical harmonic function

3.3) establishing the vector of the directions of the Q microphonesAnd corresponding spherical harmonic vectorEstablishing vectors of directions of I sound sources

Figure BDA00025406544700000213

And corresponding spherical harmonic vector

3.4) establishing a transfer function matrix of each sound source to all microphones, notedThe superscript H denotes the transposed conjugate.

Wherein, the spherical harmonic moment corresponding to the direction of the microphoneMatrix of

Figure BDA00025406544700000216

As follows:

Figure BDA00025406544700000217

in the formula, N0Infinity represents the highest order of the spherical harmonics.

Spherical harmonic function matrix corresponding to direction of sound sourceAs follows:

array modal intensity matrix

Figure BDA0002540654470000032

As follows:

3.5) establishing a sound pressure signal matrix obtained by measuring each microphoneNamely:

in the formula (I), the compound is shown in the specification,is a noise matrix. Signal-to-noise ratio SNR is 20lg (| | P)-N||F/||N||F). Sound source intensity matrixAnd L is the total number of snapshots. | | non-woven hairFRepresents the Frobenius norm. Upper labelThe measured quantity is indicated.

3.6) matrix of spherical harmonics

Figure BDA0002540654470000038

Spherical harmonic function matrixAnd modal intensity matrix

Figure BDA00025406544700000310

By truncation of order n, i.e. by making a spherical harmonic matrix

Figure BDA00025406544700000311

Spherical harmonic function matrixAnd modal intensity matrixHighest order of (2)

Figure BDA00025406544700000314

Indicating an integer that rounds the value to the second nearest toward positive infinity.

Updating the sound pressure signal matrix P based on the highest order NThe following were used:

4) establishing a covariance matrix

Figure BDA00025406544700000317

Establishing a covariance matrixThe method comprises the following steps:

4.1) establishing an atomic norm minimization model, comprising the following steps:

4.1.1) establishing a joint Legendre function expression, namely:

in the formula, x is a function input.

Wherein, the function term (x) is connected with Legendre2-1)nAs follows:

Figure BDA00025406544700000320

in the formula, polynomial coefficient

Figure BDA00025406544700000322

4.1.2) substituting the formula (11) into the formula (9) to obtain a conjunctive Legendre function expression when the conjunctive Legendre function power m is more than or equal to 0, namely:

Figure BDA0002540654470000041

4.1.3) determining the sine function sin θ ═ (e) for the elevation angle θ-e-jθ) /(2j) and cosine function cos θ ═ e+e-jθ)/2。

4.1.4) based on step 4.1.2) and step 4.1.3), the associated legendre function expression is updated as follows:

Figure BDA0002540654470000042

4.1.5) associated Legendre function term (e)-e-jθ)mAssociated Legendre function term (e)+e-jθ-2)o-mRespectively as follows:

4.1.6) based on equation (14) and equation (15), the associated legendre function expression is updated as follows:

4.1.7) given a function order n and a number of stages m, let index o increase from m to n, index u increase from 0 to m, index v increase from 0 to o-m, index w increase from 0 to o-m-v.

For each set (o, u, v, w), the index k in the trigonometric polynomial expansion with legendre function is determined as 2u + v-m-w and the coefficients in the trigonometric polynomial with legendre function according to equation (16). After all groups (o, u, v, w) are calculated, the coefficients corresponding to the same kappa value are added to obtain betan,m,κ

4.1.8) associated with formula (9) and with Legendre function coefficient beta when m is greater than or equal to 0n,m,κDetermining the associated Legendre function coefficient beta when m < 0n,m,κ

4.1.9) determining the associated Legendre function coefficient betan,m,κThen, spherical harmonic function in omega direction is constructedNamely:

4.1.10) construction of matricesSum matrix

In the formula, the elements in the matrix D

Figure BDA0002540654470000051

The (N + kappa) (2N +1) + N-m +1 elements are marked as An,mβn,m,κAnd k is-n, …,0, …, n, and let the other elements be 0, then the transposed conjugate of the spherical harmonic matrix is obtained

Figure BDA0002540654470000053

Sound pressure signal matrix P measured by each microphone≈YMNBNGDS+N。

4.1.11) establishes an input matrix X, i.e.:

in the formula (I), the compound is shown in the specification,

Figure BDA0002540654470000055

is row i of S.

Figure BDA0002540654470000056

||ψi||2=1。

The atomic norm of equation (18) is as follows:

in the formula, "inf" represents an infimum limit.Is a collection of atoms.

Collection of atomsAs follows:

4.1.12) to establish an atomic norm minimization model, namely:

among these are noise control parameters.

Figure BDA00025406544700000512

Is a measure of the sparsity of sound sources in the continuous domain.

Figure BDA00025406544700000513

Representing the optimal solution to the atomic norm minimization problem.

4.2) establishing an equivalent semi-positive definite planning model, comprising the following steps:

4.2.1) converting equation (21) into the following semi-positive planning model:

wherein u and E are auxiliary amounts. N is a radical ofuIs the number of elements in the auxiliary quantity u. T isb(. cndot.) is the dual Toeplitz operator. For any given vector Is a half space of (2N,2N), Nu=8N2+4N+1。

4.2.2) Using the Dual Toeplitz operator Tb(u) mapping u to a block Toeplitz type Hermitian matrix of (2N +1) × (2N +1) dimensions, i.e.:

in the formula, each matrix is partitioned into blocks TκToeplitz matrices, both of (2N +1) × (2N +1) dimensions: kappa is more than or equal to 0 and less than or equal to 2N.

Matrix block TκAs follows:

4.2.3) building matricesVandermonde decomposition of (a), making equation (21) and equation (22) equivalent;

the van dermonde decomposition formula is shown below:

where the matrix V ═ d (Ω)S1),d(ΩS2),…,d(ΩSr)](ii) a The diagonal matrix Σ ═ diag ([ σ ])12,…,σr]);

Figure BDA0002540654470000065

i is 1,2, …, r; r is a matrix

Figure BDA0002540654470000066

The rank of (d); r.ltoreq.2N +1 is a sufficient condition for the existence of formula (25). Matrix array

Figure BDA0002540654470000067

The sum of the covariance matrices of the signals due to the individual sources in a group of sources is considered, excluding the covariance components between the signals due to different sources.

4.3) solving the semi-definite planning model by using an alternative direction multiplier method, comprising the following steps:

4.3.1) updating the semi-positive definite programming model by using an alternative direction multiplier method to obtain:

Figure BDA0002540654470000068

wherein Z is an auxiliary matrix and tau is a regularization parameter.

4.3.2) establish an augmented Lagrangian function expression of equation (26), namely:

Figure BDA0002540654470000069

in the formula (I), the compound is shown in the specification,

Figure BDA00025406544700000610

is the Hermitian lagrange multiplier matrix. ρ > 0 is a penalty parameter. "<·,·>"denotes the inner product.

4.3.3) solving the equation (26) by means of the alternative direction multiplier method, initializing the auxiliary matrix Z0=Λ0The variables at γ +1 iterations are updated to be 0:

Figure BDA0002540654470000071

Figure BDA0002540654470000072

Figure BDA0002540654470000073

4.3.4) dividing the Hermitian Lagrange multiplier matrix and the auxiliary matrix to obtain:

4.3.5) based on step 3.3) and step 3.4), update equation (28) as follows:

in the formula I1And I2Respectively L X L and (2N +1)2×(2N+1)2A dimension unit matrix.

Figure BDA00025406544700000715

Is TbThe companion operator of (·). For any given matrixM=diag([(2N+1)×[2N+1,2N,…,1],

Figure BDA00025406544700000710

Figure BDA00025406544700000711

M is a diagonal matrix. Matrix array A basic Toeplitz matrix in which the elements of the k (m) th diagonal are all 1 and the other elements are all 0.

4.3.6) based on step 3.3), update equation (29) as follows:

equation (36) represents the Hermitian matrix

Figure BDA0002540654470000081

Projection to semipositiveOn the cone, the Hermitian matrix is subjected to eigenvalue decomposition, and all negative eigenvalues are made to be.

4.4) based on the sum of the covariance matrices foundEstablishing a covariance matrix

5) Covariance matrix pair using spherical ESPRIT algorithm

Figure BDA0002540654470000084

Resolving is carried out, and the arrival direction of the sound source is determined, and the method comprises the following steps:

5.1) eigenvalue decomposition

Figure BDA0002540654470000085

And sorting the feature vectors in descending order according to the size of the feature values. Writing the first s eigenvectors into the matrix USIn (1).

5.2) establishing a matrix U based on the spherical harmonic recursive relationshipSA related linear equation system, and a least square method is adopted to pair the matrix USAnd solving the related linear equation system to obtain a conversion matrix containing the arrival direction of the sound source. The system of linear equations is as follows:

wherein the content of the first and second substances,

Figure BDA0002540654470000087

for subordinate matrix U by superscript (U, v)SIn which a part of the rows is selected to be combined into a matrix, psixy+、ψxy-And psizFor a transformation matrix containing the direction of arrival of the sound source,and

Figure BDA0002540654470000089

is a matrix of coefficients.

Wherein the coefficient matrix

Figure BDA00025406544700000810

Sum coefficient matrix

Figure BDA00025406544700000811

Respectively as follows:

Figure BDA00025406544700000812

and 5.3) carrying out eigenvalue decomposition on the matrix containing the sound source direction of arrival by adopting a generalized eigenvalue decomposition method to determine the sound source direction of arrival.

The technical effects of the invention are undoubted, the invention establishes a novel spherical ESPRIT technology based on atomic norm, and analyzes the performance based on simulation and verification tests, and the result shows that ANM + spherical ESPRIT can perfectly overcome the defect that the spherical ESPRIT fails under the working conditions of high frequency, coherent sound source or a few data snapshots, and obviously improves the estimation precision of the DOA of the sound source under the working condition of low SNR, and the invention is still effective even in the common test environment with reverberation. Secondly, the ADMM-based solving algorithm derived in the invention is more efficient than an IPM-based SDPT3 solver.

Drawings

FIG. 1 is a schematic view of a spherical microphone array and a sound source;

FIG. 2 is a sound source identification imaging plot of a single Monte Carlo calculation for different frequencies;

FIG. 2(a) is an imaging plot of the acoustic source for spherical ESPRIT at a frequency of 1000 Hz;

FIG. 2(b) is a sound source imaging plot of ANM + spherical ESPRIT at 1000Hz frequency, ANM solved by an IPM based SDPT3 solver;

FIG. 2(c) is a sound source imaging plot of ANM + spherical ESPRIT at 1000Hz frequency, ANM solved by ADMM-based solution algorithm;

FIG. 2(d) is an imaging plot of the acoustic source for spherical ESPRIT at a frequency of 3000 Hz;

FIG. 2(e) is a sound source imaging plot of ANM + spherical ESPRIT at 3000Hz frequency, ANM solved by an IPM based SDPT3 solver;

FIG. 2(f) is a sound source imaging plot of ANM + spherical ESPRIT at 3000Hz frequency, the ANM being solved by an ADMM-based solution algorithm;

FIG. 2(g) is an imaging plot of the acoustic source for spherical ESPRIT at a frequency of 1000 Hz;

FIG. 2(h) is a sound source imaging plot of ANM + spherical ESPRIT at 1000Hz frequency, ANM solved by an IPM based SDPT3 solver;

FIG. 2(i) is a sound source imaging plot of ANM + spherical ESPRIT at 1000Hz frequency, ANM solved by ADMM-based solution algorithm;

FIG. 3 is a graph of variation of time taken to estimate root mean square error for the DOA of the acoustic source and ANM solution with frequency;

FIG. 3(a) is a graph showing the variation of sigma with frequency for 100 Monte Carlo calculations

FIG. 3(b) is a comparison graph of the time consumption of two ANM solution methods;

FIG. 4 is a sound source identification imaging plot for a single Monte Carlo calculation of different sound source coherence;

FIG. 4(a) is an imaging diagram of a sound source of spherical ESPRIT when the sound sources are not coherent;

FIG. 4(b) is an imaging diagram of an ANM + spherical ESPRIT sound source when the sound sources are not coherent, the ANM being solved by an IPM-based SDPT3 solver;

FIG. 4(c) is an imaging diagram of the sound source for ANM + spherical ESPRIT when the sound sources are not coherent, the ANM being solved by an ADMM-based solution algorithm;

FIG. 4(d) is an imaging plot of a spherical ESPRIT sound source with the source partially coherent;

FIG. 4(e) is an acoustic source imaging plot of ANM + spherical ESPRIT with the acoustic source partially coherent, the ANM being solved by an IPM based SDPT3 solver;

FIG. 4(f) is an acoustic source imaging plot of ANM + spherical ESPRIT with the acoustic source partially coherent, the ANM being solved by an ADMM-based solution algorithm;

FIG. 4(g) is an imaging plot of a spherical ESPRIT sound source when the source is fully coherent;

FIG. 4(h) is an acoustic source imaging plot of ANM + spherical ESPRIT when the acoustic source is fully coherent, the ANM being solved by an IPM based SDPT3 solver;

FIG. 4(i) is an acoustic source imaging plot of ANM + spherical ESPRIT when the acoustic source is fully coherent, the ANM being solved by an ADMM-based solution algorithm;

FIG. 5 is a complementary cumulative distribution function of the root mean square error of the DOA estimation of the sound source for different total number of snapshots and different SNR;

fig. 5(a) is a complementary cumulative distribution function of sound source DOA estimation root mean square error under different total snapshots and different SNRs when T is 2 °, spherical ESPRIT, and 100 monte carlo calculations;

fig. 5(b) is a complementary cumulative distribution function of sound source DOA estimated root mean square error for different total snapshots and different SNRs when T is 2 °, ANM + sphere ESPRIT (ANM is solved by an IPM-based SDPT3 solver), and 100 monte carlo calculations;

fig. 5(c) is a complementary cumulative distribution function of sound source DOA estimation root mean square error for different total number of snapshots and different SNRs when T is 2 °, ANM + sphere ESPRIT (ANM is solved by ADMM-based solution algorithm), and 100 monte carlo calculations;

fig. 5(d) is a complementary cumulative distribution function of the sound source DOA estimated root mean square error under different total snapshots and different SNRs when T is 1 °, the spherical ESPRIT, and 100 monte carlo calculations;

fig. 5(e) is a complementary cumulative distribution function of sound source DOA estimated root mean square error for different total snapshots and different SNRs when T is 1 °, ANM + sphere ESPRIT (ANM is solved by an IPM-based SDPT3 solver), and 100 monte carlo calculations;

fig. 5(f) is a complementary cumulative distribution function of sound source DOA estimation root mean square error for different total number of snapshots and different SNRs when T is 1 °, ANM + spherical ESPRIT (ANM is solved by ADMM-based solution algorithm), and 100 monte carlo calculations;

FIG. 6(a) is a test layout in a semi-anechoic chamber;

FIG. 6(b) is a development view of the three-dimensional space in the semi-anechoic chamber;

FIG. 7 is an imaging of a test sound source in a semi-anechoic chamber;

FIG. 7(a) is an imaging diagram of a sound source with a spherical ESPRIT at a frequency of 1500Hz with a total number of snapshots of 30, with the speaker excited by a steady-state white noise signal;

FIG. 7(b) is a sound source imaging plot with a speaker excited by a steady-state white noise signal, 1500Hz frequency, and ANM + spherical ESPRIT with a total number of snapshots of 30, the ANM being solved by an IPM-based SDPT3 solver;

FIG. 7(c) is a sound source imaging plot with a speaker excited by a steady-state white noise signal, 1500Hz frequency, and ANM + spherical ESPRIT with a total number of snapshots of 30, the ANM being solved by an ADMM-based solution algorithm;

FIG. 7(d) is an image of a sound source with a spherical ESPRIT at a frequency of 3000Hz with a total number of snapshots of 30, the speaker being excited by a steady-state white noise signal;

FIG. 7(e) is a sound source imaging plot with a speaker excited by a steady-state white noise signal, 3000Hz frequency, and with ANM + spherical ESPRIT at a total snapshot count of 30, the ANM being solved by an IPM-based SDPT3 solver;

FIG. 7(f) is a sound source imaging plot with a speaker excited by a steady-state white noise signal, 3000Hz frequency, ANM + spherical ESPRIT with a total number of snapshots of 30, the ANM being solved by an ADMM-based solution algorithm;

FIG. 7(g) is an image of a sound source with a speaker excited by a steady-state white noise signal, 1500Hz frequency, using spherical ESPRIT with a total of 5 snapshots;

FIG. 7(h) is a sound source imaging plot with a speaker excited by a steady-state white noise signal, 1500Hz frequency, and ANM + sphere ESPRIT with a total number of snapshots of 5, the ANM being solved by an IPM-based SDPT3 solver;

FIG. 7(i) is a sound source imaging plot with a speaker excited by a steady-state white noise signal, 1500Hz frequency, and ANM + spherical ESPRIT with a total number of snapshots of 5, the ANM being solved by an ADMM-based solution algorithm;

FIG. 7(j) is an image of a sound source with a spherical ESPRIT at 1500Hz frequency with the total number of snapshots 1, with the speaker excited by a steady-state white noise signal;

FIG. 7(k) is a sound source imaging plot of a speaker excited by a steady-state white noise signal at 1500Hz with ANM + sphere ESPRIT at a total snapshot count of 1, the ANM being solved by an IPM-based SDPT3 solver;

FIG. 7(l) is a sound source imaging plot of a speaker excited by a steady-state white noise signal at 1500Hz with ANM + sphere ESPRIT when the total number of snapshots is 1, the ANM being solved by an ADMM-based solution algorithm;

FIG. 7(m) is an image of a sound source with a spherical ESPRIT at a frequency of 1500Hz with speakers excited by the same pure tone signal and a total number of snapshots of 30;

FIG. 7(n) is a sound source imaging plot of a speaker excited by the same pure tone signal, at 1500Hz, with ANM + spherical ESPRIT at a total snapshot count of 30, the ANM being solved by an IPM-based SDPT3 solver;

FIG. 7(o) is a sound source imaging plot of a speaker excited by the same pure tone signal at 1500Hz with a total snapshot of 30 ANM + spherical ESPRIT, the ANM being solved by an ADMM-based solution algorithm;

FIG. 8(a) is a test layout in a general room;

FIG. 8(b) is an expanded view of a three-dimensional space of a general room;

FIG. 9 is an imaging diagram of a test sound source in a general room;

FIG. 9(a) is an imaging diagram of a sound source with a spherical ESPRIT at a frequency of 1000Hz with a total number of snapshots of 30, with the speaker excited by a steady-state white noise signal;

FIG. 9(b) is a sound source imaging plot with a speaker excited by a steady-state white noise signal, at a frequency of 1000Hz, using ANM + spherical ESPRIT with a total number of snapshots of 30, the ANM being solved by an IPM-based SDPT3 solver;

FIG. 9(c) is a sound source imaging plot with a speaker excited by a steady-state white noise signal, at a frequency of 1000Hz, using ANM + spherical ESPRIT with a total number of snapshots of 30, the ANM being solved by an ADMM-based solution algorithm;

FIG. 9(d) is an imaging plot of a source with a spherical ESPRIT at a frequency of 2000Hz with the speaker excited by a steady-state white noise signal and a total number of snapshots of 30;

FIG. 9(e) is an acoustic source imaging plot with a speaker excited by a steady-state white noise signal, at 2000Hz, using an ANM + spherical ESPRIT with a total number of snapshots of 30, the ANM being solved by an IPM-based SDPT3 solver;

FIG. 9(f) is an image of a sound source with a speaker excited by a steady-state white noise signal, at a frequency of 2000Hz, using ANM + spherical ESPRIT with a total number of snapshots of 30, the ANM being solved by an ADMM-based solution algorithm;

FIG. 9(g) is an imaging diagram of a sound source with a spherical ESPRIT at a frequency of 3000Hz with a total number of snapshots of 30, with the speaker excited by a steady-state white noise signal;

FIG. 9(h) is a sound source imaging plot with a speaker excited by a steady-state white noise signal, 3000Hz frequency, and with ANM + spherical ESPRIT with a total number of snapshots of 30, the ANM is solved by an IPM-based SDPT3 solver;

FIG. 9(i) is a sound source imaging plot with a speaker excited by a steady-state white noise signal, 3000Hz frequency, and an ANM + spherical ESPRIT with a total number of snapshots of 30, the ANM being solved by an ADMM-based solution algorithm;

FIG. 9(j) is an image of a sound source with a spherical ESPRIT at a frequency of 4000Hz with a speaker excited by a steady-state white noise signal and a total number of snapshots of 30;

FIG. 9(k) is a sound source imaging plot with a speaker excited by a steady-state white noise signal, at 4000Hz frequency, using ANM + spherical ESPRIT with a total number of snapshots of 30, the ANM being solved by an IPM-based SDPT3 solver;

FIG. 9(l) is a sound source imaging plot of a speaker excited by a steady-state white noise signal, at 4000Hz, with ANM + spherical ESPRIT at a total snapshot count of 30, the ANM being solved by an ADMM-based solution algorithm;

FIG. 9(m) is an imaging diagram of a sound source with a spherical ESPRIT at a frequency of 5000Hz with a total number of snapshots of 30, with the speaker excited by a steady-state white noise signal;

FIG. 9(n) is a sound source imaging plot with a speaker excited by a steady-state white noise signal, at a frequency of 5000Hz, using ANM + spherical ESPRIT with a total number of snapshots of 30, the ANM being solved by an IPM-based SDPT3 solver;

FIG. 9(o) is a sound source imaging plot with a speaker excited by a steady-state white noise signal, at a frequency of 5000Hz, using ANM + spherical ESPRIT with a total number of snapshots of 30, the ANM being solved by an ADMM-based solution algorithm;

in the figure, a sound source 1 and a microphone 2 are shown.

Detailed Description

The present invention is further illustrated by the following examples, but it should not be construed that the scope of the above-described subject matter is limited to the following examples. Various substitutions and alterations can be made without departing from the technical idea of the invention and the scope of the invention is covered by the present invention according to the common technical knowledge and the conventional means in the field.

36页详细技术资料下载
上一篇:一种医用注射器针头装配设备
下一篇:一种基于分布式无人机移动监测的干扰源直接定位方法

网友询问留言

已有0条留言

还没有人留言评论。精彩留言会获得点赞!

精彩留言,会给你点赞!

技术分类