Sound source tracking control method and control device, and sound source tracking system

文档序号：231721 发布日期：2021-11-09 浏览：13次中文

阅读说明：本技术 声源跟踪控制方法和控制装置、声源跟踪系统 (Sound source tracking control method and control device, and sound source tracking system ) 是由王建亭邵喜斌布占场孟智明雷利平石阳孙元慧于 2020-02-24 设计创作，主要内容包括：一种声源跟踪控制方法和控制装置(82)、声源跟踪系统。声源跟踪控制装置(82)从第一音频采集电路(F1,811)采集的第一音频信息中提取第一音频段,并同步地从第二音频采集电路(F2,812)采集的第二音频信息中提取第二音频段(101)；根据第一音频段和第二音频段中的预设峰值之间的偏差,确定第一音频段和第二音频段的第一时间偏移量(102)；根据第一时间偏移量,确定声源相距第一音频采集电路(F1,811)的第一距离和声源相距第二音频采集电路(F2,812)的第二距离的第一距离差(103)；根据第一距离差,确定声源的第一偏移角(104)；根据第一偏移角调整视频采集电路(83)的视频采集方向,以便视频采集电路(83)对准声源(105)。(A sound source tracking control method, a sound source tracking control device (82) and a sound source tracking system are provided. A sound source tracking control apparatus (82) extracts a first audio piece from first audio information collected by a first audio collecting circuit (F1,811), and synchronously extracts a second audio piece (101) from second audio information collected by a second audio collecting circuit (F2,812); determining a first time offset for the first audio piece and the second audio piece based on a deviation between preset peaks in the first audio piece and the second audio piece (102); determining a first distance difference (103) of a first distance of the sound source from the first audio acquisition circuit (F1,811) and a second distance of the sound source from the second audio acquisition circuit (F2,812) according to the first time offset; determining a first offset angle (104) of the sound source based on the first distance difference; the video capture direction of the video capture circuit (83) is adjusted according to the first offset angle so that the video capture circuit (83) is directed at the sound source (105).)

A sound source tracking control method comprising:

extracting a first audio segment from first audio information collected by a first audio collecting circuit, and synchronously extracting a second audio segment from second audio information collected by a second audio collecting circuit;

determining a first time offset for the first audio segment and the second audio segment based on a deviation between preset peaks in the first audio segment and the second audio segment;

determining a first distance difference between a first distance from a sound source to the first audio acquisition circuit and a second distance from the sound source to the second audio acquisition circuit according to the first time offset;

determining a first offset angle of the sound source according to the first distance difference;

and adjusting the video acquisition direction of a video acquisition circuit according to the first offset angle so that the video acquisition circuit is aligned with the sound source.

The control method according to claim 1, wherein the determining a first offset angle of the sound source from the first distance difference comprises:

determining a first distance parameter using the first distance difference and a distance between the first audio acquisition circuit and the second audio acquisition circuit;

and determining a first offset angle of the sound source according to the ratio of the first distance parameter to the first distance difference.

A control method as defined in claim 1, wherein determining a first time offset for the first and second audio segments based on a deviation between preset peaks in the first and second audio segments comprises:

selecting a corresponding valid positive peak value from the first audio segment and the second audio segment according to a first difference value between a maximum positive peak value sample sequence number in the first audio segment and a maximum positive peak value sample sequence number in the second audio segment, wherein the first audio segment and the second audio segment respectively comprise a plurality of sample values;

selecting a corresponding effective negative peak value from the first audio segment and the second audio segment according to a second difference value of the minimum negative peak value sampling sequence number in the first audio segment and the minimum negative peak value sampling sequence number in the second audio segment;

determining a first sampling clock deviation of the first audio segment and the second audio segment according to the sampling sequence number deviation of the corresponding effective positive peak value in the first audio segment and the second audio segment and the sampling sequence number deviation of the corresponding effective negative peak value in the first audio segment and the second audio segment;

and determining the first time offset according to the first sampling clock deviation and the sampling conversion frequency.

The control method according to claim 3, wherein:

the difference between the effective positive peak value sampling sequence number in the first audio frequency segment and the corresponding effective positive peak value sampling sequence number in the second audio frequency segment and the first difference value is within a first preset range;

and the difference between the effective negative peak value sampling sequence number in the first audio segment and the corresponding effective negative peak value sampling sequence number in the second audio segment and the second difference value is within a second preset range.

The control method according to claim 3, further comprising:

judging whether a first sum of effective positive peak values and effective negative peak values in the first audio frequency segment or the second audio frequency segment is smaller than a first preset threshold;

and if the first sum is smaller than a first preset threshold, controlling the video acquisition circuit to carry out panoramic shooting.

The control method according to claim 5, further comprising:

if the first sum value is not smaller than a first preset threshold, judging whether the number of the effective positive peak values and the number of the effective negative peak values in the first audio segment or the second audio segment are the same;

in case the number of said valid positive peaks and the number of said valid negative peaks in said first audio segment or said second audio segment are the same, further calculating a second sum of the total number of positive peaks and the total number of negative peaks in the first audio segment or the second audio segment;

and controlling the video acquisition circuit to carry out panoramic shooting in response to the fact that the ratio of the first sum value to the second sum value is greater than a second preset threshold.

The control method according to claim 3, further comprising:

calculating a third difference between the maximum positive peak sample number and the minimum negative positive peak sample number in the first audio segment;

calculating a fourth difference value between the maximum positive peak sample number and the minimum negative positive peak sample number in the second audio segment;

and in response to the third difference value and the fourth difference value being consistent in positive and negative, and the difference between the third difference value and the fourth difference value being within a third preset range, selecting a corresponding valid positive peak value in the first audio segment and the second audio segment.

The control method according to claim 3, further comprising:

calculating a fifth difference of the total number of positive peaks in the first audio segment and the total number of positive peaks in the second audio segment, and a third sum of the total number of positive peaks in the first audio segment and the total number of positive peaks in the second audio segment;

calculating a sixth difference between the total number of negative peaks in the first audio segment and the total number of negative peaks in the second audio segment, and a fourth sum of the total number of negative peaks in the first audio segment and the total number of negative peaks in the second audio segment;

in response to the ratio of the fifth difference to the third sum being within a fourth predetermined range and the ratio of the sixth difference to the fourth sum being within the fifth predetermined range, a corresponding valid positive peak is selected in the first audio piece and the second audio piece.

The control method according to any one of claims 1 to 8, further comprising:

synchronously extracting a third audio frequency band from third audio information acquired by a third audio acquisition circuit, and extracting a fourth audio frequency band from fourth audio information acquired by a fourth audio acquisition circuit;

determining a second time offset of the third audio frequency band and the fourth audio frequency band according to a deviation between preset peaks in the third audio frequency band and the fourth audio frequency band;

determining a second distance difference between a third distance from the sound source to the third audio acquisition circuit and a fourth distance from the sound source to the fourth audio acquisition circuit according to the second time offset;

determining a second offset angle of the sound source according to the second distance difference;

and adjusting the video acquisition direction of a video acquisition circuit according to the first offset angle and the second offset angle so that the video acquisition circuit is aligned with the sound source.

A sound source tracking control apparatus comprising:

the extraction module is configured to extract a first audio segment from first audio information collected by the first audio collection circuit and synchronously extract a second audio segment from second audio information collected by the second audio collection circuit;

a time offset determination module configured to determine a first time offset for the first audio piece and the second audio piece based on a deviation between preset peaks in the first audio piece and the second audio piece;

a distance difference determination module configured to determine a first distance difference between a first distance of a sound source from the first audio acquisition circuit and a second distance of the sound source from the second audio acquisition circuit according to the first time offset;

an offset angle determination module configured to determine a first offset angle of the sound source according to the first distance difference;

a direction adjustment module configured to adjust a video acquisition direction of a video acquisition circuit according to the first offset angle so that the video acquisition circuit is aligned with the sound source.

A sound source tracking control apparatus comprising:

a memory configured to store instructions;

a processor coupled to the memory, the processor configured to perform implementing the method of any of claims 1-9 based on instructions stored by the memory.

A sound source tracking system comprising the sound source tracking control apparatus according to claim 10 or 11, and

a video acquisition circuit configured to adjust a video acquisition direction according to control of the sound source tracking control device;

the first audio acquisition circuit and the second audio acquisition circuit are symmetrically arranged on two sides of the video acquisition circuit.

The tracking system of claim 12, wherein:

the ratio of the distance from the sound source to the video acquisition circuit to the distance from the first audio acquisition circuit to the second audio acquisition circuit is greater than a preset distance threshold.

The tracking system of claim 13, further comprising:

the analog-to-digital converter is used for performing analog-to-digital conversion on the audio signal acquired by the first audio acquisition circuit to generate first audio information and performing analog-to-digital conversion on the audio signal acquired by the second audio acquisition circuit to generate second audio information;

the video acquisition circuit includes: the direction control platform is configured to adjust the direction according to the control of the sound source tracking control device.

A computer-readable storage medium, wherein the computer-readable storage medium stores computer instructions which, when executed by a processor, implement the method of any one of claims 1-9.

36页详细技术资料下载

Sound source tracking control method and control device, and sound source tracking system

相关技术

网友询问留言