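A Sound Source Separation Method Based on an Equilateral Triangular Microphone Array
First published: 2012-12-28
Abstract: This paper presents a method for separating overlapping speech from multiple sound sources using an equilateral triangular microphone array. First, we divide the horizontal plane evenly into several sector regions, extract the spatial features of each region, and build a precise spatial direction model for it. Then, during recording, the input audio is split into data segments of equal length, and the number and directions of the sound sources are estimated in each segment. Next, for each source, the pre-built model with the nearest azimuth is selected and adapted to obtain a precise spatial direction model of that source, and these models are used to estimate binary time-frequency masks. Finally, we separate all sources in each data block and merge the audio from all blocks that belongs to the same sector region into a single stream. Experiments in a real meeting-room environment show that our method separates the individual speech streams well with little speech distortion, and that its performance approaches that of separation under non-blind conditions.
Keywords: underdetermined blind source separation; direction-of-arrival estimation; time-frequency masking; equilateral triangular microphone array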
A Block-Based Blind Source Separation Approach with Equilateral Triangular Microphone Array
Abstract: We describe a method for separating multiple speech sources using an equilateral triangular microphone array. First, the azimuths of the horizontal plane are divided into many units, and the spatial features of some directions observed by the microphone array are modeled precisely. Second, the input mixture signals are segmented into blocks, and the number of active speakers and their directions are estimated in each block. Third, the pre-trained model with the nearest azimuth to each speaker is adapted to obtain a precise model, which is then used for time-frequency binary mask estimation. Finally, we separate every source that appears in each block and concatenate the sounds from the same unit to reproduce the whole stream. The experiments were conducted in a real meeting room. The results show that our method can separate multiple speech sources correctly with low distortion, and that it is competitive with fully non-blind separation.
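Two steps named in the abstract, choosing the pre-trained model with the nearest azimuth and estimating a binary time-frequency mask, can be illustrated with a minimal sketch. The abstract does not give the number of sectors, the form of the spatial models, or how they are adapted, so everything below is an assumption for illustration: the function names, the use of sector centers as model azimuths, and the `scores` array (a hypothetical per-source score for each time-frequency bin, standing in for whatever the adapted models would produce).

```python
def circular_distance(a_deg, b_deg):
    """Smallest absolute angular difference between two azimuths, in degrees."""
    d = abs(a_deg - b_deg) % 360.0
    return min(d, 360.0 - d)


def sector_centers(num_sectors):
    """Center azimuths of equal-width sectors covering the horizontal plane.

    Assumption: sectors start at 0 degrees; the paper's layout may differ.
    """
    width = 360.0 / num_sectors
    return [width * (i + 0.5) for i in range(num_sectors)]


def nearest_sector(azimuth_deg, centers):
    """Index of the sector whose center is closest to an estimated azimuth."""
    return min(
        range(len(centers)),
        key=lambda i: circular_distance(azimuth_deg, centers[i]),
    )


def binary_masks(scores):
    """Assign each time-frequency bin to the single best-scoring source.

    scores[s][t][f] is a hypothetical score of source s at frame t, bin f
    (for example, a likelihood under that source's spatial model).
    Returns masks[s][t][f] with values 0 or 1; exactly one source wins per bin.
    """
    num_sources = len(scores)
    num_frames = len(scores[0])
    num_bins = len(scores[0][0])
    masks = [
        [[0] * num_bins for _ in range(num_frames)]
        for _ in range(num_sources)
    ]
    for t in range(num_frames):
        for f in range(num_bins):
            winner = max(range(num_sources), key=lambda s: scores[s][t][f])
            masks[winner][t][f] = 1
    return masks
```

In such a scheme, each source's mask would be multiplied element-wise with the mixture spectrogram of a block before inverse transformation, and the sector index would decide which output stream the separated block is appended to. The circular distance matters because an estimated azimuth of 350 degrees is closer to a sector centered at 345 degrees than a plain numeric comparison near the 0/360 wrap would suggest.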