基于时间尺度聚合的短语音说话人识别

王逸轩; 别红霞

0
0
浏览
下载

摘要
关键词
基金信息
论文图表
动态公开评议
相关论文
评论

基于时间尺度聚合的短语音说话人识别

首发时间：2025-04-18

王逸轩 ¹
王逸轩（1999-），男，硕士研究生，主要研究方向：语音识别，说话人识别
别红霞 ¹
别红霞（1971-），女，教授、博导，主要研究方向：多媒体信息智能与传输、工业大数据智能、智能边缘计算

1、北京邮电大学人工智能学院，北京市，100876

摘要：说话人识别技术基于个体语音特征进行身份区分，广泛应用于语音助手、智能安防等领域。然而，短语音数据由于时长有限，难以提取稳定的说话人特征，严重影响识别准确率。传统的多尺度特征聚合方法大多侧重于通道维度上的信息融合，可能无法充分捕捉短语音场景下关键的时序动态信息。本文提出了一种基于时序特征的多尺度特征聚合方法。该方法通过构建多尺度特征提取模块，有效捕捉短语音中的局部和全局时序特征。该方法可以增强不同尺度特征的互补性，在模型规模减小50%的情况下，并实现约1%的准确率提升。

关键词：人工智能说话人识别短语音特征聚合?????

For information in English, please click here

Short Speech Speaker Recognition Based on Time-Scale Aggregation

WANG Yixuan ¹
王逸轩（1999-），男，硕士研究生，主要研究方向：语音识别，说话人识别
BIE Hongxia ¹
别红霞（1971-），女，教授、博导，主要研究方向：多媒体信息智能与传输、工业大数据智能、智能边缘计算

1、School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing, 100876

Abstract：Speaker recognition distinguishes identities based on individual voice features and is widely used in voice assistants and security systems. Short speech data, due to its limited duration, makes it difficult to extract stable speaker features, which reduces recognition accuracy. Traditional multi-scale feature aggregation methods often focus on information fusion along the channel dimension but may miss important temporal dynamics in short speech scenarios. This paper proposes a multi-scale feature aggregation method based on temporal features. A multi-scale feature extraction module captures both local and global temporal features in short speech. This approach enhances the complementarity of features at different scales, maintaining recognition accuracy even when the model size is reduced by 50%, with an accuracy improvement of approximately 1%.

Keywords： artificial intelligence speaker recognition short speech feature aggregation

基金：

论文图表：

引用

导出参考文献

.txt

.ris

.doc

王逸轩，别红霞. 基于时间尺度聚合的短语音说话人识别[EB/OL]. 北京：中国科技论文在线 [2025-04-18]. https://www.paper.edu.cn/releasepaper/content/202504-171.

No.****

动态公开评议

共计0人参与

动态评论进行中

全部评论

0/1000

论文编号	202504-171
论文题目	基于时间尺度聚合的短语音说话人识别
文献类型
收录期刊	上传封面中文期刊英文期刊期刊名称（中文）期刊名称（英文）年，卷（）上传封面中文专著英文专著书名（中文）书名（英文）出版地出版社出版年上传封面中文译著英文译著书名（中文）书名（英文）出版地出版社出版年上传封面中文论文集英文论文集编者.论文集名称（中文） [c]. 出版地出版社出版年， - 编者.论文集名称（英文） [c]. 出版地出版社出版年，- 上传封面中文文献英文文献期刊名称（中文）期刊名称（英文）日期-- 在线地址http:// 上传封面中文文献英文文献文题（中文）文题（英文）出版地出版社,出版日期-- 上传封面中文文献英文文献文题（中文）文题（英文）出版地出版社,出版日期--
英文作者写法：中外文作者均姓前名后，姓大写，名的第一个字母大写，姓全称写出，名可只写第一个字母，其后不加实心圆点“.”, 作者之间用逗号“，”分隔，最后为实心圆点“.”, 示例1：原姓名写法：Albert Einstein,编入参考文献时写法：Einstein A. 示例2：原姓名写法：李时珍；编入参考文献时写法：LI S Z. 示例3：YELLAND R L,JONES S C,EASTON K S,et al.