基于双BERT模型的主观问答评分算法

王英瑾; 李文生

0
0
浏览
下载

摘要
关键词
基金信息
论文图表
动态公开评议
相关论文
评论

基于双BERT模型的主观问答评分算法

首发时间：2021-03-30

王英瑾 ¹
王英瑾（1996-），女，硕士研究生，主要研究方向：多媒体技术
李文生 ¹
李文生（1966-），女，副教授，主要研究方向：多媒体技术，网络信息处理

1、北京邮电大学计算机学院，北京

摘要：随着自然语言处理的快速发展，计算机已经可以构建良好的问答算法，但是仍然无法很好地从主观角度对问答进行评估。对问题和答案从不同主观角度进行评估，可以增强计算机对复杂问答内容的自动理解，进一步促进问答系统的发展。本文选用谷歌在2019年举办的"Google QUEST Q&A Labeling"比赛数据集，提出了双BERT模型，分别对问题和答案相关的主观标签进行评分。在此基础上，对比四种不同的文本截断方法处理长问答文本；针对BERT模型不同层可以捕获不同级别语义和语法信息的特点，提出了多层特征融合的方法，有效地结合了不同层的特征。在基于BERT预训练模型的基础上，本文采用了层学习率递减的微调训练策略，使得模型可以更好地拟合。实验结果表明，本文提出的算法在主观问答评分任务上表现优异。

关键词：计算机应用技术 BERT 特征融合主观问答评分

For information in English, please click here

Subjective QA scoring algorithm based on two-BERTs model

WANG Yingjin ¹
王英瑾（1996-），女，硕士研究生，主要研究方向：多媒体技术
LI Wensheng ¹
李文生（1966-），女，副教授，主要研究方向：多媒体技术，网络信息处理

1、Institute of Computer Science，Beijing University of Posts and Telecommunications , Beijing

Abstract：With the development of natural language processing, computers have been able to construct good question answering algorithms, but they cannot evaluate questions and answers well from a subjective perspective. Evaluating questions and answers from different subjective perspectives can enhance automatic understanding of the content of complex questions and answers, and further promote the development of question answering systems. The data set in this paper comes from the "Google QUEST Q&A Labeling" competition held by Google in 2019, and a method based on two-BERTs model is proposed to score the subjective labels related to the question and answer separately. On this basis, four different text truncation methods are compared to process long question and answer texts; in view of each layer of BERT captures the different features of the input text, a multi-layer feature fusion method is proposed in this paper to effectively combine different layer features. Based on the pre-training model of BERT, this paper adopts a fine-tuning training strategy with different layer learning rate, so that the model can be better fitted. Experimental results show that the algorithm proposed in this paper performs well on subjective QA scoring tasks.

Keywords： Computer application technology BERT Feature fusion Subjective QA scoring

基金：

论文图表：

引用

导出参考文献

.txt

.ris

.doc

王英瑾，李文生. 基于双BERT模型的主观问答评分算法[EB/OL]. 北京：中国科技论文在线 [2021-03-30]. https://www.paper.edu.cn/releasepaper/content/202103-354.

No.****

动态公开评议

共计0人参与

动态评论进行中

全部评论

0/1000

论文编号	202103-354
论文题目	基于双BERT模型的主观问答评分算法
文献类型
收录期刊	上传封面中文期刊英文期刊期刊名称（中文）期刊名称（英文）年，卷（）上传封面中文专著英文专著书名（中文）书名（英文）出版地出版社出版年上传封面中文译著英文译著书名（中文）书名（英文）出版地出版社出版年上传封面中文论文集英文论文集编者.论文集名称（中文） [c]. 出版地出版社出版年， - 编者.论文集名称（英文） [c]. 出版地出版社出版年，- 上传封面中文文献英文文献期刊名称（中文）期刊名称（英文）日期-- 在线地址http:// 上传封面中文文献英文文献文题（中文）文题（英文）出版地出版社,出版日期-- 上传封面中文文献英文文献文题（中文）文题（英文）出版地出版社,出版日期--
英文作者写法：中外文作者均姓前名后，姓大写，名的第一个字母大写，姓全称写出，名可只写第一个字母，其后不加实心圆点“.”, 作者之间用逗号“，”分隔，最后为实心圆点“.”, 示例1：原姓名写法：Albert Einstein,编入参考文献时写法：Einstein A. 示例2：原姓名写法：李时珍；编入参考文献时写法：LI S Z. 示例3：YELLAND R L,JONES S C,EASTON K S,et al.