基于双BERT模型的主观问答评分算法
首发时间:2021-03-30
摘要:随着自然语言处理的快速发展,计算机已经可以构建良好的问答算法,但是仍然无法很好地从主观角度对问答进行评估。对问题和答案从不同主观角度进行评估,可以增强计算机对复杂问答内容的自动理解,进一步促进问答系统的发展。本文选用谷歌在2019年举办的"Google QUEST Q&A Labeling"比赛数据集,提出了双BERT模型,分别对问题和答案相关的主观标签进行评分。在此基础上,对比四种不同的文本截断方法处理长问答文本;针对BERT模型不同层可以捕获不同级别语义和语法信息的特点,提出了多层特征融合的方法,有效地结合了不同层的特征。在基于BERT预训练模型的基础上,本文采用了层学习率递减的微调训练策略,使得模型可以更好地拟合。实验结果表明,本文提出的算法在主观问答评分任务上表现优异。
For information in English, please click here
Subjective QA scoring algorithm based on two-BERTs model
Abstract:With the development of natural language processing, computers have been able to construct good question answering algorithms, but they cannot evaluate questions and answers well from a subjective perspective. Evaluating questions and answers from different subjective perspectives can enhance automatic understanding of the content of complex questions and answers, and further promote the development of question answering systems. The data set in this paper comes from the "Google QUEST Q&A Labeling" competition held by Google in 2019, and a method based on two-BERTs model is proposed to score the subjective labels related to the question and answer separately. On this basis, four different text truncation methods are compared to process long question and answer texts; in view of each layer of BERT captures the different features of the input text, a multi-layer feature fusion method is proposed in this paper to effectively combine different layer features. Based on the pre-training model of BERT, this paper adopts a fine-tuning training strategy with different layer learning rate, so that the model can be better fitted. Experimental results show that the algorithm proposed in this paper performs well on subjective QA scoring tasks.
Keywords: Computer application technology BERT Feature fusion Subjective QA scoring
基金:
引用
No.****
动态公开评议
共计0人参与
勘误表
基于双BERT模型的主观问答评分算法
评论
全部评论