A Self-Training Approach to Domain Document-Level Relation Extraction with Large Language Models
First published: 2026-03-03
Abstract: For document-level information extraction in low-resource domains, dispersed cross-sentence evidence in long documents, noisy candidate entity pairs, and scarce annotations often jointly constrain recall. This paper proposes an LLM self-training algorithm based on data augmentation. It first performs supervised fine-tuning on a small amount of labeled data to align task definitions and structured outputs; then, guided by probabilistic re-sampling and rule-based verification, it uses an LLM to generate and filter high-confidence pseudo-labeled data to enrich long-tail relation patterns; finally, it introduces reward-based preference alignment, suppressing stochastic fluctuations through format, validity, and relation-similarity-set rewards to enable stable decision-making under noise and long contexts. Experiments on DocRED and Re-DocRED validate the effectiveness of the proposed method, and results on the cybersecurity AZERG dataset show higher recall and more robust structured extraction, indicating that this paradigm is well suited to information-dense yet annotation-scarce domain document-level extraction scenarios.
Keywords: document-level relation extraction; few-shot learning; self-augmented data generation; preference alignment; STIX threat intelligence
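The pseudo-labeling stage summarized in the abstract (generate candidate relation triples, then keep only high-confidence outputs that also pass rule-based verification) can be sketched roughly as follows. This is a minimal illustrative sketch, not the paper's implementation: the names `Triple`, `KNOWN_RELATIONS`, the confidence field, and the 0.9 threshold are all assumptions for demonstration.

```python
# Hypothetical sketch of confidence + rule-based filtering of LLM-generated
# pseudo-labels, as described in the abstract. All identifiers here are
# illustrative assumptions, not the paper's actual code or schema.
from dataclasses import dataclass

# Assumed relation schema; a real system would load the dataset's label set.
KNOWN_RELATIONS = {"uses", "targets", "located_in"}

@dataclass
class Triple:
    head: str
    relation: str
    tail: str
    confidence: float  # e.g. a model score mapped to [0, 1]

def rule_check(t: Triple) -> bool:
    """Rule-based verification: schema-legal relation, non-empty, distinct arguments."""
    return (
        t.relation in KNOWN_RELATIONS
        and bool(t.head.strip())
        and bool(t.tail.strip())
        and t.head != t.tail
    )

def filter_pseudo_labels(triples, threshold=0.9):
    """Keep only high-confidence triples that also pass the rule checks."""
    return [t for t in triples if t.confidence >= threshold and rule_check(t)]

candidates = [
    Triple("APT29", "uses", "Cobalt Strike", 0.95),    # kept
    Triple("APT29", "uses", "APT29", 0.97),            # rejected: head == tail
    Triple("malware_x", "drops", "file_y", 0.99),      # rejected: unknown relation
    Triple("APT29", "targets", "gov networks", 0.55),  # rejected: low confidence
]
kept = filter_pseudo_labels(candidates)
print([(t.head, t.relation, t.tail) for t in kept])
```

The surviving triples would then be added to the training pool for the next self-training round; in practice the threshold and rules would be tuned per dataset.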