A Small-Dataset Recognition Network Based on a Self-Supervised Transformer
First published: 2023-02-20
Abstract: Vision Transformer (ViT), an architecture fundamentally different from convolutional neural networks (CNNs), offers several advantages, including design simplicity and robustness, and has achieved state-of-the-art results on many vision tasks. Unlike CNNs, however, ViT lacks inductive biases and must learn them from large-scale pre-training data, so training from scratch on small datasets performs poorly. This paper aims to design a robust training scheme for small datasets, using a two-stage approach. In the first stage, a self-supervised learning scheme is trained on the small dataset itself to learn inductive biases, yielding the initialization weights. In the second stage, the image patch-splitting stage of ViT is optimized, and the optimized model is fine-tuned on the small dataset starting from those initialization weights. Extensive experiments on a variety of public small datasets show that the proposed method outperforms existing algorithms.
Keywords: computer application technology; image recognition; ViT; self-supervised learning; small datasets
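The second stage optimizes ViT's image patch-splitting step. The abstract does not specify the optimization, but for orientation, here is a minimal NumPy sketch of the standard non-overlapping patchify that a ViT applies before embedding; the image and patch sizes are hypothetical, chosen only for illustration.

```python
import numpy as np

def patchify(img: np.ndarray, patch: int) -> np.ndarray:
    """Split an HxWxC image into non-overlapping patch tokens.

    Returns an array of shape (num_patches, patch*patch*C): the flat
    token sequence a ViT embeds before adding position encodings.
    """
    h, w, c = img.shape
    assert h % patch == 0 and w % patch == 0, "image must tile evenly"
    img = img.reshape(h // patch, patch, w // patch, patch, c)
    img = img.transpose(0, 2, 1, 3, 4)          # (gh, gw, p, p, c)
    return img.reshape(-1, patch * patch * c)   # (gh*gw, p*p*c)

# Example: a 32x32 RGB image with 8x8 patches -> 16 tokens of length 192.
tokens = patchify(np.zeros((32, 32, 3)), 8)
print(tokens.shape)  # (16, 192)
```

On small datasets the choice of patch size matters, since fewer images mean fewer token sequences to learn from; this sketch is only the baseline splitting, not the paper's optimized variant.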
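The first stage's self-supervised scheme is not detailed in the abstract. As a hedged illustration of one common pretext task for pretraining a ViT without labels, the sketch below implements MAE-style random token masking, where the encoder sees only a subset of patch tokens and the masked positions become reconstruction targets; the masking ratio and token shapes are assumptions, not the paper's design.

```python
import numpy as np

def random_masking(tokens: np.ndarray, mask_ratio: float, rng):
    """Randomly mask a fraction of ViT patch tokens (MAE-style).

    Keeps a random (1 - mask_ratio) fraction of tokens for the encoder.
    Returns (kept_tokens, mask, kept_indices); mask is True where a
    token was dropped and must be reconstructed.
    """
    n, d = tokens.shape
    n_keep = max(1, int(round(n * (1.0 - mask_ratio))))
    kept_idx = np.sort(rng.permutation(n)[:n_keep])
    mask = np.ones(n, dtype=bool)
    mask[kept_idx] = False
    return tokens[kept_idx], mask, kept_idx

# Example: mask 75% of 16 patch tokens -> the encoder sees only 4.
rng = np.random.default_rng(0)
tokens = np.zeros((16, 192))
kept, mask, kept_idx = random_masking(tokens, 0.75, rng)
print(kept.shape, int(mask.sum()))  # (4, 192) 12
```

Pretraining on such a reconstruction objective using only the small target dataset, then reusing the encoder weights for fine-tuning, matches the two-stage structure the abstract describes.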