Visual detection of malicious documents based on deep learning

Authors: Huang Kun, Xu Yang, Zhang Sicong, Li Kezi
Affiliation: Key Laboratory of Information and Computing Science of Guizhou Province, Guizhou Normal University, Guiyang 550001, China
CLC number: TP393.08
Funding: Central Government Guided Local Science and Technology Development Special Fund (黔科中引地〔2018〕4008); Guizhou Provincial Science and Technology Plan Project (黔科合支撑[2020]2Y013号); Guizhou Provincial Graduate Research Fund Project (黔教合YJSKYJJ〔2021〕102)

Abstract:

To detect malicious PDF and DOCX documents more accurately and quickly, this paper proposes a deep-learning-based visual detection method. The method uses a Markov model to convert a document's byte sequence into a three-channel color image, yielding a visual representation that better separates malicious from benign documents, and then classifies the extracted visual features with EfficientNet-B0, a current mainstream model. Using fine-tuning, a transfer learning technique, weights pretrained for ImageNet classification initialize EfficientNet-B0, which speeds up convergence of the detection model and shortens training time. Experiments on two datasets show that the model converges faster than the same model trained from randomly initialized weights, and that its detection accuracy reaches 99.80% on malicious PDF documents and 98.14% on malicious DOCX documents, higher than that of models such as ResNet34 and MobileNetV2. Compared with the mainstream malicious document detection tools Wepawet and PJScan, the proposed method achieves better overall detection performance, which further supports its effectiveness for malicious document detection.

Key words: malicious document; EfficientNet-B0; visualization; Markov model; transfer learning
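
The byte-to-image step mentioned in the abstract can be illustrated with a minimal sketch. It builds a 256 x 256 byte transition probability matrix, which is one common way to render a byte sequence as a Markov image. The abstract does not say how the three color channels are formed or how probabilities are scaled to pixel values, so the choices below (one channel per transition lag of 1, 2, and 3 bytes, and linear scaling to 0-255) are assumptions made for illustration, not the authors' procedure. The names `transition_matrix` and `markov_image` are hypothetical.

```python
from typing import List, Sequence

Matrix = List[List[float]]


def transition_matrix(data: bytes, lag: int = 1) -> Matrix:
    """Estimate P(byte at position t+lag = j | byte at position t = i).

    Returns a 256 x 256 row-normalized matrix. Rows for byte values that
    never occur as a "current" byte are left as all zeros.
    """
    counts = [[0] * 256 for _ in range(256)]
    # Pair each byte with the byte `lag` positions after it.
    for cur, nxt in zip(data, data[lag:]):
        counts[cur][nxt] += 1

    matrix: Matrix = []
    for row in counts:
        total = sum(row)
        matrix.append([c / total if total else 0.0 for c in row])
    return matrix


def markov_image(data: bytes, lags: Sequence[int] = (1, 2, 3)) -> List[List[List[int]]]:
    """Build a channels-first image of shape [len(lags)][256][256].

    ASSUMPTION: one channel per lag, with probabilities scaled linearly
    to integers in 0..255. The source abstract does not specify either choice.
    """
    channels = [transition_matrix(data, lag) for lag in lags]
    return [[[round(255 * p) for p in row] for row in ch] for ch in channels]
```

In use, the raw bytes of a PDF or DOCX file (for example, read in binary mode) would be passed to `markov_image`, and the resulting fixed-size image could then be fed to an image classifier regardless of the original file length.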

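Cite this article

Huang Kun, Xu Yang, Zhang Sicong, Li Kezi. Visual detection of malicious documents based on deep learning[J]. Electronic Measurement Technology, 2022, 45(18): 126-133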

Published online: 2024-03-29
