基于ViT燃气表外观零件识别与定位方法研究

首页 > 过刊浏览>2023年第46卷第11期 >7-12

基于ViT燃气表外观零件识别与定位方法研究
DOI:
                        
                    
CSTR:
                        [cstr]
                    
作者:
                        高泽铭1高泽铭
华南理工大学机械与汽车工程学院 广州 510640
在期刊界中查找
在百度中查找
在本站中查找
刘桂雄1刘桂雄
华南理工大学机械与汽车工程学院 广州 510640
在期刊界中查找
在百度中查找
在本站中查找
陈国宇2陈国宇
广州能源检测研究院 广州 511447
在期刊界中查找
在百度中查找
在本站中查找
黄坚黄坚
1.华南理工大学机械与汽车工程学院 广州 510640；3.广州计量检测技术研究院 广州 510663
在期刊界中查找
在百度中查找
在本站中查找

                    
作者单位:1.华南理工大学机械与汽车工程学院 广州 510640; 2.广州能源检测研究院 广州 511447; 3.广州计量检测技术研究院 广州 510663
作者简介:
通讯作者:
中图分类号:TP391.4
基金项目:广东省市场监督管理局科技项目(2022CJ04)、广东省市场监督管理局科技项目(XMBH20220614019)资助

Research on recognition and localization inspection of appearance for parts of gas meter based on ViT

Author:

Gao Zeming ^¹
Gao Zeming
School of Mechanical and Automotive Engineering, South China University of Technology，Guangzhou 510640, China
在期刊界中查找
在百度中查找
在本站中查找
Liu Guixiong ^¹
Liu Guixiong
School of Mechanical and Automotive Engineering, South China University of Technology，Guangzhou 510640, China
在期刊界中查找
在百度中查找
在本站中查找
Chen Guoyu ^²
Chen Guoyu
Guangzhou Institute of Energy Testing, Guangzhou 511447, China
在期刊界中查找
在百度中查找
在本站中查找
Huang Jian
Huang Jian
1.School of Mechanical and Automotive Engineering, South China University of Technology，Guangzhou 510640, China;3.Guangzhou Institute of Measurement and Testing Technology, Guangzhou 510663, China
在期刊界中查找
在百度中查找
在本站中查找

Affiliation:

1.School of Mechanical and Automotive Engineering, South China University of Technology，Guangzhou 510640, China; 2.Guangzhou Institute of Energy Testing, Guangzhou 511447, China; 3.Guangzhou Institute of Measurement and Testing Technology, Guangzhou 510663, China

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献 [20]

引证文献

资源附件

文章评论

摘要:

关键零件完整性是燃气表重要检定要求之一，经典图像特征匹配方法实现其完整性检测，存在通用性、泛化能力较低问题。本文提出一种改进Faster R-CNN多视角燃气表关键零件识别定位方法，该方法首先采用Vision Transformer(ViT)替代Faster R-CNN卷积神经网络，其自注意力机制促进学习图像块特征之间相关性，强化表征能力；其次研究ViT优化结构参数，在Transformer层数L=14、自注意头数m=12下，模型可达到相对较优准确率。实验表明，最优模型mAP达86.71%，较ResNet50提高2.48%，与ResNet101检测准确率相当，能有效降低模型复杂性，检测效率提高5.8%；燃气表关键零件单次检测耗时1.13 s，可满足燃气表外观关键零件检测的准确性、实时性要求。

关键词:燃气表;深度学习;目标检测;Vision Transformer;自注意力

Abstract:

The completeness of key parts is an important verification requirement for gas meters. Although the traditional image feature matching method is used to realize the automation of part detection, its universality is poor. This paper proposes an improved method for Faster R-CNN to identify and locate key parts of gas meters from multiple perspectives. First, Faster R-CNN utilizes Vision Transformer (ViT) to replace the convolutional neural networks, whose self-attention mechanism can help to learn the correlation between image block features and strengthen the representation ability. And then the ViT structure with 14 Transformer layers and 12 self-attention heads is optimized to achieve optimal accuracy. Experimental results show that the mAP of the optimal model is 86.71%, 2.48% higher than that of ResNet50. It is equivalent to the detection accuracy of ResNet101, whose detection efficiency is increased by 5.8%, and effectively reduces the complexity of the model. It takes 1.13 s to accomplish the single detection of key parts of gas meter. The method balances the accuracy and real-time ability for key parts detection of gas meter.

Key words:gas meter;deep learning;object detection;Vision Transformer;self-attention

引用本文

高泽铭,刘桂雄,陈国宇,黄坚.基于ViT燃气表外观零件识别与定位方法研究[J].电子测量技术,2023,46(11):7-12

复制

文章指标

点击次数:469
下载次数: 578
HTML阅读次数: 0
引用次数: 0

历史

收稿日期:
最后修改日期:
录用日期:
在线发布日期: 2024-02-05
出版日期:

网站首页

杂志简介

过刊浏览

投稿须知

欢迎订阅

联系我们

English

引用本文

分享

文章指标

历史

文章二维码

网站首页

杂志简介

过刊浏览

投稿须知

欢迎订阅

联系我们

English

引用本文

分享

微信扫一扫：分享

文章指标

历史

文章二维码