基于YOLACT与Transformer相结合的实例分割算法研究

doi:10.16818/j.issn1001-5868.2022110201

首页 > 过刊浏览>2023年第44卷第1期 >134-140. DOI:10.16818/j.issn1001-5868.2022110201

基于YOLACT与Transformer相结合的实例分割算法研究
DOI:
                        10.16818/j.issn1001-5868.2022110201
                    
作者:
                        
                        
                    
作者单位:(1. 福州大学 物理与信息工程学院, 福州 350116;2. 中国福建光电信息科学与技术创新实验室, 福州 350116;3. 福州大学 先进制造学院, 福建 泉州 362200)
作者简介:赵敬伟(1996-),男,硕士研究生,研究方向为图像处理、计算机视觉和光电显示技术；
通讯作者:
中图分类号:TP391.4
基金项目:国家重点研发计划项目(2021YFB3600603)；福建省自然科学基金项目(2020J01468).通信作者:林志贤 E-mail:lzx2005000@163.com

Research on Instance Segmentation Algorithm Based on YOLACT and Transformer

Author:

Affiliation:

(1. College of Physics and Information Engineering, Fuzhou University, Fuzhou 350116, CHN;2. Fujian Science & Technology Innovation Laboratory for Optoelectronic Information of China, Fuzhou 350116, CHN;3. School of Advanced Manufacturing, Fuzhou University, Quanzhou 362200, CHN)

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

为提高单阶段实例分割的检测精度和改善小目标漏检、错检情况,提出一种基于YOLACT改进的YOLACTR算法。该算法首先利用CNN与Transformer相结合,设计一种新的头部预测网络,对特征进一步提取,并使用双向注意力来关联同一实例的掩码信息并区分不同实例之间的掩码特征,注重特征点周围的关联信息,使得检测框的预测更加准确；然后利用多级上采样和设计的CS注意力模块结合形成掩码分支,使其融入多种不同尺度信息,并利用CS注意力来关注不同的尺度信息。在MS COCO数据上,YOLACTR算法与YOLACT算法相比,其边框和掩码检测精度分别提升了7.4%和2.9%,在小目标检测上分别提升了18.9%和13.5%。实验表明,YOLACTR算法可以在多目标复杂场景下,提升检测和分割精度以及分类的准确度,改善小目标和重叠目标漏检、错检的问题。

Abstract:

In order to improve the segmentation accuracy of single stage instance segmentation and improve the situation of missed and wrong detection of small targets, an improved YOLACTR algorithm is proposed based on YOLACT algorithm. The algorithm first used the combination of CNN and Transformer to design a new head prediction network to further extract features, and used two-way attention to correlate the mask information of the same instance and distinguish the mask features between different instances. It paid attention to the correlation information around the feature points, making the prediction of the detection box more accurate. Then the mask branch was formed by the combination of multi-level up sampling module and the designed CS attention module, which integrated a variety of different scale information. Then the CS attention module was used to pay attention to different scale information. On the MS COCO data, compared with YOLACT algorithm, YOLACTR algorithm improves the detection accuracy of box and mask by 7.4% and 2.9% respectively, and improves the detection accuracy of small targets by 18.9% and 13.5% respectively. Experiments results show that YOLACTR algorithm can improve the accuracy of detection, segmentation and classification in multi-target complex scenes, which improves the problem of missed and wrong detection of small targets and overlapping targets.

参考文献

相似文献

引证文献

引用本文

赵敬伟,林珊玲,梅婷,林志贤,郭太良.基于YOLACT与Transformer相结合的实例分割算法研究[J].半导体光电,2023,44(1):134-140. ZHAO Jingwei, LIN Shanling, MEI Ting, LIN Zhixian, GUO Tailiang. Research on Instance Segmentation Algorithm Based on YOLACT and Transformer[J].,2023,44(1):134-140.

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2022-11-02
最后修改日期:
录用日期:
在线发布日期: 2023-04-07
出版日期:

首页

期刊简介

编委会

在线订阅

不端检测

投稿须知

联系我们

资料下载

引用本文

分享

文章指标

历史

漂浮通知