结合特征融合与混合注意力的细粒度图像分类

COMBINING FEATURE FUSION AND MIXED ATTENTION FOR FINE-GRAINED IMAGE CLASSIFICATION

  • 摘要: 为充分提取细粒度图像中的局部关键特征,提出特征融合与混合注意力相结合的细粒度图像分类算法。该文利用 SE (Squeeze-and-Excitation Networks) 引入通道注意力,提高特征提取能力;提出特征融合,充分融合跨通道交互后的低层和高层语义信息;改进选择性稀疏采样 (Selective Sparse Sampling, S3N) 方法引入空间注意力获取显著采样图;构造一个能够端到端训练的两分支分类模型,以交叉验证的方式提高分类准确率。该算法在 CUB-200-2011、FGVC-Aircraft 和 Stanford Cars 数据集上分别达到了 87.84%、93.59% 和 94.25% 的分类准确率,优于骨干网络和当前主流算法。

     

    Abstract: In order to fully extract local key features in fine-grained images, a fine-grained image classification algorithm combining feature fusion and hybrid attention is proposed. We used SE (Squeeze-and-Excitation Networks) to introduce channel attention to improve feature extraction capabilities. We proposed feature fusion to fully fuse low-level and high-level semantic information after cross-channel interaction. We improved selective sparse sampling (S3N) method, and introduced spatial attention to obtain salient sampling maps. A two-branch classification model that could be trained end-to-end was constructed to improve the classification accuracy by cross-validation. The classification accuracies of 87.84%, 93.59% and 94.25% are achieved on the CUB-200-2011, FGVC-Aircraft and Stanford Cars datasets, respectively, outperforming the backbone network and current mainstream algorithms.

     

/

返回文章
返回