Abstract:
Self-supervised contrastive learning (SCL) has emerged as a key technique for unsupervised fine-grained feature extraction in deep learning. However, SCL often degrades in fine-grained scenarios, primarily due to its excessive reliance on global representations, insufficient modeling of key regions, and underutilization of multi-scale detail. To address these limitations, this paper proposes a novel fine-grained contrastive learning method built on two core modules. A region-guided centroid regression module adaptively guides the segmentation and extraction of regions of interest, mitigating the information loss common to traditional ROI methods. A lightweight multi-scale extraction module integrates multi-scale receptive fields to improve both the efficiency and accuracy of feature representation. Experiments on multiple real-world datasets demonstrate that the proposed method yields significant improvements on tasks such as biometric recognition and image classification. Compared with mainstream SCL approaches, it reduces the equal error rate (EER) by 52.0% relative to SimCLR and by 66.7% relative to BYOL, validating its effectiveness and superiority.