基于可变感受野的遥感图像火点目标检测

REMOTE SENSING IMAGE FIRE POINT TARGET DETECTION BASED ON VARIABLE RECEPTIVE FIELD

  • 摘要: 针对遥感火点检测中目标尺寸跨度大、背景复杂而导致多类目标易混淆的问题,提出一种深度学习模型VRFNet,旨在优化森林火灾着火点的识别效果。VRFNet算法模型通过结合小波变换与大核卷积分解技术提取多尺度特征,有效扩展模型的感受野并减少参数数量;模型将这些多尺度特征进行平均池化和最大池化,结合注意力机制从中提取关键信息,增强特征的表达能力;通过自适应感受野选择机制,对不同尺度的感受野特征进行加权融合,以适应火点目标的尺度多样性。VRFNet在DOTAv1.0和GF-4数据集上分别达到了86.9%和73.4%的精度,较现有最先进模型(SOTA)提高了0.026,证实了其在遥感图像目标检测任务中的有效性。

     

    Abstract: In response to the challenges in remote sensing fire point detection, such as small target size, high confusion, and complex background, this study proposes an innovative deep learning model named VRFNet, aiming at optimizing the identification of forest fire ignition points. The VRFNet model extracted multi-scale features by integrating wavelet transform and large kernel convolutional decomposition techniques, effectively expanding the model's receptive field and reducing the number of parameters. The model applied average pooling and max pooling to these multi-scale features, combined with an attention mechanism to extract key information, thereby enhancing the expressive power of the features. Through an adaptive receptive field selection mechanism, the model fused features from varying receptive fields through weighted integration, accommodating the scale diversity of fire point targets. The VRFNet achieved accuracies of 86.9% and 73.4% on the DOTA v1.0 and GF-4 datasets, respectively, improving upon the state-of-the-art (SOTA) models by 0.026, thereby confirming its effectiveness in remote sensing image target detection tasks.

     

/

返回文章
返回