基于视觉注意力机制的三维平面恢复方法
3D PLANAR RECOVERY METHOD BASED ON VISUAL ATTENTION MECHANISM
-
摘要: 当前三维平面恢复方法中存在无法对场景中小物体目标平面进行有效恢复的难题。为了解决这个难题,提出一种基于视觉注意力机制的三维平面恢复方法。具体地,对低尺度特征进行下采样并与高尺度特征进行融合。利用自适应多头注意力机制对融合特征进行编码。定义平面分割模块和平面参数估计模块,并分别使用卷积神经网络和视觉注意力机制对特征进行解码。大量实验结果表明,该方法能有效地恢复小物体目标平面表示,并且与当前最先进的三维平面恢复方法相比具有显著的竞争力。Abstract: The current 3D plane recovery method can not effectively restore the target plane of small objects in the scene. To solve this problem, a 3D plane recovery method based on visual attention mechanism is proposed. Specifically, low-scale features were down-sampled and fused with high-scale features. An adaptive multi-head attention mechanism was utilized to encode the fused features. A plane segmentation module and a plane parameter estimation module were defined, and the convolutional neural network and the visual attention mechanism decoded the features. Extensive experimental results show that the proposed method can effectively restore the object plane representation of small objects, and it is significantly competitive with the current state-of-the-art 3D plane recovery methods.
下载: