基于自注意力的时序动作提名生成算法

TEMPORAL ACTION PROPOSAL GENERATION ALGORITHM BASED SELF-ATTENTION

  • 摘要: 为解决BMN(Boundary Matching Network)算法在动作边界预测时只捕捉了局部时间上下文,从而导致在复杂场景中定位边界不精确的问题,提出一种基于自注意力的时序动作提名生成算法。具体来说,在BMN的骨干网络中引入局部-全局编码器(LGE)来充分挖掘局部和全局上下文信息。为解决BMN忽略提名与提名之间关系建模问题,设计一个提名关系模块,其中包含两个自注意力子模块。在AvtivityNet-1.3数据集上的实验结果表明,AUC从67.10%提高到68.20%,提高了1.1百分点,达到了先进的性能。

     

    Abstract: In order to solve the problem that BMN (boundary matching network) algorithm only captures the local temporal context when predicting the action boundary, which lead to inaccurate positioning of the boundary in complex scenes, a temporal action proposal generation algorithm based on self-attention is proposed. Specifically, a local-global encoder (LGE) was introduced in the backbone network of BMN to fully mine local and global context. In order to solve the problem that proposal-proposal ignored in BMN, a proposal relation module was designed, which included two self-attention sub-modules. The experimental results on the ActivityNet-1.3 dataset show that AUC improved from 67.10% to 68.20%, an improvement of 1.1 percentage points, reaching advanced performance.

     

/

返回文章
返回