Li Zhenqi, Hu Yao, Gao Xiang, Wen Zhiqing. ANTI-NOISE SPEECH RECOGNITION BASED ON INFORMATION FUSIONJ. Computer Applications and Software, 2025, 42(9): 189-195,269. DOI: 10.3969/j.issn.1000-386x.2025.09.025
Citation: Li Zhenqi, Hu Yao, Gao Xiang, Wen Zhiqing. ANTI-NOISE SPEECH RECOGNITION BASED ON INFORMATION FUSIONJ. Computer Applications and Software, 2025, 42(9): 189-195,269. DOI: 10.3969/j.issn.1000-386x.2025.09.025

ANTI-NOISE SPEECH RECOGNITION BASED ON INFORMATION FUSION

  • To improve the anti-noise capacity of single audio information-based continuous speech recognition approaches in a noisy environment, we propose an information fusion-based anti-noise audio-visual speech recognition (AAVSR) model. The proposed AAVSR model utilized the attention mechanism to learn the correspondences between the audio and video streams autonomously. Based on the learned correspondences, the features extracted from audio and video streams were fused to supplement the missing information of each individual modality. The fused complementary information improved information utilization and enhanced the robust recognition ability of the AAVSR model. Comprehensive simulation results on the LRS2 dataset demonstrate that AAVSR outperforms the other competing models in terms of the word error rate under a noisy environment with various signal-to-noise ratios.
  • loading

Catalog

    Turn off MathJax
    Article Contents

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return