基于对抗迁移学习与孪生网络的知识库问答

KNOWLEDGE BASE QUESTION ANSWERING BASED ON ADVERSARIAL TRANSFER LEARNING AND SIAMESE NETWORK

摘要: 传统问答方法通常存在效率不高以及未充分利用数据信息的问题。针对以上问题，在实体识别部分，利用对抗迁移学习融入中文分词边界信息提升实体识别准确性，同时提出基于全局指针的实体标注方法代替CRF提升模型训练效率；在谓词匹配部分，利用孪生网络的思想解决直接使用BERT获取的句向量语义表达不充分的问题。在数据集NLPCC-2016KBQA上取得了85.99%的平均F1值，表明了该方法的可行性。

Abstract: Traditional question answering methodsoften have problems of inefficiency and insufficient use of data information. In order to solve the problems above, in the entity recognition part, adversarial transfer learning was used to integrate the boundary information of Chinese word segmentation to improve the accuracy of entity recognition. At the same time, an entity labeling method was proposed based on global pointers instead of CRF to improve model training efficiency. In the predicate matching part, the Siamese network was used to solve the problem of insufficient semantic expression of sentence vectors obtained by using BERT directly. On the data set NLPCC-2016KBQA, an average F1 value of 85.99% was obtained, indicating the feasibility of this method.