基于对抗训练的中文电子病历命名实体识别
DOI:
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

基金项目:

湖南省自然科学基金资助项目(2020JJ6089);湖南省教育厅科研基金资助重点项目(19A133)


Named Entity Recognition of Chinese Electronic Medical Records Based on Adversarial Training
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    为提高传统命名实体识别模型在中文电子病历上的准确性,提出一种在基线模型BERT-BiLSTM-CRF中加入对抗训练的方法,该方法在词嵌入层添加扰动因子从而生成对抗样本,并利用对抗样本进行迭代训练,从而优化模型参数。CCKS2021评测数据集实验结果表明,加入FGM和PGD两个对抗训练模型后,其精准率、召回率以及F1值相比于基线模型均有所提升。并且通过对比实验,验证了加入对抗训练能够提高模型的预测能力和鲁棒性。

    Abstract:

    In view of an improvement of the accuracy of the traditional named entity recognition model in Chinese electronic medical records, a method has thus been proposed with adversarial training added to the baseline model BERT-BILSTM-CRF. By adopting the proposed method, disturbance factors are added to the word embedding layer for the generation of adversarial samples, which will be used for an iterative training to optimize the model parameters. The experimental results of CCKS2021 evaluation data set show that the accuracy rate, recall rate and F1 value are improved compared with the baseline model with FGM and PGD confrontation training models added. Based on comparative experiments, it is verified that adding confrontation training can improve the prediction ability and robustness of the model.

    参考文献
    相似文献
    引证文献
引用本文

孔令巍,朱艳辉,张 旭,欧阳康,黄雅淋,金书川,沈加锐.基于对抗训练的中文电子病历命名实体识别[J].湖南工业大学学报,2022,36(3):36-43.

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2021-12-20
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2022-05-10
  • 出版日期: 2022-05-01