留言板

尊敬的读者、作者、审稿人, 关于本刊的投稿、审稿、编辑和出版的任何问题, 您可以本页添加留言。我们将尽快给您答复。谢谢您的支持!

姓名
邮箱
手机号码
标题
留言内容
验证码

面向语法加权图文本的方面情感三元组抽取

韩虎 孟甜甜

韩虎,孟甜甜. 面向语法加权图文本的方面情感三元组抽取[J]. 北京航空航天大学学报,2024,50(2):409-418 doi: 10.13700/j.bh.1001-5965.2022.0443
引用本文: 韩虎,孟甜甜. 面向语法加权图文本的方面情感三元组抽取[J]. 北京航空航天大学学报,2024,50(2):409-418 doi: 10.13700/j.bh.1001-5965.2022.0443
HAN H,MENG T T. Aspect sentiment triple extraction for grammar-weighted graph text[J]. Journal of Beijing University of Aeronautics and Astronautics,2024,50(2):409-418 (in Chinese) doi: 10.13700/j.bh.1001-5965.2022.0443
Citation: HAN H,MENG T T. Aspect sentiment triple extraction for grammar-weighted graph text[J]. Journal of Beijing University of Aeronautics and Astronautics,2024,50(2):409-418 (in Chinese) doi: 10.13700/j.bh.1001-5965.2022.0443

面向语法加权图文本的方面情感三元组抽取

doi: 10.13700/j.bh.1001-5965.2022.0443
基金项目: 国家自然科学基金(62166024)
详细信息
    通讯作者:

    E-mail:hanhu_lzjtu@mail.lzjtu.cn

  • 中图分类号: TP391

Aspect sentiment triple extraction for grammar-weighted graph text

Funds: National Natural Science Foundation of China (62166024)
More Information
  • 摘要:

    方面情感三元组抽取包括方面抽取、意见抽取和方面情感分类3项任务,以管道方式解决该任务的研究方法无法利用元素之间的交互信息,同时也会造成错误传播和冗余训练。基于此,提出一种基于门控注意力和加权图文本的方面情感三元组抽取方法。采用双向长短时记忆网络学习句子的序列特征表示;利用门控注意力单元学习单词之间的线性联系;利用语法距离加权图卷积网络增强三元组元素之间的交互;利用网格标记推理策略预测三元组。在4个公开数据集上进行实验,结果表明:所提方法可以有效增强三元组元素之间的交互,提高三元组抽取的准确率;同时,所提方法的F1值分别为57.94%、70.54%、61.95%和67.66%,与基准模型相比均有所提高。

     

  • 图 1  GTS标记示例

    Figure 1.  GTS tagging example

    图 2  模型整体架构

    Figure 2.  Overall architecture of model

    图 3  门控注意力结构

    Figure 3.  Architecture of gated attention

    图 4  加权图文本架构

    Figure 4.  Architecture of weighted graph text

    图 5  方面“Foods”的依赖树语法距离

    Figure 5.  Syntax distance of dependence tree for aspect “Foods”

    图 6  方面“price”的依赖树语法距离

    Figure 6.  Syntax distance of dependence tree for aspect “price”

    图 7  GCN层数与F1的关系

    Figure 7.  Relationship between number of GCN layers and F1

    表  1  数据集统计结果

    Table  1.   Dataset statistics results

    数据集 句子
    数量
    三元组
    数量
    情感极性
    积极的
    三元组数量
    情感极性
    中立的
    三元组数量
    情感极性
    消极的
    三元组数量
    lap14训练集8991452808111533
    验证集22538319948136
    测试集33254736467116
    res14训练集125923561693172491
    验证集31558042746107
    测试集493100878468156
    res15训练集603103879929210
    验证集151239181949
    测试集32549332425144
    res16训练集8631421103655330
    验证集216348263877
    测试集3285254163079
    下载: 导出CSV

    表  2  GA-DWGT模型与基准模型结果对比

    Table  2.   Comparison of GA-DWGT model and benchmark model results %

    模型 P R F1
    lap14 res14 res15 res16 lap14 res14 res15 res16 lap14 res14 res15 res16
    Li-unified-R+PD 42.25 41.44 43.34 38.19 42.78 68.79 50.73 53.47 42.47 51.68 46.69 44.51
    Peng-unified-R+PD 40.40 44.18 40.97 46.76 47.24 62.99 54.68 62.97 43.50 51.89 46.79 53.62
    Peng-unified-R+IOG 48.62 58.89 51.70 59.25 45.52 60.41 46.04 58.09 47.02 59.64 48.71 58.67
    IMN+IOG 49.21 59.57 55.24 46.23 63.88 52.33 47.68 61.65 53.75
    PASTE 52.10 63.40 54.80 62.30 48.10 61.90 52.60 63.60 50.00 62.60 53.70 62.90
    GTS-CNN 55.93 70.79 60.09 62.63 47.52 61.71 53.57 66.98 51.38 65.94 56.64 64.73
    GTS-LSTM 59.42 67.28 63.26 66.07 45.13 61.91 50.71 65.05 51.30 64.49 56.29 65.56
    GA-DWGT 59.09 70.67 65.87 67.80 47.34 62.42 50.92 65.39 52.60 66.21 57.44 66.29
    PASTE-BERT 59.70 68.70 63.60 68.00 55.30 63.80 59.80 67.70 57.40 66.10 61.60 67.80
    GTS-BERT 57.52 70.92 59.29 68.58 51.92 69.49 58.07 66.60 54.58 70.20 58.67 67.58
    GA-DWGT-BERT 63.80 70.74 64.11 65.41 51.74 68.38 59.92 70.08 57.94 70.54 61.95 67.66
    下载: 导出CSV

    表  3  对比模型的可训练参数数量

    Table  3.   Number of trainable parameters of comparison models

    模型可训练参数数量/106
    Li-unified-R+PD0.31
    Peng-unified-R+PD0.21
    Peng-unified-R+IOG0.36
    IMN+IOG0.45
    PASTE0.71
    GTS-CNN0.47
    GTS-LSTM0.33
    GA-DWGT0.33
    下载: 导出CSV

    表  4  消融实验结果

    Table  4.   Ablation study results %

    模型 P R F1
    lap14 res14 res15 res16 lap14 res14 res15 res16 lap14 res14 res15 res16
    GA-DWGT-GAU-WG 52.29 67.75 58.79 66.46 43.85 59.79 49.89 61.19 47.71 63.18 53.98 63.72
    GA-DWGT-WG 57.07 67.78 59.65 67.68 42.20 61.01 55.01 64.29 48.52 64.22 57.23 65.94
    GA-DWGT-GAU 55.71 69.57 62.89 65.59 43.85 59.59 49.89 62.55 49.08 64.20 55.65 64.03
    GA-DWGT 59.09 70.67 65.87 67.80 47.34 62.42 50.92 65.39 52.60 66.21 57.44 66.29
    下载: 导出CSV

    表  5  案例分析

    Table  5.   Case study

    例句 正确的三元组 模型预测三元组
    GTS-CNN GTS-LSTM GA-DWGT
    Made interneting difficult
    to maintain
    (interneting, difficult, NEG) (maintain, difficult, POS) (maintain, difficult, POS) (interneting, difficult, NEG)
    Made interneting difficult to maintain
    and the screen is very sharp
    (speed, much more, POS)
    (screen, sharp, POS)
    (screen, sharp, POS) (screen, sharp, POS) (speed, much more, POS)
    (screen, sharp, POS)
    The bread is top notch
    as well
    (bread, top notch, POS) (bread, top notch, POS) (bread, top notch, POS) (bread, top notch, POS)
    The staff should be
    a bit more friendly
    (staff, friendly, NEG) (staff, more friendly, POS) (staff, more friendly, POS)
    (staff, should be, POS)
    (staff, friendly, NEG)
    (staff, should, NEG)
    The food was extremely tasty,
    creatively presented
    and the wine excellent
    (food, tasty, POS) (food,
    creatively presented, POS)
    (wine, excellent, POS)
    (food, tasty, POS) (food,
    creatively presented, POS)
    (wine, excellent, POS)
    (food, tasty, POS) (food,
    creatively presented, POS)
    (wine, excellent, POS)
    (food, excellent, POS)
    (food, tasty, POS) (food,
    creatively presented, POS)
    (wine, excellent, POS)
    下载: 导出CSV
  • [1] PONTIKI M, GALANIS D, PAVLOPOULOS J, et al. SemEval-2014 Task 4: Aspect based sentiment analysis[C]//Proceedings of the 8th International Workshop on Semantic Evaluation. Stroudsburg: ACL, 2014: 27-35.
    [2] DONG L, WEI F R, TAN C Q. Adaptive recursive neural network for target-dependent twitter sentiment classification[C]//Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics. Stroudsburg: ACL, 2014, 2: 49-54.
    [3] YIN Y C, WEI F R, DONG L, et al. Unsupervised word and dependency path embeddings for aspect term extraction[C]//Proceedings of the 25th International Joint Conference on Artificial Intelligence. Palo Alto: AAAI, 2016: 2979-2985.
    [4] YANG B S, CARDIE C. Extracting opinion expressions with semi-Markov conditional random fields[C]//Proceedings of the Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning. Stroudsburg: ACL, 2012: 1335-1345.
    [5] XU L, CHIA Y K, BING L D. Learning span-level interactions for aspect sentiment triplet extraction[C]//Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing. Stroudsburg: ACL, 2021: 4755-4766.
    [6] PENG H Y, XU L, BING L D, et al. Knowing what, how and why: A near complete solution for aspect-based sentiment analysis[C]//Proceedings of the AAAI Conference on Artificial Intelligence. Palo Alto: AAAI, 2020, 34: 8600-8607.
    [7] XU L, LI H, LU W, et al. Position-aware tagging for aspect sentiment triplet extraction[C]//Proceedings of the Conference on Empirical Methods in Natural Language Processing. Stroudsburg: ACL, 2020: 2339-2349.
    [8] ZHANG C, LI Q C, SONG D W, et al. A multi-task learning framework for opinion triplet extraction[C]//Proceedings of the Findings of the Association for Computational Linguistics. Stroudsburg: ACL, 2020: 819-828.
    [9] WU Z, YING C C, ZHAO F, et al. Grid tagging scheme for aspect-oriented fine-grained opinion extraction[C]//Proceedings of the Findings of the Association for Computational Linguistics. Stroudsburg: ACL, 2020: 2576-2585.
    [10] HUANG L Z, WANG P Y, LI S J, et al. First target and opinion then polarity: Enhancing target-opinion correlation for aspect sentiment triplet extraction[EB/OL]. (2021-11-01)[2022-05-01]. https://arxiv.org/abs/2102.08549v1.
    [11] MUKHERJEE R, NAYAK T, BUTALA Y, et al. PASTE: A tagging-free decoding framework using pointer networks for aspect sentiment triplet extraction[C]//Proceedings of the Conference on Empirical Methods in Natural Language Processing. Stroudsburg: ACL, 2021: 9279-9291.
    [12] JEFFREY P, RICHARD S, CHRISTOPHER M. Glove: Global vectors for word representation[C]//Proceedings of the Conference on Empirical Methods in Natural Language Processing. Stroudsburg: ACL, 2014: 1532-1543.
    [13] BOJANOWSKI P, GRAVE E, JOULIN A, et al. Enriching word vectors with subword information[J]. Transactions of the Association for Computational Linguistics, 2017, 5(1): 135-146.
    [14] DEVLIN J, CHANG M W, LEE K, et al. BERT: Pre-training of deep bidirectional transformers for language understanding[C]// Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Stroudsburg: ACL, 2019, 1: 4171-4186.
    [15] VASWANI A, SHAZEER N, PARMAR N, et al. Attention is all you need[C]//Proceedings of the 31st Annual Conference on Neural Information Processing Systems. La Jolla: NIPS, 2017: 5999-6009.
    [16] HUA W Z, DAI Z H, LIU H X, et al. Transformer quality in linear time[EB/OL]. (2022-02-21)[2022-05-01]. https://arxiv.org/abs/2202.10447.
    [17] 苏锦钿, 欧阳志凡, 余珊珊. 基于依存树及距离注意力的句子属性情感分类[J]. 计算机研究与发展, 2019, 56(8): 1731-1745. doi: 10.7544/issn1000-1239.2019.20190102

    SU J D, OUYANG Z F, YU S S. Aspect-level sentiment classification for sentences based on dependency tree and distance attention[J]. Journal of Computer Research and Development, 2019, 56(8): 1731-1745(in Chinese). doi: 10.7544/issn1000-1239.2019.20190102
    [18] PONTIKI M, GALANIS D, PAPAGEORGIOU H, et al. SemEval-2015 Task 12: Aspect based sentiment analysis[C]//Proceedings of the 9th International Workshop on Semantic Evaluation. Stroudsburg: ACL, 2015: 486-495.
    [19] PATERIA S, CHOUBEY P K. SemEval-2016 Task 5: Aspect based sentiment analysis[C]//Proceedings of the 10th International Workshop on Semantic Evaluation. Stroudsburg: ACL, 2016: 318-324.
    [20] LI X, BING L D, LI P J, et al. A unified model for opinion target extraction and target sentiment prediction[C]//Proceedings of the 33rd AAAI Conference on Artificial Intelligence. Palo Alto: AAAI, 2019: 6714-6721.
    [21] FAN Z F, WU Z, DAI X Y, et al. Target-oriented opinion words extraction with target-fused neural sequence labeling[C]//Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Stroudsburg: ACL, 2019, 1: 2509-2518.
    [22] HE R D, LEE W S, NG H T, et al. An interactive multi-task learning network for end-to-end aspect-based sentiment analysis[C]//Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Stroudsburg: ACL, 2020: 504-515.
  • 加载中
图(7) / 表(5)
计量
  • 文章访问数:  557
  • HTML全文浏览量:  74
  • PDF下载量:  9
  • 被引次数: 0
出版历程
  • 收稿日期:  2022-05-31
  • 录用日期:  2022-11-04
  • 网络出版日期:  2023-01-09
  • 整期出版日期:  2024-02-27

目录

    /

    返回文章
    返回
    常见问答