Coreference resolution based on graph structure and multitask learning

LI Kaiyang; WANG Yaoying; ZHU Tianyou; LI Jiwei; REN Junda; CHEN Zhenyu

doi:10.13700/j.bh.1001-5965.2022.0941

Volume 50 Issue 12

Dec. 2024

Turn off MathJax

Article Contents

Abstract

References

Journal of Beijing University of Aeronautics and Astronautics > 2024 > 50(12): 3825-3833.

LI K Y，WANG Y Y，ZHU T Y，et al. Coreference resolution based on graph structure and multitask learning[J]. Journal of Beijing University of Aeronautics and Astronautics，2024，50（12）：3825-3833 （in Chinese） doi: 10.13700/j.bh.1001-5965.2022.0941

Citation:

PDF( 843 KB)

Coreference resolution based on graph structure and multitask learning

doi: 10.13700/j.bh.1001-5965.2022.0941

Big Data Center of State Grid Corporation of China，Beijing 100031，China

Funds: Technology Project of Big Data Center of State Grid Corporation of China (SGSJ0000YFJS2200066)

More Information

Corresponding author: E-mail：kaiyang2@163.com
Received Date: 24 Nov 2022
Accepted Date: 29 May 2023

Available Online: 10 Jul 2023

Publish Date: 06 Jul 2023

Abstract

Abstract

Coreference resolution is an important task in the domain of natural language processing. Learning effective referential feature representation is a core problem of coreference resolution. It is ineffective to reflect the internal relationships between the information, such as named entities in text fragments and coreference pairs, because the majority of current research views the identification of reference text fragments and the prediction of coreference relationships as two stages of learning. This research proposes a new model of coreference resolution based on graph structure and multitask learning. It combines sequence semantics and structure information to learn referential feature vectors. A multitask learning framework is used to combine the two tasks of coreference resolution and named entity recognition. The two tasks, named entity recognition and coreference resolution, can learn from each other and get better at each other by sharing parameters in the underlying network. Extensive experiments are conducted to verify the superior performance of the proposed model.
- coreference resolution,
- named entity recognition,
- information extraction,
- entity disambiguation,
- multitask learning

FullText(HTML)

References(27)

References

[1]	DODDINGTON G R, MITCHELL A, PRZYBOCKI M A, et al. The automatic content extraction (ACE) program-tasks, data, and evaluation[C]//Proceedings of the International Conference on Language Resources and Evaluation. Brussels: European Language Resources Association, 2004: 837-840.
[2]	HOBBS J R. Resolving pronoun references[J]. Lingua, 1978, 44(4): 311-338. doi: 10.1016/0024-3841(78)90006-2
[3]	GE N, HALE J, CHARNIAK E. A statistical approach to anaphora resolution[C]//Proceedings of the 6th Workshop on Very Large Corpora. Stroudsbury: Association for Computational Linguistics, 1998: 161-170.
[4]	ZHENG J P, CHAPMAN W W, MILLER T A, et al. A system for coreference resolution for the clinical narrative[J]. Journal of the American Medical Informatics Association, 2012, 19(4): 660-667. doi: 10.1136/amiajnl-2011-000599
[5]	ZHANG R, DOS SANTOS C N, YASUNAGA M, et al. Neural coreference resolution with deep biaffine attention by joint mention detection and mention clustering[C]//Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics. Stroudsburg: Association for Computational Linguistics, 2018: 2450-2355.
[6]	LEE K, HE L H, ZETTLEMOYER L. Higher-order coreference resolution with coarse-to-fine inference[EB/OL]. (2018-04-15)[2022-08-01].
[7]	JOSHI M, LEVY O, WELD D S, et al. BERT for coreference resolution: Baselines and analysis[EB/OL]. (2019-08-24)[2022-08-01].
[8]	LAPPIN S, LEASS H J. An algorithm for pronominal anaphora resolution[J]. Computational Linguistics, 1994, 20(4): 535-561.
[9]	DAGAN I, ITAI A. Automatic acquisition of constraints for the resolution of anaphoric references and syntactic ambiguities[C]//Proceedings of the 28th Annual Meeting of the Association for Computational Linguistics. Stroudsburg: Association for Computational Linguistics, 1990: 122-129.
[10]	CLARK K, MANNING C D. Improving coreference resolution by learning entity-level distributed representations[C]//Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics. Stroudsburg: Association for Computational Linguistics, 2016: 1231-1241.
[11]	LEE K, HE L, LEWIS M, et al. End-to-end neural coreference resolution[C]//Proceedings of the Conference on Empirical Methods in Natural Language Processing. Stroudsbury: Association for Computational Linguistics, 2017: 561-570.
[12]	WU W, WANG F, YUAN A, et al. CorefQA: Coreference resolution as query-based span prediction[C]//Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Stroudsbury: Association for Computational Linguistics, 2020: 6953-6963.
[13]	LUAN Y, WADDEN D, HE L H, et al. A general framework for information extraction using dynamic span graphs[EB/OL]. (2019-05-05)[2022-08-01].
[14]	GANDHI N, FIELD A, TSVETKOV Y. Improving span representation for domain-adapted coreference resolution[EB/OL]. (2021-09-20)[2022-08-01].
[15]	付健, 孔芳. 融入结构化信息的端到端中文指代消解[J]. 计算机工程, 2020, 46(1): 45-51. FU J, KONG F. End to end Chinese coreference resolution with structural information[J]. Computer Engineering, 2020, 46(1): 45-51(in Chinese).
[16]	AL-RFOU R, PEROZZI B, SKIENA S. Polyglot: Distributed word representations for multilingual NLP[EB/OL]. (2013-07-05)[2022-08-01].
[17]	JIANG F, COHN T. Incorporating syntax and semantics in coreference resolution with heterogeneous graph attention network[C]// Proceedings of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Stroudsburg: Association for Computational Linguistics, 2021: 1584-1591.
[18]	HUANG Z H, XU W, YU K. Bidirectional LSTM-CRF models for sequence tagging[EB/OL]. (2015-08-09)[2022-08-01].
[19]	LIU Y H, OTT M, GOYAL N, et al. RoBERTa: A robustly optimized BERT pretraining approach[EB/OL]. (2019-07-26)[2022-08-01].
[20]	LI J, WANG X, TU Z P, et al. On the diversity of multi-head attention[J]. Neurocomputing, 2021, 454: 14-24. doi: 10.1016/j.neucom.2021.04.038
[21]	HUANG G, LIU Z, VAN DER MAATEN L, et al. Densely connected convolutional networks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE Press, 2017: 2261-2269.
[22]	LAFFERTY J, MCCALLUM A, PEREIRA F. Conditional random fields: Probabilistic models for segmenting and labeling sequence data[C]//Proceedings of the Eighteenth International Conference on Machine Learning. Williamstown: Morgan Kaufmann, 2001: 282-289.
[23]	RAM R V S, AKILANDESWARI A, DEVI S L. Linguistic features for named entity recognition using CRFs[C]//Proceedings of the International Conference on Asian Language Processing. Piscataway: IEEE Press, 2010: 158-161.
[24]	AUGENSTEIN I, DAS M, RIEDEL S, et al. SemEval 2017 task 10: ScienceIE-extracting keyphrases and relations from scientific publications[EB/OL]. (2017-04-10)[2022-08-01].
[25]	GÁBOR K, BUSCALDI D, SCHUMANN A K, et al. SemEval2018 task 7: Semantic relation extraction and classification in scientific papers[C]//Proceedings of the 12th International Workshop on Semantic Evaluation. Stroudsburg: Association for Computational Linguistics, 2018: 679-688.
[26]	MA X Z, HOVY E. End-to-end sequence labeling via bi-directional LSTM-CNNs-CRF[EB/OL]. (2016-03-04)[2022-08-01].
[27]	ZHOU R J, HU Q, WAN J, et al. WCL-BBCD: A contrastive learning and knowledge graph approach to named entity recognition[EB/OL]. (2022-03-14)[2022-08-01].

Relative Articles

[1]	QIAN P F，QIN G L，CHEN Q L，et al. A recognition method of radio fuze signal based on supervised contrastive learning[J]. Journal of Beijing University of Aeronautics and Astronautics，2025，51（3）：953-961 （in Chinese）. doi: 10.13700/j.bh.1001-5965.2023.0128.
[2]	HAN H，MENG T T. Aspect sentiment triple extraction for grammar-weighted graph text[J]. Journal of Beijing University of Aeronautics and Astronautics，2024，50（2）：409-418 （in Chinese）. doi: 10.13700/j.bh.1001-5965.2022.0443.
[3]	ZHANG Y T，LI Q Y，LIU S K. Tabular subordination relation extraction based on graph convolutional networks[J]. Journal of Beijing University of Aeronautics and Astronautics，2024，50（4）：1308-1315 （in Chinese）. doi: 10.13700/j.bh.1001-5965.2022.0382.
[4]	DENG Yupeng, GUO Fang, WANG Rong, SONG Zhenfeng. A referring image segmentation method based on bidirectional vision-language interaction module[J]. Journal of Beijing University of Aeronautics and Astronautics. doi: 10.13700/j.bh.1001-5965.2024.0462
[5]	HAI Chao, TIAN Xin, ZHANG Hong, TAN Da-long, HE Yi-xin, MENG Fan-yong, YANG Min. A Deep Learning-Based Dual-Domain Information Method for CT Metal Artifact Reduction[J]. Journal of Beijing University of Aeronautics and Astronautics. doi: 10.13700/j.bh.1001-5965.2023.0753
[6]	TANG Rong-chuan, XU Qiu-cheng, TANG Wen-yi, ZHAI Fei-fei, ZHOU Yu. Multilingual knowledge graph completion without aligned entity pairs[J]. Journal of Beijing University of Aeronautics and Astronautics. doi: 10.13700/j.bh.1001-5965.2023.0709
[7]	ZOU Y B，LI T，CHEN M，et al. Indoor spatial layout estimation model based on multi-task supervised learning[J]. Journal of Beijing University of Aeronautics and Astronautics，2024，50（11）：3327-3337 （in Chinese）. doi: 10.13700/j.bh.1001-5965.2022.0834.
[8]	JI X，WU T X，WANG H G，et al. Attribute aggregation entity alignment based on multi-channel graph neural network[J]. Journal of Beijing University of Aeronautics and Astronautics，2024，50（9）：2791-2799 （in Chinese）. doi: 10.13700/j.bh.1001-5965.2022.0703.
[9]	YANG Rong-tai, SHAO Yu-bin, DU Qing-zhi, LONG Hua, QI Yu-ting, ZHANG Feng. Few-shot entity linking prediction based on Graph-Transformer network[J]. Journal of Beijing University of Aeronautics and Astronautics. doi: 10.13700/j.bh.1001-5965.2024.0023
[10]	NIU G C，WANG X N. A multi-task traffic scene detection model based on cross-attention[J]. Journal of Beijing University of Aeronautics and Astronautics，2024，50（5）：1491-1499 （in Chinese）. doi: 10.13700/j.bh.1001-5965.2022.0610.
[11]	WU T X，JI X，WANG H G，et al. Relation extraction based on fusion of graph structure and sequence features[J]. Journal of Beijing University of Aeronautics and Astronautics，2024，50（9）：2763-2771 （in Chinese）. doi: 10.13700/j.bh.1001-5965.2022.0706.
[12]	Xu Haoran, Xiang Yang, Ding Ling. Chinese Named Entity Recognition Method Based on Knowledge Enhancement from Large Language Models and Multi-feature Fusion[J]. Journal of Beijing University of Aeronautics and Astronautics. doi: 10.13700/j.bh.1001-5965.2024.0421
[13]	JI X，WU T X，YU T，et al. Power text information extraction based on multi-task learning[J]. Journal of Beijing University of Aeronautics and Astronautics，2024，50（8）：2461-2469 （in Chinese）. doi: 10.13700/j.bh.1001-5965.2022.0683.
[14]	RAN Hua-ming. Airborne sensor multi-task scheduling algorithm based on slide time window[J]. Journal of Beijing University of Aeronautics and Astronautics. doi: 10.13700/j.bh.1001-5965.2023.0488
[15]	HUANG Jun, FAN Hao-dong, HONG Xu-dong, LI Xue. Semantic information guided multi-label image classification[J]. Journal of Beijing University of Aeronautics and Astronautics. doi: 10.13700/j.bh.1001-5965.2023.0382
[16]	LI C，HE Y Z，HU Y. Characteristic model control of nutation target contact detumbling[J]. Journal of Beijing University of Aeronautics and Astronautics，2023，49（11）：2977-2988 （in Chinese）. doi: 10.13700/j.bh.1001-5965.2021.0798.
[17]	NI Wen-kai, PENG Shu-fan, DU Yan-hui. Identification of induced information for personalized recommendations based on knowledge graph[J]. Journal of Beijing University of Aeronautics and Astronautics. doi: 10.13700/j.bh.1001-5965.2023.0475
[18]	KE Zhi-jie, XU Guo-ning, CAI Rong, LI Yong-xiang, YANG Yan-chu. Optimization of Multitask Scheduling for Swarm UAV System with Charging Platform[J]. Journal of Beijing University of Aeronautics and Astronautics. doi: 10.13700/j.bh.1001-5965.2022.0414
[19]	LI Hui, ZHANG Xiaowei, ZHAO Xinpeng, LU Xinyu. Multi-label cooperative learning for cross domain person re-identification[J]. Journal of Beijing University of Aeronautics and Astronautics, 2022, 48(8): 1534-1542. doi: 10.13700/j.bh.1001-5965.2021.0600
[20]	JING Xin, WANG Huafeng, LIU Qianfeng, LUO Siwu, ZHANG Fan. Named entity recognition in nuclear power field based on ELMo-GCN[J]. Journal of Beijing University of Aeronautics and Astronautics, 2022, 48(12): 2556-2565. doi: 10.13700/j.bh.1001-5965.2021.0155

Supplements(0)

Cited By

Proportional views

Proportional views

通讯作者: 陈斌, bchen63@163.com

1.
沈阳化工大学材料科学与工程学院沈阳 110142

Figures(1) / Tables(8)

Get Citation

PDF

XML

Article Metrics

Article views(272) PDF downloads(10)

Coreference resolution based on graph structure and multitask learning

doi: 10.13700/j.bh.1001-5965.2022.0941

Abstract

References

Relative Articles

Proportional views

Catalog

通讯作者: 陈斌, bchen63@163.com

Article Metrics

Proportional views

Related

Coreference resolution based on graph structure and multitask learning

doi: 10.13700/j.bh.1001-5965.2022.0941

Abstract

References

Relative Articles

Proportional views

Catalog

通讯作者: 陈斌, bchen63@163.com

Article Metrics

Proportional views

Related

Export File

Citation

Format

Content