北京航空航天大学学报 ›› 2015, Vol. 41 ›› Issue (6): 1080-1086.doi: 10.13700/j.bh.1001-5965.2014.0419

• 论文 • 上一篇    下一篇

基于本体的故障案例信息抽取方法研究

柯倩云1, 李青1, 孙勇2   

  1. 1. 北京航空航天大学 机械工程及自动化学院, 北京 100191;
    2. 中航工业成都飞机设计研究所 综保部, 成都 610000
  • 收稿日期:2014-07-11 出版日期:2015-06-20 发布日期:2015-07-30
  • 通讯作者: 李青(1961—),女,湖北黄梅人,教授,liqing@buaa.edu.cn,主要研究方向为装备保障信息化. E-mail:liqing@buaa.edu.cn
  • 作者简介:柯倩云(1989—),女,福建厦门人,硕士研究生,yjmymh2011@163.com

Fault case information extraction method research based on ontology

KE Qianyun1, LI Qing1, SUN Yong2   

  1. 1. School of Mechanical Engineering and Automation, Beijing University of Aeronautics and Astronautics, Beijing 100191, China;
    2. Comprehensive Security Department, AVIC Chengdu Aircraft Design Institute, Chengdu 610000, China
  • Received:2014-07-11 Online:2015-06-20 Published:2015-07-30

摘要: 以飞机维修保障中的经验知识积累和重用为目的,针对故障案例知识由于缺乏结构化、规范化描述,导致共享与重用困难的问题,对飞机故障案例的知识表达与信息抽取方法进行了研究.首先,根据飞机故障领域的特殊性以及知识共享和重用的实际需求,建立了飞机故障案例知识的本体模型;其次,利用中文分词工具以及文本工程通用框架(GATE),研究了对故障案例信息文档的语义标注以及基于规则的信息抽取技术;最后,利用Jena推理机挖掘出隐性信息,并实现在信息抽取过程中,通过不断发现新知识,主动扩展知识库.在此基础上开发了信息抽取原型系统,实现了从多种不同类型的文档信息中抽取出结构化故障案例信息,并利用数据库进行存储和管理,提高了故障案例知识的重用性,验证了研究方法的可行性.

关键词: 信息抽取, 本体, GATE, 知识管理, 故障案例

Abstract: To solve the accumulation and reusing problems of fault case knowledge that are described as unstructured and unnormalized information in the current maintenance support activities of aircraft, research on the knowledge representation and information extraction method of aircraft fault case was carried out. Firstly,ontology model of aircraft fault case knowledge was established according to the particularity of aircraft fault domain and the actual demand of knowledge sharing and reusing. Then with Chinese segmentation tools and general architecture for text engineering (GATE) frame, semantic annotation and rule based information extraction technology of aircraft fault case documents were studied. Finally, the hidden knowledge was discovered by using apache Jena inference engine, and knowledge base was expanded by the new knowledge found in the process of information extraction. Moreover, the prototype system for information extraction was developed and was used to extract structured fault case information from different types of documents, the information was then stored and managed by using database. This method was proved feasible to improve the reusability of fault case knowledge.

Key words: information extraction, ontology, general architecture for text engineering (GATE), knowledge management, fault case

中图分类号: 


版权所有 © 《北京航空航天大学学报》编辑部
通讯地址:北京市海淀区学院路37号 北京航空航天大学学报编辑部 邮编:100191 E-mail:jbuaa@buaa.edu.cn
本系统由北京玛格泰克科技发展有限公司设计开发