Journal of Beijing University of Aeronautics and Astronautics, 2015, Vol. 41, Issue (8): 1476-1484. doi: 10.13700/j.bh.1001-5965.2014.0568

Heterogeneous data sharing technology based on two-layer metadata and ontology

LI Xiaotao, HU Xiaohui, LI Binquan

  1. School of Automation Science and Electrical Engineering, Beijing University of Aeronautics and Astronautics, Beijing 100191, China
  • Received: 2014-09-16 Online: 2015-08-20 Published: 2015-09-08
  • Corresponding author: HU Xiaohui (b. 1960), male, from Chengde, Hebei; professor; research interests: integration and optimized decision-making for intelligent systems, integrated information systems and integration technology. E-mail: hxh@iscas.ac.cn
  • About the author: LI Xiaotao (b. 1987), male, from Tangshan, Hebei; doctoral candidate. E-mail: taosmall@163.com
  • Supported by:
    National Natural Science Foundation of China (61273350)

Abstract: To address the difficulty of sharing multi-source, multi-class, heterogeneous data simultaneously, an information sharing technology is proposed that combines two-layer metadata with an ontology. First, the structure of the two-layer metadata is analyzed, and a way of describing multiple classes of heterogeneous data uniformly through it is introduced. Second, because metadata lacks semantic information and cannot describe the implicit relationships between data classes, an ontology layer is built on top of the metadata to describe the metadata semantically and to support ontology reasoning. Finally, for data retrieval, the Lucene full-text search engine is combined with the SPARQL (SPARQL Protocol and RDF Query Language) ontology query language: a SPARQL retrieval step is added to the keyword query process and performed before the Lucene keyword query, which improves the recall rate and optimizes retrieval time. Match data from the 2014-2015 UEFA Champions League season were selected as test data. The experimental results demonstrate the effectiveness of the approach for sharing heterogeneous data and its improvement in metadata query performance, in terms of recall and timeliness.

Key words: heterogeneous data, metadata, ontology, information sharing, semantic retrieval
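
The retrieval idea in the abstract, a semantic query step that runs before the keyword query, can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: a plain Python dictionary stands in for the ontology and its SPARQL query, a small inverted index stands in for Lucene, and every class name, record id, and function name here (`SUBCLASSES`, `RECORDS`, `expand`, `build_index`, `search`) is hypothetical.

```python
from collections import defaultdict

# Hypothetical ontology: each class maps to its direct subclasses.
SUBCLASSES = {
    "match_event": ["goal", "card"],
    "card": ["yellow_card", "red_card"],
}

# Hypothetical metadata records: record id -> descriptive text.
RECORDS = {
    "m1": "goal scored in the group stage",
    "m2": "yellow_card shown to a defender",
    "m3": "red_card in the second half",
    "m4": "stadium attendance report",
}


def expand(term):
    """Return the term plus all transitive subclasses (stand-in for a SPARQL query)."""
    seen, stack = [], [term]
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.append(current)
            stack.extend(SUBCLASSES.get(current, []))
    return seen


def build_index(records):
    """Build an inverted index from token to record ids (stand-in for Lucene)."""
    index = defaultdict(set)
    for record_id, text in records.items():
        for token in text.lower().split():
            index[token].add(record_id)
    return index


def search(index, term, use_ontology=True):
    """Keyword search, optionally preceded by ontology-based term expansion."""
    terms = expand(term) if use_ontology else [term]
    hits = set()
    for t in terms:
        hits |= index.get(t, set())
    return sorted(hits)


INDEX = build_index(RECORDS)
print(search(INDEX, "card", use_ontology=False))  # keyword query only
print(search(INDEX, "card", use_ontology=True))   # semantic step first
```

In this toy data, a plain keyword query for `card` matches no record, while the ontology-expanded query also reaches the records tagged with its subclasses, which is the kind of recall gain the abstract describes.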
