植物领域知识图谱构建中本体非分类关系提取方法
详细信息    查看全文 | 推荐本文 |
  • 英文篇名:Research on Ontology Non-taxonomic Relations Extraction in Plant Domain Knowledge Graph Construction
  • 作者:赵明 ; 杜亚茹 ; 杜会芳 ; 张家军 ; 王红说 ; 陈瑛
  • 英文作者:Zhao Ming;Du Yaru;Du Huifang;Zhang Jiajun;Wang Hongshuo;Chen Ying;College of Information and Electrical Engineering,China Agricultural University;
  • 关键词:植物领域本体 ; 知识图谱 ; 非分类关系 ; 本体学习 ; 百度百科
  • 英文关键词:plant domain ontology;;knowledge graph;;non-taxonomic relation;;ontology learning;;Baidu Encyclopedia
  • 中文刊名:NYJX
  • 英文刊名:Transactions of the Chinese Society for Agricultural Machinery
  • 机构:中国农业大学信息与电气工程学院;
  • 出版日期:2016-06-20 10:55
  • 出版单位:农业机械学报
  • 年:2016
  • 期:v.47
  • 基金:国家自然科学基金项目(61503386)
  • 语种:中文;
  • 页:NYJX201609038
  • 页数:7
  • CN:09
  • ISSN:11-1964/S
  • 分类号:283-289
摘要
采用本体学习的方法,以百度百科植物类词条内容的非结构和半结构化中文文本信息作为语料进行处理。使用一种有指导的基于依存句法分析的词汇-语法模式来获取植物领域的概念、分类和非分类关系,并分别利用基于词表过滤的方法和给模式添加限制的方法,较大程度地提高了关系抽取的精确度,完成在轻量级本体的基础上自动构建重量级本体。该方法建立了一个特定领域语料的概念层次,提高了最具代表性的分类和非分类关系的发现,并使用OWL语言形式化表达抽取结果。实验表明,该方法在非分类关系抽取上取得了较好的结果,为该领域知识图谱构建奠定了基础。
        In order to provide more specific knowledge and technology of plant field,the main task of KG( knowledge graph) is to extract a wealth of concepts and relationships. Due to the relation extraction is the most difficult in KG construction,this paper makes use of ontology learning,and proposes a nontaxonomic relation learning method to obtain representative concepts and their relations from unstructured and semi-structured texts of Baidu Encyclopedia entry content by using lexicon-syntactic patterns based on dependency grammar analysis. Moreover,the methods of adding constraint models and words filtering were adopted to build heavy weight ontology automatically based on a lightweight ontology and greatly improved the precision of the relation extraction. The approach established a concept structure from the plant domain corpus,ameliorated the discovery of the most representative non-taxonomic relation,and formalized them in the standardized OWL 2. 0. A set of experiments was performed using the approach implemented in the plant domain. The results indicated that extraction by patterns should be performed directly after natural language processing,which has a comparatively high accuracy compared to the former algorithms,and this approach can extract non-taxonomic relations with high effectiveness,which lays the foundation for KG construction of plant field.
引文
1王昊奋.大规模知识图谱技术[EB/OL].(2014-06-12)http:∥www.China-cloud.com/zhongyunxy/20140612_38070.html.
    2 DESHPANDE O,LAMBA D S,TOURN T,et al.Building,maintaining,and using knowledge bases:a report from the trenches[C]∥2013 SIGMOD'13,2013:1209-1220.
    3程童凌,李娟子.基于维基类百科知识资源的实体关系发现和语标注[J].电子技术与软件工程,2015(18):170-173.
    4 MAEDCHE A,STAAB S.Ontology learning for the semantic web[J].IEEE,Intelligent Systems,2001,16(2):72-79.
    5 WONG W,LIU W,BENNAMOUN M.Ontology learning from text:a look back and into the future[J].Acm Computing Surveys,2012,44(4):1-36.
    6廖福燕.本体构建中概念和关系获取方法研究[D].西安:西安建筑科技大学,2011.LIAO Fuyan.Research on domain ontology concept and relation acquisition[D].Xi'an:Xi'an University of Architecture and Technology,2011.(in Chinese)
    7谷俊,严明,王昊.基于改进关联规则的本体关系获取研究[J].情报理论与实践,2011,34(12):121-125.GU Jun,YAN Ming,WANG Hao.Research on ontology relation extraction based on improved association rule[J].Information Studies,2011,34(12):121-125.(in Chinese)
    8舒万里.中文领域本体学习中概念和关系抽取的研究[D].重庆:重庆大学,2012.SHU Wanli.Research on concept and relation extraction of Chinese domain ontology[D].Chongqing:Chongqing University,2012.(in Chinese)
    9胡云飞.本体学习中关系获取的研究[D].西安:西安建筑科技大学,2012.HU Yunfei.Research on relations acquisition of ontology learning[D].Xi'an:Xi'an University of Architecture and Technology,2012.(in Chinese)
    10邱桃荣,黄海泉,段文影,等.非分类关系学习的粒计算模型研究[J].南昌大学学报:工科版,2012,34(3):273-278.QIU T R,HUANG H Q,DUAN W Y,et al.Research on granular computing model for non-taxonomic relations learning[J].Journal of Nanchang University,2012,34(3):273-278.(in Chinese)
    11梁吉震.基于领域概念知识的非分类关系学习研究[D].长春:吉林大学,2012.LIANG Jizhen.Research on non-taxonomic relationships learning based on domain concept knowledge[D].Changchun:Jilin University,2012.(in Chinese)
    12 WEICHSELBRAUN A,WOHLGENANNT G,SCHARL A.Refining non-taxonomic relation labels with external structured data t support ontology learning[J].Data&Knowledge Engineering,2010,69(8):763-778.
    13向阳,张波,韩婕.Agent驱动的中文本体智能构建研究[J].计算机工程与应用,2009,45(10):133-137.XIANG Yang,ZHANG Bo,HAN Jie.Agent driven intelligent construction of Chinese ontology[J].Computer Engineering and Appfication,2009,45(10):133-137.(in Chinese)
    14叶琼.农业领域本体知识云化方法研究[D].合肥:安徽农业大学,2012.YE Qiong.Research on cloudization method of agricultural ontology knowledge[D].Hefei:Anhui Agricultural University,2012(in Chinese)
    15邓子平.面向医学诊疗的本体自动生成系统的研究与开发[D].广州:广东工业大学,2011.DENG Ziping.Research and development of a ontology automatic generation system oriented medical diagnosis[D].Guangzhou Guangdong University of Technology,2011.(in Chinese)
    16马莉,陈志新.基于网站结构的领域本体学习方法[J].计算机光盘软件与应用,2014(16):83,85.MA Li,CHEN Zhixin.Domain ontology learning mehtod based on structure of the site[J].Computer CD Software and Applications,2014(16):83,85.(in Chinese)
    17王红,高斯婷,潘振杰,等.基于NNV关联规则的非分类关系提取方法及其应用研究[J].计算机应用研究,2012,29(10)3665-3668.WANG Hong,GAO Siting,PAN Zhenjie,et al.Application and research of non-taxonimic relation extraction method based on NNV association rule[J].Application Research of Computers,2012,29(10):3665-3668.(in Chinese)
    18 SNCHEZ D,MORENO A.Learning non-taxonomic relationships from web documents for domain ontology construction[J].Dat&Knowledge Engineering,2008,63(3):600-623.
    19 SERRA I,GIRARDI R,NOVAIS P.Evaluating techniques for learning non-taxonomic relationships of ontologies from text[J]Expert Systems with Applications,2014,41(11):5201-5211.
    20 CHE W,LI Z,LIU T.LTP:a Chinese language technology platform[C]∥Proceedings of the 23rd International Conference on Computational Linguistics:Demonstrations,2010:13-16.
    21 ZOUAQ A,GASEVIC D,HATALA M.Linguistic patterns for information extraction in OntoC maps[C]∥Proceedings of the 3rd Workshop on Ontology Patterns,2012:1-12.
    22古凌岚,孙素云.基于语义依存的中文本体非分类关系抽取方法[J].计算机工程与设计,2012,33(4):1676-1680.GU Linglan,SUN Suyun.Approach to Chinese ontology non-taxonomic relation extraction based on semantic dependency[J]Computer Engineering and Design,2012,33(4):1676-1680.(in Chinese)