王书鸿, 郑少明, 刘中硕, 刘一民, 董鹏, 陶畅, 于逸廷, 薛安成. 面向某地区电网继电保护装置缺陷知识图谱构建的实体关系抽取[J]. 电网技术, 2023, 47(5): 1874-1885. DOI: 10.13335/j.1000-3673.pst.2022.2110
引用本文: 王书鸿, 郑少明, 刘中硕, 刘一民, 董鹏, 陶畅, 于逸廷, 薛安成. 面向某地区电网继电保护装置缺陷知识图谱构建的实体关系抽取[J]. 电网技术, 2023, 47(5): 1874-1885. DOI: 10.13335/j.1000-3673.pst.2022.2110
WANG Shuhong, ZHENG Shaoming, LIU Zhongshuo, LIU Yimin, DONG Peng, TAO Chang, YU Yiting, XUE Ancheng. Entity Relation Extraction for Construction of Relay Protection Device Defect Knowledge Graph in Some Certain Area Power Grid[J]. Power System Technology, 2023, 47(5): 1874-1885. DOI: 10.13335/j.1000-3673.pst.2022.2110
Citation: WANG Shuhong, ZHENG Shaoming, LIU Zhongshuo, LIU Yimin, DONG Peng, TAO Chang, YU Yiting, XUE Ancheng. Entity Relation Extraction for Construction of Relay Protection Device Defect Knowledge Graph in Some Certain Area Power Grid[J]. Power System Technology, 2023, 47(5): 1874-1885. DOI: 10.13335/j.1000-3673.pst.2022.2110

面向某地区电网继电保护装置缺陷知识图谱构建的实体关系抽取

Entity Relation Extraction for Construction of Relay Protection Device Defect Knowledge Graph in Some Certain Area Power Grid

  • 摘要: 继电保护装置缺陷时有发生,威胁其作为第一道防线的可靠性,进而影响电网安全稳定运行。随着电网规模扩大,保护装置数量增加,现场运维工作需要智能化辅助减小运维压力;同时,保护装置历史缺陷数据累积,蕴含有助于辅助运维的信息有待文本挖掘。有鉴于此,该文以某地区电网实际继电保护装置缺陷数据和专业词典为基础,提出了一种面向继电保护装置缺陷知识图谱构建的实体关系抽取方法。首先,以传统本体构建思想为基础,借助词频–逆文档频率(term frequency-inverse document frequency,TF-IDF)方法,抽取了继电保护装置领域术语,并定义了概念及其间关系,实现了继电保护装置缺陷本体构建;其次,根据继电保护装置缺陷记录文本特征,提出了基于语法规则的实体关系抽取方法,实现了近邻词位实体的关系抽取;再次,提出了基于远程监督学习的实体关系抽取方法,实现了长间隔词位实体关系抽取;进一步,利用关系抽取可信度评价指标,实现了综合语法规则和远程监督的实体关系抽取;最后,利用Protégé对装置缺陷本体及关系抽取结果做了局部可视化展示。

     

    Abstract: Defects of the relay protection devices occur from time to time, threatening its reliability as the first line of defense, thereby affecting the safe and stable operation of the electric power grid. With the expansion of the power grid and the increase in the number of the repay protection devices, the operation and maintenance require proper intelligent assistance to reduce the work pressure. In the meantime, the accumulation of the historical defect data of the protection devices contains a lot of information which may be useful for the data mining of the intelligent assistance. In view of this, this paper proposes an entity relation extraction for the construction of the relay protection device defect knowledge graph based on the actual relay protection device defect data in a certain area and the professional dictionaries. Firstly, based on the traditional ontology construction idea and the TF-IDF(term frequency - inverse document frequency) method, the terminology in the relay protection field is extracted and the concepts and their relationships are defined so as to realize the construction of the ontology for the relay protection device defect. Secondly, according to the text features of the relay protection device defect records, an entity relationship extraction based on the grammar rules is proposed, which realizes the adjacent lexeme entity relationship extraction. Thirdly, an entity relationship extraction based on the remote supervised learning is proposed, which achieves the entity relationship extraction from the long-spaced lexeme. Further, combined with the relationship extraction evaluation, the entity relationship extraction with the comprehensive grammar rules and the remote supervision is realized. Finally, the examples of the ontology and the relationship extraction for the defect of the relay protection device are partially visualized using the Protégé.

     

/

返回文章
返回