束嘉伟, 杨挺, 耿毅男, 于洁. 面向电力知识图谱构建的重叠实体关系联合抽取方法[J]. 高电压技术, 2024, 50(11): 4912-4922. DOI: 10.13336/j.1003-6520.hve.20230772
引用本文: 束嘉伟, 杨挺, 耿毅男, 于洁. 面向电力知识图谱构建的重叠实体关系联合抽取方法[J]. 高电压技术, 2024, 50(11): 4912-4922. DOI: 10.13336/j.1003-6520.hve.20230772
SHU Jiawei, YANG Ting, GENG Yinan, YU Jie. Joint Extraction Method for Overlapping Entity Relationships in the Construction of Electric Power Knowledge Graph[J]. High Voltage Engineering, 2024, 50(11): 4912-4922. DOI: 10.13336/j.1003-6520.hve.20230772
Citation: SHU Jiawei, YANG Ting, GENG Yinan, YU Jie. Joint Extraction Method for Overlapping Entity Relationships in the Construction of Electric Power Knowledge Graph[J]. High Voltage Engineering, 2024, 50(11): 4912-4922. DOI: 10.13336/j.1003-6520.hve.20230772

面向电力知识图谱构建的重叠实体关系联合抽取方法

Joint Extraction Method for Overlapping Entity Relationships in the Construction of Electric Power Knowledge Graph

  • 摘要: 作为构建电力知识图谱的关键步骤,知识抽取可以从海量非结构化电力文本中准确抽取出实体和关系。但是,传统流水线式方法存在识别的错误信息后向传递、实体识别和关系抽取任务割裂以及易产生冗余信息的问题,进而导致抽取准确率低、抽取信息不全面,最终影响知识图谱的准确构建。针对上述问题,提出面向电力知识图谱构建的重叠实体关系联合抽取方法,通过改进的序列标注方案进行联合抽取,构建了电力领域专属预训练PowerRoberta模型,并增加对抗训练,提高了模型抽取电力知识的准确度和对陌生信息的预测能力。最后,以实际变电站巡检数据为例进行了实验分析与配电知识图谱可视化构建,结果表明所提出的联合抽取方法提升了知识抽取的准确率,准确率达到91.67%,可有效支撑配电网智能信息检索、辅助决策高级应用。

     

    Abstract: As a key step in building a power knowledge graph, knowledge extraction can accurately extract entities and relationships from massive unstructured power texts. However, the traditional pipeline method has the problems of backward transmission of error information, separation of entity recognition, and relationship extraction tasks, and is easy to generate redundant information, which results in low extraction accuracy, incomplete extraction of information, and ultimately impairs the accurate construction of the knowledge graph. To solve the above problems, this paper proposes a joint extraction method of overlapping entity relationships for the construction of the power knowledge graph. Through the improved sequence labeling scheme, the joint extraction is carried out, the exclusive pre-training model (the PowerRobertsa model) in the power field is constructed, and the confrontation training is increased, which improves the accuracy of the model extraction of power knowledge and the ability to predict unfamiliar information. Finally, by taking the actual substation patrol data as an example, the experimental analysis and the visual construction of the distribution Knowledge graph are carried out. The results show that the joint extraction method proposed in this paper can be adopted to improve the accuracy of knowledge extraction, which reaches 91.67%, and can effectively support the advanced application of distribution network intelligent information retrieval and decision-making assistance.

     

/

返回文章
返回