伍颖欣, 刘磊, 肖轶婷, 关远鹏. 基于改进注意力机制网络的电力设备图像识别[J]. 中国电机工程学报, 2025, 45(3): 870-883. DOI: 10.13334/j.0258-8013.pcsee.231362
引用本文: 伍颖欣, 刘磊, 肖轶婷, 关远鹏. 基于改进注意力机制网络的电力设备图像识别[J]. 中国电机工程学报, 2025, 45(3): 870-883. DOI: 10.13334/j.0258-8013.pcsee.231362
WU Yingxin, LIU Lei, XIAO Yiting, GUAN Yuanpeng. Power Device Image Recognition Based on Improved Attention Mechanism[J]. Proceedings of the CSEE, 2025, 45(3): 870-883. DOI: 10.13334/j.0258-8013.pcsee.231362
Citation: WU Yingxin, LIU Lei, XIAO Yiting, GUAN Yuanpeng. Power Device Image Recognition Based on Improved Attention Mechanism[J]. Proceedings of the CSEE, 2025, 45(3): 870-883. DOI: 10.13334/j.0258-8013.pcsee.231362

基于改进注意力机制网络的电力设备图像识别

Power Device Image Recognition Based on Improved Attention Mechanism

  • 摘要: 在复杂工作环境下,电力设备的有效图像识别和状态分析可提升其运行维护能力,降低潜在停电风险。然而,传统的电力设备图像识别方法存在目标与背景特征信息难以分辨和特征信息提取能力不足等问题。该文提出一种改进注意力机制网络的电力设备图像检测识别方法。该方法提出面向电力设备目标特征信息的预测策略:引入深度值的变化过程学习机制,提取图像深层语义信息;通过叠加卷积核和剔除前置网络池化层,以改进全局结构信息学习网络模块,获得富含细节特征且关联图像特征的电力设备图像先验信息,进一步采用基于长短期记忆网络(long short-term memory,LSTM)门控机制在不同层级图像特征信息上预测其电力设备目标特征信息,构建LSTM门控机制的注意力机制网络。此外,该方法提出深浅层特征信息交互策略:采用矩阵外积方式整合浅层特征信息与深层特征信息,使模型充分学习电力设备的多层次特征信息。相比于传统的电力设备图像识别方法,所提的改进方案可加强目标的细节特征识别和提取,精确区分背景与目标模糊界限信息,提升深浅层特征信息的交互能力,有效提高在复杂背景环境下目标识别的准确率。实验结果表明,针对绝缘子、变压器、断路器、输电线电杆以及输电线铁塔5种电力设备图像数据集,该文所提出的模型识别准确率达92%,比CvT模型高1.6%。

     

    Abstract: The effective image recognition and status analysis of power equipment can improve its operation and reduce the risk of potential power outages in working environment. However, traditional power equipment image recognition methods have problems which are difficult in distinguishing target and background. Apart from this, it is insufficient to extract feature information. An improved attention mechanism network based on image detection and recognition method for power equipment is proposed in this paper. This method proposes a prediction strategy for power equipment feature information: it obtains the prior information of power equipment images through a global structure information learning network module that is rich in detailed features and associated image feature information. At the same time, the learning mechanism of the change process of the depth value is introduced to extract the deep semantic information. Further, the gating mechanism of LSTM (long short-term memory) network is used to predict the feature information of power equipment on different levels of image feature information, constructing an attention mechanism network based on LSTM gating mechanism. In addition, this method proposes a deep and shallow feature information interaction strategy. The matrix outer product method is used to integrate the shallow feature information and deep feature information of the image, so that the model can fully learn the multi-level feature information of the power equipment image. Compared with the traditional power equipment image recognition method, the proposed improvement scheme can strengthen the recognition, accurately distinguish fuzzy boundary information about background and target, enhance the interaction ability of deep and shallow feature information, and improve the accuracy of target recognition in complex background environment effectively. The experimental result shows that the recognition accuracy of the model proposed in this paper reaches 92%, which is higher than that of CvT model.

     

/

返回文章
返回