谢庆, 王春鑫, 张雨桐, 刘景立, 谢晨昊, 郑炎, 律方成, 谢军. 基于选择性标注与样本平衡的局部放电模式识别在线学习方法[J]. 中国电机工程学报, 2025, 45(2): 778-790. DOI: 10.13334/j.0258-8013.pcsee.232282
引用本文: 谢庆, 王春鑫, 张雨桐, 刘景立, 谢晨昊, 郑炎, 律方成, 谢军. 基于选择性标注与样本平衡的局部放电模式识别在线学习方法[J]. 中国电机工程学报, 2025, 45(2): 778-790. DOI: 10.13334/j.0258-8013.pcsee.232282
XIE Qing, WANG Chunxin, ZHANG Yutong, LIU Jingli, XIE Chenhao, ZHENG Yan, LYU Fangcheng, XIE Jun. An Online Learning Method for Pattern Recognition of Partial Discharge Based on Selective Labeling and Sample Balancing[J]. Proceedings of the CSEE, 2025, 45(2): 778-790. DOI: 10.13334/j.0258-8013.pcsee.232282
Citation: XIE Qing, WANG Chunxin, ZHANG Yutong, LIU Jingli, XIE Chenhao, ZHENG Yan, LYU Fangcheng, XIE Jun. An Online Learning Method for Pattern Recognition of Partial Discharge Based on Selective Labeling and Sample Balancing[J]. Proceedings of the CSEE, 2025, 45(2): 778-790. DOI: 10.13334/j.0258-8013.pcsee.232282

基于选择性标注与样本平衡的局部放电模式识别在线学习方法

An Online Learning Method for Pattern Recognition of Partial Discharge Based on Selective Labeling and Sample Balancing

  • 摘要: 训练与检测样本分布不一致是深度学习方法对现场新增局部放电(partial discharge,PD) (简称“局放”)识别准确率低的主要原因,为实现局放模式识别对现场数据分布变化的持续适配、降低样本标注工作量,该文提出一种局部放电模式识别在线学习方法。首先,以局放模式识别模型各层特征为信息来源,利用推理模型区分处于训练集分布内外的新增局放样本,并分别采用软标签及人工方式对两种样本进行在线标注;其次,为平衡训练集分布内外样本的数量、提升新增样本识别准确率,采用基于梯度惩罚的条件式wasserstein距离生成对抗网络(conditional wasserstein generative adversarial network with gradient penalty,CWGAN-GP)扩充两类局放样本,并以联合训练的方式更新局放模式识别模型。利用实验及现场采样得到的局放样本对所提方法进行验证,结果表明,所提方法标注工作量降低66.68%,在线学习结束后,新增样本集与训练集分布相同时识别准确率可提升4.61%,分布不同时识别准确率最低提升22.27%。

     

    Abstract: The inconsistency between the distributions of training and testing samples is a major factor contributing to the low accuracy of on-site partial discharge (PD) recognition in deep learning methods. In order to achieve continuous adaptation of PD pattern recognition to changes in data distribution and reduce the workload associated with sample labeling, an online learning method for PD pattern recognition is proposed. First, leveraging features from different layers of the PD pattern recognition model, an inference model is employed to distinguish newly added on-site PD samples within and outside the training set distribution. Soft labels and manual annotation are utilized for online labeling of these two types of samples. Then, to balance the quantity of samples within and outside the training set distribution and enhance the recognition accuracy of newly added samples, the paper employs a conditional wasserstein generative adversarial network with gradient penalty (CWGAN-GP) to augment both types of PD samples. This augmentation is integrated into the joint training of the PD pattern recognition model. Experimental validation using PD samples obtained from both laboratory experiments and on-site measurements demonstrates that the proposed method reduces the labeling workload by 66.68%. After online learning, the recognition accuracy for newly added samples within the training set distribution improves by 4.61%, and for samples outside the distribution, the minimum improvement in accuracy is 22.27%.

     

/

返回文章
返回