荆澜涛, 张野, 张彬, 姚晔, 许东, 王亮. 基于改进LightGBM混合集成模型的变压器故障识别方法[J]. 高电压技术, 2024, 50(12): 5289-5300. DOI: 10.13336/j.1003-6520.hve.20231800
引用本文: 荆澜涛, 张野, 张彬, 姚晔, 许东, 王亮. 基于改进LightGBM混合集成模型的变压器故障识别方法[J]. 高电压技术, 2024, 50(12): 5289-5300. DOI: 10.13336/j.1003-6520.hve.20231800
JING Lantao, ZHANG Ye, ZHANG Bin, YAO Ye, XU Dong, WANG Liang. Transformer Fault Identification Method Based on Improved LightGBM Hybrid Integration Model[J]. High Voltage Engineering, 2024, 50(12): 5289-5300. DOI: 10.13336/j.1003-6520.hve.20231800
Citation: JING Lantao, ZHANG Ye, ZHANG Bin, YAO Ye, XU Dong, WANG Liang. Transformer Fault Identification Method Based on Improved LightGBM Hybrid Integration Model[J]. High Voltage Engineering, 2024, 50(12): 5289-5300. DOI: 10.13336/j.1003-6520.hve.20231800

基于改进LightGBM混合集成模型的变压器故障识别方法

Transformer Fault Identification Method Based on Improved LightGBM Hybrid Integration Model

  • 摘要: 针对变压器故障识别方法在处理不均衡故障数据时存在较大偏差的问题,构建了一种基于改进轻量级梯度提升机的混合集成模型,用以变压器故障识别。首先,提出一种结合梯度调和损失函数和交叉熵损失函数的改进轻量级梯度提升机(gradient harmonizing mechanism loss and cross entropy loss improved light gradient boosting machine,GCLightGBM),提升模型对数据集中少数样本的关注度。然后,针对GCLightGBM中参数特异性取值影响模型识别能力的问题,提出一种基于GCLightGBM的混合集成模型,进一步提高其准确率的同时,确保模型对现实多变不均衡数据集依然保持良好的准确率。实验结果表明,GCLightGBM可有效解决少数类样本准确率低的问题,整体准确率高达0.911。且针对其他多变不均衡数据集,基于GCLightGBM混合集成模型故障识别方法平均准确率高达0.988。

     

    Abstract: There is a significant bias when imbalanced fault data are dealt with by transformer fault identification methods. To solve this problem, we have built a hybrid ensemble model based on an improved lightweight gradient boosting machine. First, we propose a gradient harmonizing mechanism loss and cross entropy loss improved light gradient boosting machine (GCLightGBM). This approach enhances the model's attention to minority samples in the dataset. Then, to address the issue of parameter specificity affecting the model's recognition capability in GCLightGBM, we propose a hybrid ensemble model based on GCLightGBM. This model further improves accuracy while ensuring good performance on real-world varied and imbalanced datasets. The experimental results show that GCLightGBM can effectively solve the problem of low accuracy of minority samples, and the overall accuracy is as high as 0.911. In addition, for other variable and unbalanced datasets, the fault identification method based on GCLightGBM hybrid ensemble model has an average accuracy of 0.988.

     

/

返回文章
返回