赵振兵, 吕雪纯, 苗思雨, 赵文清, 翟永杰, 张珂. 基于共有特征评分的解耦知识蒸馏输电线路金具检测方法[J]. 高电压技术, 2024, 50(12): 5229-5237. DOI: 10.13336/j.1003-6520.hve.20231541
引用本文: 赵振兵, 吕雪纯, 苗思雨, 赵文清, 翟永杰, 张珂. 基于共有特征评分的解耦知识蒸馏输电线路金具检测方法[J]. 高电压技术, 2024, 50(12): 5229-5237. DOI: 10.13336/j.1003-6520.hve.20231541
ZHAO Zhenbing, LYU Xuechun, MIAO Siyu, ZHAO Wenqing, ZHAI Yongjie, ZHANG Ke. Inspection Method for Decoupled Knowledge Distillation Transmission Line Fittings Based on Shared Feature Scoring[J]. High Voltage Engineering, 2024, 50(12): 5229-5237. DOI: 10.13336/j.1003-6520.hve.20231541
Citation: ZHAO Zhenbing, LYU Xuechun, MIAO Siyu, ZHAO Wenqing, ZHAI Yongjie, ZHANG Ke. Inspection Method for Decoupled Knowledge Distillation Transmission Line Fittings Based on Shared Feature Scoring[J]. High Voltage Engineering, 2024, 50(12): 5229-5237. DOI: 10.13336/j.1003-6520.hve.20231541

基于共有特征评分的解耦知识蒸馏输电线路金具检测方法

Inspection Method for Decoupled Knowledge Distillation Transmission Line Fittings Based on Shared Feature Scoring

  • 摘要: 输电线路金具航拍图像具有目标占比小和背景过于复杂的特点,为了将对金具目标检测性能优越的教师网络性能迁移到学生网络,提出了一种基于共有特征评分的解耦知识蒸馏输电线路金具检测方法。首先,根据Ground-truth框将图像特征解耦为前景和背景区域,在前景区域对教师模型特征金字塔中每一层特征信息分类评分,聚合所有类别分类分数作为共有特征蒸馏掩码。其次,为了保证整幅图像的完整性,采用GcBlock模块捕获金具目标与背景、其他金具之间的关系,以此将教师模型生成的图像特征知识完整地迁移到学生网络。最后,通过自建金具图像检测数据集验证该文提出方法的有效性,实验结果表明:将该文方法应用于双阶段、单阶段模型均可大幅提升参数量小的网络检测性能,Faster R-CNN、RetinaNet的学生网络交并比阈值为0.5的平均精度(mAP_50)分别提升了25.9%、31.4%,有的甚至超过教师网络检测精度;针对防震锤、调整板、重锤目标,Cascade R-CNN模型中学生网络的平均精度显著提升。该文研究方法实现了对金具目标的高效检测,达到了检测性能与资源消耗的平衡。

     

    Abstract: Aerial images of transmission line fittings have the characteristics of small target proportion and excessively complex background. In order to transfer the performance of a teacher network which has excellent performance in detecting fitting targets to a student network, a decoupled knowledge distillation method based on shared feature scoring for transmission line fitting detection is proposed. Firstly, the image features are decoupled into foreground and background regions according to the Ground-truth box. In the foreground region, the feature information of each layer in the teacher model's feature pyramid is classified and scored, and the classification scores of all categories are aggregated as the common feature distillation mask. Secondly, in order not to maintain the integrity of the whole image, the GcBlock module is used to capture the relationship among the object of the fittings, the background and other fittings, so as to transfer the image feature knowledge generated by the teacher model to the student network completely. Finally, the validity of the proposed method is verified by a self-built fitting image detection data set. The experimental results show that, by applying this method to both two-stage and single-stage models, the detection performance of the network with a small number of participants can be greatly improved. The mean average precision of student network at 0.5 intersection over union of Faster R-CNN and RetinaNet can be improved by 25.9% and 31.4%, respectively, surpassing even the detection accuracy of the teacher network. The cascade R-CNN's student network has significantly improved targets for the shock hammer, adjustment board, and weight. The method in this paper can be adopted to realize the high efficiency detection of the fitting target, and to achieve the balance between detection performance and resource consumption.

     

/

返回文章
返回