周震震, 宋云海, 何宇浩, 王黎伟, 黄和燕, 何珏, 朱志航, 闫云凤. 基于分组查询注意力的可扩展电力人员行为分类方法[J]. 中国电力, 2023, 56(11): 77-85. DOI: 10.11930/j.issn.1004-9649.202305046
ZHOU Zhenzhen, SONG Yunhai, HE Yuhao, WANG Liwei, HUANG Heyan, HE Jue, ZHU Zhihang, YAN Yunfeng. Extensible Classification Method for Power Personnel Behavior Based on Pose Estimation[J]. Electric Power, 2023, 56(11): 77-85. DOI: 10.11930/j.issn.1004-9649.202305046

Extensible Classification Method for Power Personnel Behavior Based on Grouped Query Attention

Extensible Classification Method for Power Personnel Behavior Based on Pose Estimation

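  • Abstract: Recognizing the behavior of power personnel is an important part of the safe operation and maintenance of power systems. Existing personnel behavior recognition algorithms mainly use support vector machines and multilayer perceptrons for behavior classification, and they suffer from low recognition accuracy, no consideration of the interactions between human skeletons, and poor transferability and generality. To address these problems, a behavior classification decoder based on self-attention and cross-attention mechanisms is proposed, which fully accounts for the associations between human skeletons. Its classification accuracy is 10%~20% higher than that of traditional classification methods and more than 2% higher than that of the deep learning multilayer perceptron (MLP) classification method. The method performs behavior recognition in two stages with an encoder-decoder architecture, so the decoder can be attached to the back end of any pose estimation network and is highly extensible. In addition, grouped decoding overcomes the quadratic complexity introduced by the attention mechanism, which allows the decoder to scale to more behavior categories and makes it more broadly applicable. Validated on a dataset of personnel images from substation working scenarios, the behavior recognition algorithm achieves excellent recognition performance, with an overall recognition rate of 91.1%, which verifies the effectiveness and applicability of the proposed power personnel behavior classification method.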

     

    Abstract: Power personnel behavior recognition is a critical component of the safe operation and maintenance of the power system. However, current personnel behavior recognition algorithms, which primarily rely on support vector machines and multilayer perceptrons (MLPs) for behavior classification, have a number of shortcomings, including low recognition accuracy, insufficient consideration of the interactions between human skeletons, and poor transferability and generality. To address these challenges, we propose a novel behavior classification decoder based on self-attention and cross-attention mechanisms, which fully considers the associations between human skeletons. Compared with traditional classification methods, the proposed approach improves classification accuracy by 10%~20%, and it outperforms deep learning MLP classification methods by more than 2%. Behavior recognition is performed in two stages with an encoder-decoder architecture, so the decoder can be attached to the back end of any pose estimation network and is highly extensible. Additionally, we use a grouped decoding method to overcome the quadratic complexity induced by the attention mechanism, which enables the decoder to scale to more behavior categories and makes it more broadly applicable. The proposed behavior recognition algorithm achieves excellent recognition performance on a personnel image dataset collected from substation working scenarios. The overall recognition rate reaches 91.1%, which verifies the efficacy and practicality of the proposed power personnel behavior classification method.
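
    To make the grouped decoding idea concrete, the following is a minimal pure-Python sketch, not the authors' implementation. It assumes one possible reading of the abstract: a small set of K group queries (instead of one query per behavior class) first attend to each other, then cross-attend to the keypoint features produced by a pose estimation network, and each group's output is expanded into its own slice of class logits. All names (`attend`, `grouped_decode`, `demo`, `attention_scores`), the 17-keypoint skeleton, the feature dimension, and the random weights are illustrative assumptions; a real model would learn the queries and heads.

```python
import math
import random

def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def attend(queries, keys, values):
    """Scaled dot-product attention: each query returns a weighted mean of values."""
    scale = 1.0 / math.sqrt(len(keys[0]))
    out = []
    for q in queries:
        scores = [scale * sum(a * b for a, b in zip(q, k)) for k in keys]
        weights = softmax(scores)
        out.append([sum(w * v[d] for w, v in zip(weights, values))
                    for d in range(len(values[0]))])
    return out

def grouped_decode(keypoint_tokens, group_queries, group_heads, num_classes):
    """Hypothetical grouped decoder: K group queries stand in for N class queries."""
    # Self-attention among the K group queries: K*K scores rather than N*N.
    q = attend(group_queries, group_queries, group_queries)
    # Cross-attention from group queries to the M pose tokens: K*M scores.
    q = attend(q, keypoint_tokens, keypoint_tokens)
    # Each group expands its output vector into its own slice of class logits.
    logits = []
    for vec, head in zip(q, group_heads):
        logits.extend(sum(w * x for w, x in zip(row, vec)) for row in head)
    # Drop the padding logits when N is not a multiple of K.
    return logits[:num_classes]

def rand_matrix(rng, rows, cols):
    """Random stand-in for learned parameters or extracted features."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]

def demo(num_classes=10, num_groups=3, num_keypoints=17, dim=8, seed=0):
    """Run the sketch on random data; every size here is an assumption."""
    rng = random.Random(seed)
    per_group = math.ceil(num_classes / num_groups)
    tokens = rand_matrix(rng, num_keypoints, dim)   # stand-in keypoint features
    queries = rand_matrix(rng, num_groups, dim)     # learned in a real model
    heads = [rand_matrix(rng, per_group, dim) for _ in range(num_groups)]
    return grouped_decode(tokens, queries, heads, num_classes)

def attention_scores(num_queries, num_keypoints):
    """Attention scores per decoder layer: self-attention plus cross-attention."""
    return num_queries * num_queries + num_queries * num_keypoints
```

    Under these assumptions, `attention_scores(100, 17)` counts 11700 scores per layer when 100 classes each get their own query, while `attention_scores(10, 17)` counts 270 when the same classes share 10 group queries. This illustrates why grouping can let the number of behavior categories grow without the quadratic self-attention term growing with it.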

     
