应用科学学报 ›› 2021, Vol. 39 ›› Issue (4): 605-614.doi: 10.3969/j.issn.0255-8297.2021.04.008

• CCF NCCA 2020专辑 • 上一篇    

基于强化学习的多模态场景人体危险行为识别方法

张晓龙1, 王庆伟2, 李尚滨3   

  1. 1. 东北林业大学 体育部, 黑龙江 哈尔滨 150040;
    2. 哈尔滨华德学院 体育教研部, 黑龙江 哈尔滨 150025;
    3. 哈尔滨工程大学 体育部, 黑龙江 哈尔滨 150001
  • 收稿日期:2020-08-30 发布日期:2021-08-04
  • 通信作者: 李尚滨,教授,研究方向为人工智能、体育工程学。E-mail:sports@hrbeu.edu.cn E-mail:sports@hrbeu.edu.cn
  • 基金资助:
    国家自然科学基金(No.61163025)资助

Recognition Method of Human Dangerous Behavior in Multimodal Scenes Using Reinforcement Learning

ZHANG Xiaolong1, WANG Qingwei2, LI Shangbin3   

  1. 1. P. E. Department, Northeast Forestry University, Harbin 150040, Heilongjiang, China;
    2. Physical Education Department, Harbin Huade University, Harbin 150025, Heilongjiang, China;
    3. Physical Education Department, Harbin Engineering University, Harbin 150001, Heilongjiang, China
  • Received:2020-08-30 Published:2021-08-04

摘要: 在多模态场景下,常规人体危险行为识别方法对人体危险行为的识别精度较低,于是提出了基于强化学习的多模态场景人体危险行为识别方法。首先根据强化学习的特征提取算法获取多模态场景人体危险行为特征集,其次基于强化学习数据决策提取多模态场景人体危险行为,构建人体危险行为模糊识别模型。最后将上述人体危险行为特征子集代入模型,计算不同感官下危险行为的隶属度,实现多模态场景人体危险行为的识别。实验结果表明:该方法对危险行为的识别准确率较高,其识别延迟时间低于300 ms。

关键词: 强化学习, 多模态, 场景, 行为识别

Abstract: In multimodal scenes, conventional human dangerous behavior recognition methods generally perform low recognition accuracy. Therefore, this paper proposes a human dangerous behavior recognition method based on reinforcement learning. Firstly, a feature extraction algorithm of reinforcement learning is used to obtain feature subsets of human dangerous behavior in multimodal scenes. Secondly, human dangerous behaviors in multimodal scenes are extracted by reinforcement learning data decision-making, and a fuzzy recognition model of human dangerous behavior is constructed. Finally, by bringing the obtained feature subsets of human dangerous behavior into the model and calculating the membership degree of dangerous behavior under different senses, the recognition of human dangerous behavior in multimodal scenes can be realized. Experimental results show that the method in this paper has a high recognition accuracy and a recognition delay of less than 300 ms.

Key words: reinforcement learning, multimodality, scene, behavior recognition

中图分类号: