Journal of Applied Sciences ›› 2024, Vol. 42 ›› Issue (1): 103-118. DOI: 10.3969/j.issn.0255-8297.2024.01.009

• Special Issue on Computer Applications •

Object Tracking Algorithm Based on Vehicle Appearance Features and Inter-frame Optical Flow

LI Shaoqian1, CHENG Xin1, ZHOU Jingmei2, ZHAO Xiangmo1

  1. College of Information Engineering, Chang'an University, Xi'an 710064, Shaanxi, China;
    2. College of Electronic and Control Engineering, Chang'an University, Xi'an 710064, Shaanxi, China
  • Received: 2023-06-29  Online: 2024-01-30  Published: 2024-02-02
  • Corresponding author: CHENG Xin, associate professor; research interests: deep learning and computer vision, artificial intelligence and Internet of Vehicles technology. E-mail: xincheng@chd.edu.cn
  • Funding: Supported by the National Key Research and Development Program of China (No. 2021YFB2501200), the National Natural Science Foundation of China (No. 52102452), the Key Research and Development Program of Shaanxi Province (No. 2023-YBGY-119), the Natural Science Basic Research Program of Shaanxi Province (General Program, No. 2023-JC-YB-523), the Innovation Capability Support Program of Shaanxi Province (No. 2022KJXX-02), the Transportation Research Project of the Department of Transport of Shaanxi Province (No. 21-05X), the Young Talent Fund of the University Association for Science and Technology in Shaanxi (No. 20210122), and the Fundamental Research Funds for the Central Universities (No. 300102242203)



Abstract: In complex road scenes, frequent occlusions between vehicle targets, similar appearances among vehicles, and the use of static preset parameters throughout a target's entire motion all contribute to a decline in tracking accuracy. This paper proposes an object tracking algorithm based on vehicle appearance features and inter-frame optical flow. Firstly, the positions of the vehicle bounding boxes are obtained with the YOLOv5x network model of the YOLOv5 algorithm. Secondly, the optical flow between the current frame and the previous frame is calculated using the RAFT (recurrent all-pairs field transforms for optical flow) algorithm, and the optical flow map is cropped according to the obtained box positions. Finally, during Kalman filtering, the inter-frame optical flow is used as compensation to obtain more accurate motion state information, while vehicle appearance features and intersection over union (IOU) features are used to complete trajectory matching. Experimental results show that the proposed algorithm performs well on the MOT16 dataset. Compared with simple online and real-time tracking with a deep association metric (DeepSORT), the proportion of mostly tracked trajectories (MT) increases by 1.6%, multiple object tracking accuracy (MOTA) increases by 1.3%, and multiple object tracking precision (MOTP) increases by 0.6%; the accuracy of the improved vehicle appearance feature extraction model improves by 1.7% and 6.3% on the training and validation sets, respectively. Consequently, combining the high-precision vehicle appearance feature model with motion state information derived from the associated inter-frame optical flow enables effective vehicle target tracking in traffic scenes.
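
As a rough, non-authoritative sketch of the detection-plus-flow front end described in the abstract (not the authors' released code), the snippet below assumes the publicly available YOLOv5x weights from torch.hub and the RAFT implementation shipped with torchvision; averaging the cropped flow inside each box is only one plausible way to summarize per-vehicle motion.

    import torch
    from torchvision.models.optical_flow import raft_large, Raft_Large_Weights

    # Assumptions: public YOLOv5x weights via torch.hub and torchvision's RAFT model,
    # which are not necessarily the exact detector/flow configurations used in the paper.
    detector = torch.hub.load("ultralytics/yolov5", "yolov5x")
    raft = raft_large(weights=Raft_Large_Weights.DEFAULT).eval()

    def detect_boxes(frame_rgb):
        """Run YOLOv5x on an HWC RGB frame and return (x1, y1, x2, y2) boxes."""
        results = detector(frame_rgb)
        return results.xyxy[0][:, :4].cpu().numpy()

    @torch.no_grad()
    def per_box_flow(prev_t, curr_t, boxes):
        """RAFT flow between two frames, cropped to each detected box.

        prev_t, curr_t: float tensors of shape (1, 3, H, W) scaled to [-1, 1],
        with H and W divisible by 8 (a RAFT input requirement).
        Returns one (dx, dy) mean displacement per box as a summary of its motion.
        """
        flow = raft(prev_t, curr_t)[-1][0]                     # final refinement, shape (2, H, W)
        offsets = []
        for x1, y1, x2, y2 in boxes:
            crop = flow[:, int(y1):int(y2), int(x1):int(x2)]   # crop the flow map to the box
            offsets.append(crop.mean(dim=(1, 2)).tolist())     # average per-pixel motion
        return offsets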

Key words: object tracking, vehicle appearance features, inter-frame optical flow, Kalman filter
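
The flow-compensated Kalman step and the IOU term used in matching could look roughly like the following minimal sketch; the state layout [cx, cy, vx, vy], the blending weight alpha, and the noise values are illustrative assumptions, not parameters reported in the paper.

    import numpy as np

    def iou(box_a, box_b):
        """Intersection over union of two (x1, y1, x2, y2) boxes, used in the matching stage."""
        x1, y1 = max(box_a[0], box_b[0]), max(box_a[1], box_b[1])
        x2, y2 = min(box_a[2], box_b[2]), min(box_a[3], box_b[3])
        inter = max(0.0, x2 - x1) * max(0.0, y2 - y1)
        area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
        area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
        return inter / (area_a + area_b - inter + 1e-9)

    def predict_with_flow(x, P, flow_dxdy, alpha=0.5):
        """One constant-velocity Kalman prediction for state x = [cx, cy, vx, vy],
        with the velocity blended toward the measured inter-frame flow (dx, dy).
        alpha is a hypothetical blending weight, not a value from the paper."""
        F = np.array([[1, 0, 1, 0],
                      [0, 1, 0, 1],
                      [0, 0, 1, 0],
                      [0, 0, 0, 1]], dtype=float)
        Q = np.eye(4) * 1e-2                                          # assumed process noise
        x = x.copy()
        x[2:4] = (1 - alpha) * x[2:4] + alpha * np.asarray(flow_dxdy) # flow compensation
        x = F @ x
        P = F @ P @ F.T + Q
        return x, P

In a full tracker, the per-box (dx, dy) offsets from the previous sketch would feed predict_with_flow for each track, and the final assignment would combine an appearance-feature distance with this IOU term in the matching cost.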

CLC number: