Towards High Performance Video Object Detection for Mobiles - 代码天地

Towards High Performance Video Object Detection for Mobiles

其他 2019-04-08 11:38:04 阅读次数: 0

motivation：

　　近来以桌面版GPU为计算平台的视频目标检测取得了较多成果，如DFF,FGFA等，但这些算法没法用在移动端，移动端的计算资源有限不足以运行这些算法。

本文提出了一个用于移动端的轻量化视频目标检测网络。在稀疏关键帧上运行轻量化的图片目标检测器，使用了一个非常小的轻光流网络来提取光流场。

同时文章还提出了一个GRU模型来在关键帧上进行高效的特征聚合。在非关键帧上进行稀疏特征传播。整个网络可以被端到端地训练，在imagenet VID上达到了60.2的mAP，

在华为Mate8上达到了25.6的fps。

介绍：文章提出了一个轻量化的网络结果用于视频目标检测，该算法基于两个原则，一个是在非关键帧上进行特征传播，另一个是在关键帧之间进行特征聚合，同Towards High Performance Video Object Detection。

但是考虑到速度、模型大小、准确率，算法中用到的网络结构都需要重新设计。对所有帧，利用一个非常小的可以在移动端运行的Light Flow网络来估计光流。对稀疏关键帧，作者提出了一个flow-guided gated recurrent unit来进行特征聚合。

此外，文章还使用了一个轻型图片目标检测器来在关键帧上计算特征。

Light Flow：

基于FlowNet重新设计的轻型光流估计网络。损失了15%的精度换来65倍的提速。具体结构设计可参见文章3.1.

Flow-guided GRU based feature aggregation：

特征聚合无疑对提高精度是非常重要的，FGFA中的融合方法是线性的没有记忆能力，递归特征融合虽然有了进步，但是难以训练去建模更长的帧间信息，部分原因是递归网络中的梯度消失和梯度爆炸。GRU在建模较长时间信息方面优于LSTM和RNN，因为在网络状态更新中考虑了非线性性。受这一点启发，本文在特征聚合中引入了卷积GRU用作特征集成，而不是仅仅进行加权平均。

在这里星号表示3x3卷积，圈表示点乘.

Lightweight key-frame object detector:

检测器的backbone使用了MobileNet，任务网络采用RPN和Light Head RCNN。

猜你喜欢

转载自www.cnblogs.com/hf19950918/p/10669462.html

Towards High Performance Video Object Detection for Mobiles

【深度学习 video detect】Towards High Performance Video Object Detection for Mobiles

Towards High Performance Video Object Detection

CVPR 2018 《Towards High Performance Video Object Detection》论文笔记

《D2Det：Towards High Quality Object Detection and Instance Segmentation》论文笔记

Towards Open World Object Detection

SparseBEV：High-Performance Sparse 3D Object Detection from Multi-Camera Videos

Towards Universal Object Detection by Domain Attention

论文阅读——Towards Adversarially Robust Object Detection

论文精读《BEVDet: High-Performance Multi-Camera 3D Object Detection in Bird-Eye-View》

论文速读 -- BEVDet: High-Performance Multi-Camera 3D Object Detection in Bird-Eye-View

Fast and accurate object detection in high resolution 4K and 8K video using GPUs 论文笔记

HyperNet: Towards Accurate Region Proposal Generation and Joint Object Detection

《Quantization Mimic: Towards Very Tiny CNN for Object Detection》的阅读总结

Towards Adversarially Robust Object Detection 论文笔记

ThunderNet: Towards Real-time Generic Object Detection on Mobile Devices

Object Detection in Video with Spatiotemporal Sampling Networks

Progressive Sparse Local Attention for Video object detection

Relation Distillation Networks for Video Object Detection

Fast Object Detection in Compressed Video论文详读

Cascade R-CNN: Delving into High Quality Object Detection

视频物体检测(VID) Impression Network for Video Object Detection

Flow-Guided Feature Aggregation for Video Object Detection

《Video Saliency Detection Using Object Proposals》阅读笔记

Flow Guided Recurrent Neural Encoder for Video Salient Object Detection

20.Flow-Guided Feature Aggregation for Video Object Detection

Object detection from video tubelets with CNN翻译（未完成）

Fully Motion-Aware Network for Video Object Detection

CATDET: Cascaded Tracked Detector for Efficient Object Detection from Video

Video Object Detection with an Aligned Spatial-Temporal Memory

今日推荐

TIOBE 5 月榜单：Fortran “复活”进入 Top 10

GCC 14.1 发布

面壁智能发布 Eurux-8x22B 开源大模型 —— 堪称「理科状元」

开源日报 | 谷歌扶持鸿蒙上位；开源Rabbit R1；Docker加持的安卓手机；微软的焦虑和野心；海尔电器把开放平台关了

中国码农的“35岁魔咒”

蘭雅 CorelDRAW 插件 2024.5.1 国际劳动节版，免费下载

Arc Browser for Windows 1.0 正式 GA

90后程序员开发视频搬运软件、不到一年获利超 700 万，结局很刑！

周排行

基本数据类型封装类比较 Java源码解读(一) 8种基本类型对应的封装类型

JS实现无缝滚动上

深入解析HashMap原理（基于JDK1.8）

mysql的连接池

关于.htc

linux下的ubuntu12.04图形界面

【数论】好推不好记的扩展欧几里德

设备树详解

cscope + tags 简单设置

xml学习

每日归档

更多

2024-05-09(35)

2024-05-08(42)

2024-05-07(14)

2024-05-06(40)

2024-05-05(0)

2024-05-04(7)

2024-05-03(19)

2024-05-02(0)

2024-05-01(4)

2024-04-30(1)