视频物体检测(VID) Impression Network for Video Object Detection - 代码天地

视频物体检测(VID) Impression Network for Video Object Detection

其他 2018-06-08 16:17:25 阅读次数: 0

SenseTime出品

来源：https://arxiv.org/pdf/1712.05896.pdf

基于印象机制的高效多帧特征融合，解决defocus and motion

blur等问题（即视频中某帧的质量低的问题），同时提高速度和性能。
这里写图片描述

这里写图片描述

类似TSN，每个segment选一个key frame（注意，TSN做视频分类是在cnn最后才融合不同的segments）。特征融合前需要用Optical flow（FlowNet-S）来对齐。
目前使用的是fixed segment length，联想Deep Alternative Neural Network使用的自适应视频分段方法。

Detect to Track and Track to Detect
这里写图片描述
思考：track是不是可以代替印象网络中的光流来自动做对齐？

Mobile Video Object Detection with Temporally-Aware Feature Maps
这里写图片描述
哈哈，看来帧间特征的关联就是光流，TSN，印象机制，RNN，3d conv这几种常见办法了。
注意这里用的是卷积LSTM！且改进成了高效的Bottleneck-LSTM：

Spatial-Temporal Memory Networks for Video Object Detection
也是为了通过简单的帧来加强质量差的帧：
这里写图片描述

记忆机制（是不是和印象机制差不多？）：
这里写图片描述

STMM是ConvGRU的改进，以更好地利用ImageNet预训练权重。
使用更高效的MatchTrans module来对齐帧间的特征（而不是光流。可以看出最近的文章思路都很像==），大概是基于近邻的思路。
动作分类中记忆机制会不会比TSN好，是否需要做对齐？

Towards High Performance Video Object Detection
这里写图片描述

路子和印象机制那篇很像，也是稀疏的特征传递，用flow对齐。好像方法更精致一些（虽然论文好像上传的很仓促，是因为最近太多类似工作上传了吗？）？比如对key frame进行了自适应？、

参考：

【1】https://blog.csdn.net/wayne2019/article/category/7077174

猜你喜欢

转载自blog.csdn.net/u012426298/article/details/80487948

视频物体检测(VID) Impression Network for Video Object Detection

视频物体检测(VID) Object Detection from Video Tubelets with Convolutional Neural Networks

视频物体检测(VID) FGFA：Flow-Guided Feature Aggregation for Video Object Detection

【论文笔记】视频物体检测(VID)系列 FGFA：Flow-Guided Feature Aggregation for Video Object Detection

视频物体检测(VID) T-CNN: Tubelets with Convolutional Neural Networks for Object Detection from Videos

ImageAI (三) 使用Python快速简单实现视频中物体检测 Video Object Detection and Tracking

Fully Motion-Aware Network for Video Object Detection

物体检测之RefineDet:Single-Shot Refinement Neural Network for Object Detection

Object Detection物体检测（图像、视频、摄像头）

视频物体检测(VID) Deep Feature Flow for Video Recognition

视频目标检测(video object detection)简单综述

视频物体检测(VID) MR-FLOW & FlowNet 2.0 视频物体检测(VID) Deep Feature Flow for Video Recognition

Patchwork: A Patch-wise Attention Network for Efficient Object Detection and Segmentation in Video Streams

FMAN(Fully Motion-Aware Network for Video Object Detection)论文详读

DetNet: A Backbone network for Object Detection

2018.12.03——目标检测或物体检测（画方框）object detection

[深度学习]Object detection物体检测之SSD(8)

[深度学习]Object detection物体检测之FPN(11)

[深度学习]Object detection物体检测之SPPNet(3)

[深度学习]Object detection物体检测之DSSD(10)

[深度学习]Object detection物体检测之Retinanet(12)

[深度学习]Object detection物体检测之概述

【论文解读】伪装物体检测 Camouflaged Object Detection

视频显著性检测(Video Salient Object Detection)部分论文汇总

Towards High Performance Video Object Detection

Object Detection in Video with Spatiotemporal Sampling Networks

Progressive Sparse Local Attention for Video object detection

Towards High Performance Video Object Detection for Mobiles

Relation Distillation Networks for Video Object Detection

Fast Object Detection in Compressed Video论文详读

今日推荐

面壁智能发布 Eurux-8x22B 开源大模型 —— 堪称「理科状元」

开源日报 | 谷歌扶持鸿蒙上位；开源Rabbit R1；Docker加持的安卓手机；微软的焦虑和野心；海尔电器把开放平台关了

中国码农的“35岁魔咒”

蘭雅 CorelDRAW 插件 2024.5.1 国际劳动节版，免费下载

Arc Browser for Windows 1.0 正式 GA

90后程序员开发视频搬运软件、不到一年获利超 700 万，结局很刑！

周排行

【转】spring中对控制反转和依赖注入的理解

tms webcore 安装和使用

java程序员进阶相关书籍

SpringMVC接受请求参数、

如何保存训练好的机器学习模型

MyEclipse、Eclipse设置项目JDK的三个地方

商超行业微信小程序开发定制一般多少钱（行业技术人员解读）

Markdown编辑器语言——30分钟入门到到精通

Linux系统下MongoDB的简单安装与基本操作

Power Strings

每日归档

更多

2024-05-07(14)

2024-05-06(40)

2024-05-05(0)

2024-05-04(7)

2024-05-03(19)

2024-05-02(0)

2024-05-01(4)

2024-04-30(1)

2024-04-29(40)

2024-04-28(0)