Flow-Guided Feature Aggregation for Video Object Detection

Category: Other | Posted: 2018-07-06 14:50:39

Paper: https://arxiv.org/pdf/1703.10025.pdf
Code: https://github.com/sunshinezhihuo/Flow-Guided-Feature-Aggregation
Recommended reading:
https://blog.csdn.net/elaine_bao/article/details/78449724
https://blog.csdn.net/zhangjunhit/article/details/76684849

Abstract
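Extending object detectors from images to video is challenging: motion blur, video defocus, and rare poses degrade object appearance in ways that are seldom seen in still images. Existing work tries to exploit temporal information at the box level, but those methods cannot be trained end-to-end. This paper presents a flow-guided feature aggregation framework for video object detection that can be learned end-to-end. It improves the per-frame features by aggregation of nearby features along the motion paths, and thus improves the video recognition accuracy. The method is especially effective for fast-moving objects.
The method is called FGFA.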

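Applying detectors designed for still images directly to video object detection is challenging.
A video carries rich information about the same object. Existing video object detection methods exploit this temporal information only in a simple way: they first apply an object detector to individual frames, then aggregate the detected bounding boxes along the time dimension in a dedicated post-processing step. That step relies on off-the-shelf motion estimation (such as optical flow) and on hand-crafted bounding-box association rules (such as object tracking). Such methods do not improve the quality of the per-frame detections themselves: "The performance improvement is from heuristic post-processing instead of principled learning." These are the box-level methods.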

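This paper instead improves per-frame feature learning through temporal aggregation. Note that, because of motion in the video, the features of the same object instance are usually not spatially aligned across frames. Naive feature aggregation may even hurt performance, so modeling motion during learning is critical.
The paper therefore proposes FGFA: flow-guided feature aggregation.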
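To make "aggregation along motion paths" concrete, the core computation can be written down as follows. This is recalled from the FGFA paper rather than taken from these notes, so the notation should be checked against the paper: $I_i$ is the reference frame, $f_j$ the feature map of a nearby frame $j$, $\mathcal{F}$ the flow network, $\mathcal{W}$ bilinear warping, and $K$ the aggregation radius.

```latex
% Flow from the reference frame i to a nearby frame j, and the warped feature
M_{i \to j} = \mathcal{F}(I_i, I_j), \qquad
f_{j \to i} = \mathcal{W}(f_j, M_{i \to j})

% Aggregated feature at the reference frame: a per-location weighted sum
\bar{f}_i = \sum_{j=i-K}^{i+K} w_{j \to i} \, f_{j \to i},
\qquad \sum_{j=i-K}^{i+K} w_{j \to i}(p) = 1 \ \text{ for every location } p
```

The weights $w_{j \to i}(p)$ measure how similar the warped feature is to the reference feature at location $p$, so that poorly aligned or degraded frames contribute less.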
Because the method aims to improve feature quality, it is complementary to existing box-level frameworks and can be used alongside them.
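The idea of warping nearby features onto the reference frame before averaging them can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names `bilinear_warp` and `aggregate` are hypothetical, the flow fields are assumed to be given (the paper predicts them with a flow network), and the weights here are computed from cosine similarity of the raw features, whereas the paper uses a learned embedding for that step.

```python
import numpy as np

def bilinear_warp(feat, flow):
    """Warp a neighbor frame's features (C, H, W) onto the reference frame.

    flow has shape (2, H, W): for each reference location (y, x) it stores the
    (dx, dy) offset to the matching location in the neighbor frame.
    """
    C, H, W = feat.shape
    ys, xs = np.meshgrid(np.arange(H), np.arange(W), indexing="ij")
    # Sampling positions in the neighbor frame, clamped to the image border.
    x = np.clip(xs + flow[0], 0, W - 1)
    y = np.clip(ys + flow[1], 0, H - 1)
    x0, y0 = np.floor(x).astype(int), np.floor(y).astype(int)
    x1, y1 = np.minimum(x0 + 1, W - 1), np.minimum(y0 + 1, H - 1)
    wx, wy = x - x0, y - y0
    # Bilinear interpolation between the four surrounding feature vectors.
    top = feat[:, y0, x0] * (1 - wx) + feat[:, y0, x1] * wx
    bottom = feat[:, y1, x0] * (1 - wx) + feat[:, y1, x1] * wx
    return top * (1 - wy) + bottom * wy

def aggregate(ref_feat, neighbor_feats, flows, eps=1e-8):
    """Average the reference feature with flow-warped neighbor features.

    Assumption: weights come from cosine similarity of raw features,
    normalized with a softmax over frames at every spatial location.
    """
    warped = np.stack([ref_feat] + [bilinear_warp(f, m)
                                    for f, m in zip(neighbor_feats, flows)])
    ref_unit = ref_feat / (np.linalg.norm(ref_feat, axis=0, keepdims=True) + eps)
    warped_unit = warped / (np.linalg.norm(warped, axis=1, keepdims=True) + eps)
    sim = (warped_unit * ref_unit[None]).sum(axis=1)      # (N, H, W)
    sim = sim - sim.max(axis=0, keepdims=True)            # numerical stability
    weights = np.exp(sim)
    weights = weights / weights.sum(axis=0, keepdims=True)
    return (weights[:, None] * warped).sum(axis=0), weights
```

Skipping the warp and averaging the neighbor features directly would correspond to the naive aggregation that the notes warn about: features of a moving object would be blended with background features from other frames.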

Reposted from blog.csdn.net/sunshinezhihuo/article/details/80522093
