Adversarial Spatio-Temporal Learning for Video Deblurring - 代码天地

Adversarial Spatio-Temporal Learning for Video Deblurring

其他 2020-03-12 10:41:09 阅读次数: 0

1. 概述

作者将GAN（原始的）应用到视频去雾中，由于2D卷积只能提取输入的位置信息，针对视频连续帧具有时间信息的特点作者采用了3D卷积（部分卷积层中），取得了SOTA的效果。

2. 模型结构

生成模型如图1所示，

在这里插入图片描述

图1 生成模型。

在这里插入图片描述

表1 生成模型。它是由两个卷积层(L1和L2)， 14个残差块，两个卷积层(L31和L32)没有跳转连接，和三个额外的卷积层数(L33、L34和L35)组成。

作者通过对不同数量（3,5,7,9）的连续帧作为输入，对模型性能进行比较，选择了5作为模型输入连续帧的数量。因为作者采用的是3×3×3的卷积核，因此将三张连续帧进行concat，然后进行卷积。

在输入前，作者将RGB图像转换到YCbCr色彩空间，并将Y通道图像（光照强度通道，即灰度图）作为输入（“since the illumination is the most salient one”），得到模型输出后在利用原始CbCr信息将输出转换到RGB空间。

由于作者在卷积过程中保持输出特征图大小不变，因此没有采用上采样，下采样和反卷积。

将该模型称为视频去雾是因为它的输入是视频，输出也是视频。具体来说，模型对输入拦截五张连续帧作为输入，输出一张复原图，如此连续的输入输出便达到视频输入视频输出的效果。

模型整体结构如图2所示：

在这里插入图片描述

图2 GAN的整体网络结构

其中判别器借鉴了VGG网络。

3. 损失函数

$\mathcal{L}_{\text {content }}=\frac{1}{W H} \sum_{x=1}^{W} \sum_{y=1}^{H}\left(I_{x, y}^{\text {sharp }}-G\left(I^{\text {blurry }}\right)_{x, y}\right)^{2}\tag{1}$ $L_{content}$ 即为图2中的loss1。 $I^{sharp}$ 表示清晰图像，即GT。
$\mathcal{L}_{\text {adversarial}}=\log \left(1-D\left(G\left(I^{\text {blurry}}\right)\right)\right)\tag{2}$ $L_{adversarial}$ 即为图2中的loss2。

整体损失如式3表示：
$\mathcal{L}=\mathcal{L}_{\text {content}}+\alpha \cdot \mathcal{L}_{\text {adversarial}}\tag{3}$

4. 参考文献：

[1] Kaihao Zhang , Wenhan Luo , Yiran Zhong, Lin Ma , Member, IEEE, Wei Liu , and Hongdong Li, “Adversarial Spatio-Temporal Learning
for Video Deblurring.” IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 28, NO. 1, JANUARY 2019.

larkii

发布了108 篇原创文章 · 获赞 7 · 访问量 4383

私信关注

猜你喜欢

转载自blog.csdn.net/weixin_44795555/article/details/104731927

Adversarial Spatio-Temporal Learning for Video Deblurring

[TPAMI-2023] Enhanced Spatio-Temporal Interaction Learning for Video Deraining: Faster and Better

Learning Spatial and Spatio-Temporal Pixel

Learning hierarchical spatio-temporal features for action recognition with ISA

Progressive Fusion Video Super-Resolution Network via Exploiting Non-Local Spatio-Temporal Correlati

论文阅读：Beyond Short-Term Snippet: Video Relation Detection with Spatio-Temporal Global Contex

【论文阅读】Learning Spatio-Temporal Features with 3D Residual Networks for Action Recognition

STGCN:Spatio-Temporal Graph Convolutional Networks: A Deep Learning Framework for Traffic Forecastin

论文学习：Learning spatio-temporal features with 3D convolutional networks

《Learning Spatio-Temporal Representation with Pseudo-3D Residual Networks》算法详解

【IEEE TDKE 2020】Flow Prediction in Spatio-Temporal Networks Based on Multitask Deep Learning

ICCV2021跟踪算法Stark的配置（Learning Spatio-Temporal Transformer for Visual Tracking）

视频去模糊论文阅读-VDFlow: Joint Learning for Optical Flow and Video Deblurring

视频去模糊论文阅读-Cascaded Deep Video Deblurring Using Temporal Sharpness Prior

视频去模糊论文阅读-Online Video Deblurring via Dynamic Temporal Blending Network

CNN in MRF: Video Object Segmentataion via Inference in A CNN-Based Higher-Order Spatio-Temporal MRF

cvpr论文阅读之Deep Spatio-Temporal Random Fields for Efficient Video Segmentation（用于视频分割的深度时空随机场）

Real-Time Video Super-Resolution with Spatio-Temporal Networks and Motion Compensation论文解析（视频超分）

视频去噪EMVD：Efficient Multi-Stage Video Denoising with Recurrent Spatio-Temporal Fusion 全文翻译

视频超分算法VESPCN：Real-Time Video Super-Resolution with Spatio-Temporal Networks and Motion Compensation

【时空序列预测第八篇】Spatio-Temporal Graph Convolutional Networks: A Deep Learning Framework for Traffic Forec

论文阅读笔记（四）【TIP2017】：Video-Based Pedestrian Re-Identiﬁcation by Adaptive Spatio-Temporal Appearance Model

SAF-Net论文代码复现：台风强度变化预测《SAF-Net:A spatio-temporal deep learning method for typhoon intensity predic》

DeblurGAN：Blind Motion Deblurring Using Conditional Adversarial Networks

Robust Adversarial Reinforcement Learning

视频去模糊论文阅读-Learning Blind Motion Deblurring

Decoupled Learning for Conditional Adversarial Networks

adversarial Learning and attacks 学习笔记

论文笔记 DeblurGAN： Blind Motion Deblurring Using Conditional Adversarial Networks

[论文阅读] Generative Adversarial Networks for Video-to-Video Domain Adaptation

今日推荐

《美国对全球网络空间安全与发展的威胁和破坏》报告发布

火速冲上 GitHub 热榜 —— 开源编程语言、框架哪有这么可爱？

北京人形机器人创新中心发布全球首个纯电驱拟人奔跑的全尺寸人形机器人“天工”

LFOSSA 源来如此公开课 | 掌握云原生未来：CNCF 认证全面攻略与备考秘籍

周排行

循环神经网络（rnn）讲解

Tigao教程四：单独的关节运动

金蝶K3WISE15.0-注册套打教程

如何在Mac上配置Kubernetes

Android应用结束自身进程的方法

SpringMVC学习十三拦截器栈

中国驻洛杉矶总领馆举行新春招待会

HttpClient get post 发送

11 - three.js 笔记 - 绘制三维字体模型

Mysql递归获取某个父节点下面的所有子节点和子节点上的所有父节点

每日归档

更多

2024-05-01(4)

2024-04-30(1)

2024-04-29(40)

2024-04-28(0)

2024-04-27(56)

2024-04-26(39)

2024-04-25(22)

2024-04-24(36)

2024-04-23(26)

2024-04-22(39)