[GAN]Generative Image Inpainting with Contextual Attention论文阅读

其他 2021-12-12 22:47:13 阅读次数: 0

1 论文简介

Generative Image Inpainting with Contextual Attention是UIUC的Jiahui Yu在Thomas S. Huang的指导下，联合Adobe Research完成的一项工作，发表于CVPR 2018。
作者在Iizuka等人提出的Globally and locally consistent image completion工作的基础上进行改进（Improved Generative Inpainting Network），并提出Contextual Attention，以利用传统方法中要求图像之中的patch之间存在相似性的思路，弥补卷积神经网络不能有效的从图像较远的区域提取信息的不足。

1.1 网络架构

在这里插入图片描述

生成器：包括两个阶段。第一个阶段是一个粗糙网络（Coarse Network），利用空间衰减重构损失训练。第二个阶段是一个细化网络（Refinement Network），利用重构损失和WGAN损失训练。
判别器：包括两个部分。第一个部分负责局部判别（Local Critic），第二个部分负责全局判别（Global Critic），都是基于 WGAN-GP损失（带梯度惩罚的WGAN损失）。

1.2 上下文注意力机制

在这里插入图片描述

思路为从已知图像中借鉴特征信息，以此生成缺失的patch。首先在背景区域提取3x3的patch，并作为卷积核。为了匹配前景（待修复区域）的patch，使用标准化内积（即余弦相似度）来测量，然后用softmax来为每个背景中的patch计算权值，最后选取出一个最好的patch，并反卷积出前景区域。对于反卷积过程中的重叠区域取平均值。
通俗一点讲，假设有待修补区域x，通过卷积的方法，从整个图中匹配几个像x的区域a，b，c，d，然后从上述区域中利用softmax找出最像x的区域，最终通过反卷积的方式，来生成x区域的图像。

1.3 损失函数

WGAN损失：

其中 $P_r$ 是真实的分布， $P_g$ 是生成数据的分布，这损失在GAN损失的基础上去掉了log。
梯度惩罚项：

只对位于空洞区域的像素点进行梯度惩罚，利用一个mask实现：
重构损失：
空间衰减重构损失：改变重构损失的mask权重，每一点的权值为 $\gamma^{l}$ ， $\gamma = 0.99$ ， $l$ 表示该点到已知的像素点最近的距离。

1.4 实验

优化器为Adam，学习率为0.0001，batch-size为48，单卡1080Ti训练，在Place2数据集上进心训练，输入图片size为256256，patch大小为128128。

定性对比

从左往右为原图，输入图片，baseline输出，model输出。
定量对比

2 项目简介

2.1项目背景

项目为2021飞桨启航菁英计划实习项目。项目基于 Paddle 2.1.2 进行开发并实现论文精度，十分感谢百度提供的实习机会和GPU资源。

2.2项目结果

TO DO

2.3项目使用

见百度aistudio

3 关于论文

References

猜你喜欢

转载自blog.csdn.net/weixin_44145782/article/details/121061835

[GAN]Generative Image Inpainting with Contextual Attention论文阅读

【论文译文】Generative Image Inpainting with Contextual Attention

【图像修复】AOT-GAN《Aggregated Contextual Transformations for High-Resolution Image Inpainting》

《Generative Image Inpainting with Adversarial Edge Learning》论文阅读之edge-connect

Coherent Semantic Attention for Image Inpainting

Semantic Image Inpainting with Deep Generative Models

Natural Image Matting via Guided Contextual Attention

[GAN]Free-Form Image Inpainting with Gated Convolution论文翻译（回归帖~\(￣︶￣*\))）

Image inpainting

论文阅读之《Image Inpainting for Irregular Holes Using Partial Convolutions》

Text to image论文精读 GAN-CLS和GAN-INT：Generative Adversarial Text to Image Synthesis

2019 - ICCV - 图像修复 Image Inpainting 论文导读《StructureFlow: Image Inpainting via Structure-aware ~~》

《GCAMatting：Natural Image Matting via Guided Contextual Attention》

GAN注意力机制研究——SPA-GAN: Spatial Attention GAN for Image-to-Image Translation 论文阅读笔记

【论文译文】Image Inpainting for Irregular Holes Using Partial Convolutions

论文|Free-Form Image Inpainting with Gated Convolution

论文解读：Inpaint Anything: Segment Anything Meets Image Inpainting

图像修复 Image Inpainting

论文阅读笔记《The Contextual Loss for Image Transformationwith Non-Aligned Data》（ECCV2018 oral）

论文阅读笔记《The Contextual Loss for Image Transformationwith Non-Aligned Data》（ECCV2018 oral）

【图像修复】论文阅读笔记 ----- 《Image inpainting based on deep learning: A review》

Text to image论文精读 DM-GAN: Dynamic Memory Generative Adversarial Networks for t2i

GAN + Video Inpainting的一些思考和相关论文

（RN）Region Normalization for Image Inpainting

Residual Attention Network for Image Classification 论文阅读

基于深度学习的Image Inpainting (图像修复)论文推荐(持续更新)

深度学习的Image Inpainting (图像修复)论文推荐(持续更新) （转）

论文精度 —— 2017 CVPR《High-Resolution Image Inpainting using Multi-Scale Neural Patch Synthesis》

论文阅读——Deformable Medical Image Registration Using Generative Adversarial Networks

Generative Diffusion Prior for Unified Image Restoration and Enhancement 论文阅读笔记

今日推荐

TIOBE 5 月榜单：Fortran “复活”进入 Top 10

GCC 14.1 发布

面壁智能发布 Eurux-8x22B 开源大模型 —— 堪称「理科状元」

开源日报 | 谷歌扶持鸿蒙上位；开源Rabbit R1；Docker加持的安卓手机；微软的焦虑和野心；海尔电器把开放平台关了

中国码农的“35岁魔咒”

蘭雅 CorelDRAW 插件 2024.5.1 国际劳动节版，免费下载

Arc Browser for Windows 1.0 正式 GA

90后程序员开发视频搬运软件、不到一年获利超 700 万，结局很刑！

周排行

基本数据类型封装类比较 Java源码解读(一) 8种基本类型对应的封装类型

JS实现无缝滚动上

深入解析HashMap原理（基于JDK1.8）

mysql的连接池

关于.htc

linux下的ubuntu12.04图形界面

【数论】好推不好记的扩展欧几里德

设备树详解

cscope + tags 简单设置

xml学习

每日归档

更多

2024-05-09(35)

2024-05-08(42)

2024-05-07(14)

2024-05-06(40)

2024-05-05(0)

2024-05-04(7)

2024-05-03(19)

2024-05-02(0)

2024-05-01(4)

2024-04-30(1)