Paper Reading - CNN+CNN: Convolutional Decoders for Image Captioning - 代码天地

Paper Reading - CNN+CNN: Convolutional Decoders for Image Captioning

企业开发 2018-08-27 15:58:14 阅读次数: 0

Link of the Paper: https://arxiv.org/abs/1805.09019

Innovations:

The authors propose a CNN + CNN framework for image captioning. There are four modules in the framework: vision module ( VGG-16 ), which is adopted to "watch" images; language module, which is to model sentences; attention module, which connects the vision module with the language module; prediction module, which takes the visual features from the attention module and concepts from the language module as input and predicts the next word.

　　　　　　　　

General Points:

RNNs or LSTMs cannot be calculated in parallel and ignore the underlying hierarchical structure of a sentence.
Directly feeding the output of the CNN into the RNN treats objects in an image the same and ignores the salient objects when generating one word.
In both m-RNN and NIC, an image is represented by a single vector, which ignores different areas and objects in the image. A spatial attention mechanism is introduced into image captioning model in Show, attend and tell: Neural image caption generation with visual attention, which allows the model to pay attention to different areas at each time step.

猜你喜欢

转载自www.cnblogs.com/zlian2016/p/9542632.html

Paper Reading - CNN+CNN: Convolutional Decoders for Image Captioning

Paper Reading - Convolutional Image Captioning ( CVPR 2018 )

Paper Reading - Learning to Evaluate Image Captioning ( CVPR 2018 ) ★

Paper Reading - 基础系列 - Tricks for Image Classification with CNN

Paper Reading - Show and Tell: Lessons learned from the 2015 MSCOCO Image Captioning Challenge

[Paper Reading] Image Captioning using Deep Neural Architectures (arXiv: 1801.05568v1)

[Paper Reading] 发表在 arXiv 上的 Image Captioning 方向的论文 -- 持续更新

《SCA-CNN：Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning》论文笔记

《SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning》论文笔记

[ Continuously Update ] The Paper List of Image Captioning

Feb20-paper reading-Convolutional Recurrent Neural Networks for Dynamic MR Image Reconstruction

图像理解（Image Captioning）（1）CNN部分

Paper Reading - Convolutional Sequence to Sequence Learning ( 2017 )

Paper Reading -- CBAM: Convolutional Block Attention Module

《Conditional Image Generation with PixelCNN Decoders》之Pixcel CNN---gated Pixcel CNN 阅读笔记

paper reading：Part-based Graph Convolutional Network for Action Recognition

[Paper Reading]FCOS: Fully Convolutional One-Stage Object Detection

【Paper Reading】R-CNN（V5）论文解读

Paper Reading - Deep Captioning with Multimodal Recurrent Neural Networks ( m-RNN ) ( ICLR 2015 ) ★

Paper Reading - Learning a Recurrent Visual Representation for Image Caption Generation

Paper Reading - Show and Tell: A Neural Image Caption Generator ( CVPR 2015 )

paper reading- Feb 25 about optimization problem used in image

Feb.27~image super-resolution reconstruction， paper reading

[Paper Reading] Show and Tell: A Neural Image Caption Generator

Paper Reading -- 《Spectral-Spatial Attention Networks for Hyperspectral Image Classification》

Guided Diffusion/Diffusion Models Beat GANs on Image Synthesis (Paper reading)

Generative Diffusion Prior for Unified Image Restoration and Enhancement (Paper reading)

Cascaded Diffusion Models for High Fidelity Image Generation (Paper reading)

DriftRec: Adapting diffusion models to blind image restoration tasks (Paper reading)

Paper | Fast image processing with fully-convolutional networks

今日推荐

《美国对全球网络空间安全与发展的威胁和破坏》报告发布

火速冲上 GitHub 热榜 —— 开源编程语言、框架哪有这么可爱？

北京人形机器人创新中心发布全球首个纯电驱拟人奔跑的全尺寸人形机器人“天工”

LFOSSA 源来如此公开课 | 掌握云原生未来：CNCF 认证全面攻略与备考秘籍

国产云输入法——仅华为无云端数据上传安全问题

周排行

Python环境安装与基础语法（1）——计算机基础知识

IMU预积分

ADAS中的LDW、FCW、BSD、LCA、ACC、AEB、APA、DMS代表的含义

B站笔试两道题

skyeye arm 硬件虚拟机环境的搭建

Web前端静态页面示例

数组-合并排序数组 II-简单

springcloud之版本问题启动报错

面向对象-------------匿名对象(六)

输入URL到页面呈现中间发生了什么？

每日归档

更多

2024-04-30(1)

2024-04-29(40)

2024-04-28(0)

2024-04-27(56)

2024-04-26(39)

2024-04-25(22)

2024-04-24(36)

2024-04-23(26)

2024-04-22(39)

2024-04-21(0)