Learning Deep Features for Discriminative Localization论文笔记 - 代码天地

Learning Deep Features for Discriminative Localization论文笔记

业界资讯 2023-07-29 02:49:56 阅读次数: 0

论文地址：https://arxiv.org/abs/1512.04150
github地址：http://cnnlocalization.csail.mit.edu

本文提出了一种研究网络可解释性的方式，用global average pooling的方式得到网络的注意力机制。

Motivation

过去global average pooling (GAP)通常被用于正则化训练卷积神经网络，而作者发现这一操作实际上使得卷积神经网络具有了定位能力。并且由于一般网络最后的全连接层会丢失空间信息，从而使得输出不具有可解释性。因此，为了使得最后的结果具有空间可解释性，作者提出使用GAP取代最后的全连接层的方式，这样得到的特征图经过简单的处理后可以反映出网络的空间识别能力，也即定位能力。

Methods

作者提出类激活层（Class Activation Mapping, CAM），其生成方式如下图所示。网络训练过程调整为在网络的最后一层卷积层的后面，加入GAP，去掉原本的全连接层，得到每个类别的score，再通过softmax层得到概率值。在网络收敛后，通过每一层卷积层的输出乘上该层对应分类的权重，然后对结果进行加权，经过上采样就可以得到热力图，也就得到了CAM，在与原始图像叠加后就可以得到下图等式右边的效果。
CAM
在文中，作者提出最后一层的卷积层输出尺寸一般选择在14*14左右，将之后的卷积层去掉，直接连到GAP层中。

Experiment

模型：AlexNet，VggNet，GoogLeNet，GoogLeNet-GMP
数据集：ILSVRC

分类性能比较：
定位能力比较
GAP与GMP比较（平均池化和最大值池化）：
另外，实验中，作者提出使用阈值的方式通过CAM得到目标的预测框。结果在弱监督的方式下比利用backpropogation的方法得到的结果整体更好，但在全监督模式下则相差比较大。

Thoughts

这篇文章想法挺不错的，但我觉得不太能用在目前自己的这个框架下，因为CAM的局限比较大，对框架的结构有要求，对于目标检测网络有些局限性。不过我觉得另一篇Grad-CAM相对来说可能能适应我目前的框架，它是基于梯度的一种热力图，适应性比CAM要好很多，这篇论文留待下篇博客再整理。

猜你喜欢

转载自blog.csdn.net/qq_43812519/article/details/105777157

Learning Deep Features for Discriminative Localization论文笔记

Learning Deep Features for Discriminative Localization

【Discriminative Localization】Learning Deep Features for Discriminative Localization 论文解析（转）

《Learning Deep Features for Discriminative Localization》文章解读

Learning Deep Features for Discriminative Localization -CAM方法帮助若监督学习研究实现物体定位论文阅读笔记

A Discriminative Feature Learning Approach for Deep Face Recognition 论文笔记

《A Discriminative Feature Learning Approach for Deep Face Recognition》论文笔记

论文笔记-------Learning Discriminative Features with Multiple Granularity for Person Re-Identification

论文笔记（2）--（Re-ID） Learning Discriminative Features with Multiple Granularities for Person Re-Id

论文笔记 — L2-Net: Deep Learning of Discriminative Patch Descriptor in Euclidean Space

【论文笔记】DEEP JOINT DISCRIMINATIVE LEARNING FOR VEHICLE REID（车辆重识别的深度联合判别学习）

计算机视觉论文阅读三：Learning Discriminative Features via Label Consistent Neural Network

论文之Learning Discriminative Features with Multiple Granularities for Person Re-Identification

[CVPR2018笔记] Discriminative Learning of Latent Features for Zero-Shot Recognition

Deep learning 论文笔记

A Discriminative Feature Learning Approach for Deep Face Recognition

【论文阅读】Deep Clustering for Unsupervised Learning of Visual Features

论文阅读笔记：Center Loss: A Discriminative Feature Learning Approach for Deep Face Recognition

论文阅读笔记（三十二）【ACM Multimedia 2018】：Learning Discriminative Features with Multiple Granularities for Person Re-Identification

Learning Discriminative Features with Multiple Granularities for Person Re-Identification

[CVPR 2018]Discriminative Learning of Latent Features for Zero-Shot Recognition

How transferable are features in deep neural networks? 论文笔记

论文笔记(1)：Deep Learning.

Deep Learning: A Critical Appraisal 论文笔记

Learning Transferable Features with Deep Adaptation Networks

【人脸识别】A Discriminative Feature Learning Approach for Deep Face Recognition

Center Loss - A Discriminative Feature Learning Approach for Deep Face Recognition

论文笔记：Learning Region Features for Object Detection

论文笔记（一）——Learning Rich Features for Image Manipulation Detection

无监督系列论文：深度聚类（一）：Deep Clustering for Unsupervised Learning of Visual Features

今日推荐

LFOSSA 源来如此公开课 | 掌握云原生未来：CNCF 认证全面攻略与备考秘籍

国产云输入法——仅华为无云端数据上传安全问题

开源日报 | 工业开源项目OGG 1.0；姐姐，你要和我一起配置火狐吗；苹果AI遥遥落后？Fedora 40

开放签电子签章：停止新增，优化体验，前进更进（五一假期前工作）

开源日报 | 中学生开源前端动画引擎；全球首个Llama3 8B中文版开源模型；联想电脑恐出局；Linus讽刺AI炒作

“百模大战”必有一战 | 2024中国“百模大战”竞争格局分析

周排行

Family Tree 题解

BZOJ 1093 最大半连通子图 SCC + DP

幂等处理

Spring----学习（2）----XML 配置Bean 自动装配

SQL Server 远程更新目标表数据

HIbernate3.6 环境搭建

特殊符号正则表达式

【Linux】第一章进程的理解

843. n-皇后问题（dfs+输出各种情况）

空间数据库2

每日归档

更多

2024-04-26(39)

2024-04-25(22)

2024-04-24(36)

2024-04-23(26)

2024-04-22(39)

2024-04-21(0)

2024-04-20(6)

2024-04-19(5)

2024-04-18(0)

2024-04-17(5)