Learning Deep Features for Discriminative Localization

×

CAM论文剖析(Learning Deep Features for Discriminative Localization)

96
刘开心_8a6c
2018.05.03 18:48 字数 894 阅读 621 评论 0

英文原文请点这里
译文请点这里

Agenda

文章内容剖析

实验代码在这里.

文章剖析

摘要

本文主要工作
  • 1、阐述GAP如何使CNN具有卓越定位能力
  • 2、证明了所提出网络能定位出“区别性”区域,且分类效果也不差

1 介绍

扩写摘要,展示结果图
  • 引出GAP的功能远不止正则化,更重要的是定位能力
  • 指出本文所提网络保留了定位能力且分类能力也不差
1.1 相关研究(研究现状)
  • 已有研究证明:CNN分类能和定位能力都很好
  • 介绍已有研究的两个主要工作&提出缺陷
    • 弱监督物体定位
      • 缺陷:作者并没有评估定位能力;不是端到端的训练,需要额外工作,可扩展性差
    • 可视化CNN内在表现
      • 缺陷:只分析了卷积层,忽略了全连接层,因此是不全面的;只展示了深层特征中什么信息被保留,并没有凸显出这些信息的相对重要性

2 类激活图(CAM)

  • 作用:展示被划分为某个特定类的“区别性”区域

  • 网络结构

  • 详细计算推理过程
    对于一个图,用fk(x, y)代表最后一个卷积层的单元k在空间坐标(x,y)中的激活值。对于每个单元k,通过GAP后的结果Fk为∑x,yfk(x, y)。则,对于每个类c,输入softmax的Sc为∑kwc,kFk,wc,k代表单元k对应的类c的权重。实际上,wc,k就是Fk对类c的重要性。最后类c的sotfmax输出Pc为exp(Sc)/∑cexp(Sc)。这里我们忽略偏差项:我们明确地把softmax的偏差项设置为0因为它几乎对分类表现没有影响。
    把Fk=∑x,yfk(x, y)带入Sc,得

    公式1


    我们用Mc定义类别c的CAM,则空间每个元素为

    公式2


    则Sc = ∑x,yMc(x,y),所以Mc(x,y)直接表明了把空间网格(x,y)激活对图片划分为类别c的的重要性。

    简陋的图

  • 对比使用GAP 和 GMP

3 弱监督物体定位

  • 评估了CAM的定位能力
    • 3.1 不同CNN网络的设置
    • 3.2 定位的同时不影响分类性能

4 通用的定位特征

  • 我们网络即使在陌生数据上(不训练)也能定位
    • 4.1 鸟类细粒度识别与定位能力评估
    • 4.2 证明GoogLeNet-GAP能用于图片通用模式识别

5 可视化特定类单元

6 结论

  • 提出CAM技术
  • 证明了CAM在定位方面的泛化能力

写作套路

套路
摘要
    开头
        In this work, we ...
    对比
        While ... , we ...
    介绍自己工作
        Despite ...  we ...
        We demonstrate that our ...
    最先进的技术
         state-of-the-art 
研究现状
    引用前人研究
        Recent work by xxx has shown that ...
        xxx et al propose a technique/method for ...
        These approaches ...
    介绍自己研究
        In our experiments, we found that ...
        our approach is ...
    xxx 远不止能做()工作,更重要的是还能()
        the advantages of xxx extend beyond () - In fact,  () can
        we can generalize this ability beyond just ()
    引出文章创新
        However, their ...
        Both of these works only analyze ... , ignoring ...
        While these works can ..., they only show ...
        Unlike xxx and xxx, our approach can ...
        Overall, our approach provides ...
正文
    开头
        In this section, we describe the procedure for ...
    技术可以用于()
        This technique can be applied to do sth.
    研究成功
        This suggests that our approach works as expected.
通用
    如图所示
         xxx is illustrated in Fig. 2. 
        As shown in Figure 1
    如表所示
         Tbl.1summarizes ...
    本质上讲
        In fact
        Essentially, xxx indicates ...
    形容表现的词
        remarkable ability
        impressive performance
        generic
    连接词
        further
        In general
        specificly

文章参考论文

参考论文
      </div>
    </div>
</div>

<!-- 如果是付费文章,未购买,则显示购买按钮 -->

<!-- 连载目录项 -->

<!-- 如果是付费文章 -->
  <!-- 如果是付费连载,已购买,且作者允许赞赏,则显示付费信息和赞赏 -->
    <div id="free-reward-panel" class="support-author"><p>小礼物走一走,来简书关注我</p> <div class="btn btn-pay">赞赏支持</div> <div class="supporter"><ul class="support-list"></ul> <!----></div> <!----> <!----></div>

  <div class="show-foot">
    <a class="notebook" href="/nb/25082518">
      <i class="iconfont ic-search-notebook"></i>
      <span>英文论文剖析</span>



  <!-- 文章底部作者信息 -->
    <div class="follow-detail">
      <div class="info">
        <a class="avatar" href="/u/f47c25ce7f26">
          <img src="//upload.jianshu.io/users/upload_avatars/4964755/756b7309-b494-41b1-9ee9-45603ab215fa.jpg?imageMogr2/auto-orient/strip|imageView2/1/w/96/h/96" alt="96">


刘开心_8a6c

写了 103206 字,被 279 人关注,获得了 393 个喜欢


迷恋高质量代码的程序员,爱读国学,热爱分享,喜欢积极向上乐观高效的生活。愿与更多志同道合者成为好友,分享交流。

<div class="meta-bottom">
  <div data-v-6ddd02c6="" class="like"><div data-v-6ddd02c6="" class="btn like-group"><div data-v-6ddd02c6="" class="btn-like"><a data-v-6ddd02c6="">喜欢</a></div> <div data-v-6ddd02c6="" class="modal-wrap"><a data-v-6ddd02c6="">3</a></div></div> <!----></div>
  <div class="share-group">
    <a class="share-circle" data-action="weixin-share" data-toggle="tooltip" data-original-title="分享到微信">
      <i class="iconfont ic-wechat"></i>
    </a>
    <a class="share-circle" data-action="weibo-share" data-toggle="tooltip" href="javascript:void((function(s,d,e,r,l,p,t,z,c){var%20f='http://v.t.sina.com.cn/share/share.php?appkey=1881139527',u=z||d.location,p=['&amp;url=',e(u),'&amp;title=',e(t||d.title),'&amp;source=',e(r),'&amp;sourceUrl=',e(l),'&amp;content=',c||'gb2312','&amp;pic=',e(p||'')].join('');function%20a(){if(!window.open([f,p].join(''),'mb',['toolbar=0,status=0,resizable=1,width=440,height=430,left=',(s.width-440)/2,',top=',(s.height-430)/2].join('')))u.href=[f,p].join('');};if(/Firefox/.test(navigator.userAgent))setTimeout(a,0);else%20a();})(screen,document,encodeURIComponent,'','','', '推荐 刘开心_8a6c 的文章《CAM论文剖析(Learning Deep Features for Discriminative Localization)》( 分享自 @简书 )','https://www.jianshu.com/p/20f7c8a2bd5a?utm_campaign=maleskine&amp;utm_content=note&amp;utm_medium=reader_share&amp;utm_source=weibo','页面编码gb2312|utf-8默认gb2312'));" data-original-title="分享到微博">
      <i class="iconfont ic-weibo"></i>
    </a>
    <a class="share-circle" data-toggle="tooltip" id="longshare" target="_blank" data-original-title="" title="">
        <div class="qrcode" id="qrcode">
         <img src="//cdn2.jianshu.io/assets/web/download-index-side-qrcode-cb13fc9106a478795f8d10f9f632fccf.png" alt="Download index side qrcode">
         <p>下载app生成长微博图片</p>
         </div>
      <i class="iconfont ic-picture"></i>
    </a>
    <a class="share-circle more-share" tabindex="0" data-toggle="popover" data-placement="top" data-html="true" data-trigger="focus" href="javascript:void(0);" data-content="
      <ul class=&quot;share-list&quot;>
        <li><a href=&quot;javascript:void(function(){var d=document,e=encodeURIComponent,r='http://sns.qzone.qq.com/cgi-bin/qzshare/cgi_qzshare_onekey?url='+e('https://www.jianshu.com/p/20f7c8a2bd5a?utm_campaign=maleskine&amp;utm_content=note&amp;utm_medium=reader_share&amp;utm_source=qzone')+'&amp;title='+e('推荐 刘开心_8a6c 的文章《CAM论文剖析(Learning Deep Features for Discriminative Localization)》'),x=function(){if(!window.open(r,'qzone','toolbar=0,resizable=1,scrollbars=yes,status=1,width=600,height=600'))location.href=r};if(/Firefox/.test(navigator.userAgent)){setTimeout(x,0)}else{x()}})();&quot;><i class=&quot;social-icon-sprite social-icon-zone&quot;></i><span>分享到QQ空间</span></a></li>
        <li><a href=&quot;javascript:void(function(){var d=document,e=encodeURIComponent,r='https://twitter.com/share?url='+e('https://www.jianshu.com/p/20f7c8a2bd5a?utm_campaign=maleskine&amp;utm_content=note&amp;utm_medium=reader_share&amp;utm_source=twitter')+'&amp;text='+e('推荐 刘开心_8a6c 的文章《CAM论文剖析(Learning Deep Features for Discriminative Localization)》( 分享自 @jianshucom )')+'&amp;related='+e('jianshucom'),x=function(){if(!window.open(r,'twitter','toolbar=0,resizable=1,scrollbars=yes,status=1,width=600,height=600'))location.href=r};if(/Firefox/.test(navigator.userAgent)){setTimeout(x,0)}else{x()}})();&quot;><i class=&quot;social-icon-sprite social-icon-twitter&quot;></i><span>分享到Twitter</span></a></li>
        <li><a href=&quot;javascript:void(function(){var d=document,e=encodeURIComponent,r='https://www.facebook.com/dialog/share?app_id=483126645039390&amp;display=popup&amp;href=https://www.jianshu.com/p/20f7c8a2bd5a?utm_campaign=maleskine&amp;utm_content=note&amp;utm_medium=reader_share&amp;utm_source=facebook',x=function(){if(!window.open(r,'facebook','toolbar=0,resizable=1,scrollbars=yes,status=1,width=450,height=330'))location.href=r};if(/Firefox/.test(navigator.userAgent)){setTimeout(x,0)}else{x()}})();&quot;><i class=&quot;social-icon-sprite social-icon-facebook&quot;></i><span>分享到Facebook</span></a></li>
        <li><a href=&quot;javascript:void(function(){var d=document,e=encodeURIComponent,r='https://plus.google.com/share?url='+e('https://www.jianshu.com/p/20f7c8a2bd5a?utm_campaign=maleskine&amp;utm_content=note&amp;utm_medium=reader_share&amp;utm_source=google_plus'),x=function(){if(!window.open(r,'google_plus','toolbar=0,resizable=1,scrollbars=yes,status=1,width=450,height=330'))location.href=r};if(/Firefox/.test(navigator.userAgent)){setTimeout(x,0)}else{x()}})();&quot;><i class=&quot;social-icon-sprite social-icon-google&quot;></i><span>分享到Google+</span></a></li>
        <li><a href=&quot;javascript:void(function(){var d=document,e=encodeURIComponent,s1=window.getSelection,s2=d.getSelection,s3=d.selection,s=s1?s1():s2?s2():s3?s3.createRange().text:'',r='http://www.douban.com/recommend/?url='+e('https://www.jianshu.com/p/20f7c8a2bd5a?utm_campaign=maleskine&amp;utm_content=note&amp;utm_medium=reader_share&amp;utm_source=douban')+'&amp;title='+e('CAM论文剖析(Learning Deep Features for Discriminative Localization)')+'&amp;sel='+e(s)+'&amp;v=1',x=function(){if(!window.open(r,'douban','toolbar=0,resizable=1,scrollbars=yes,status=1,width=450,height=330'))location.href=r+'&amp;r=1'};if(/Firefox/.test(navigator.userAgent)){setTimeout(x,0)}else{x()}})()&quot;><i class=&quot;social-icon-sprite social-icon-douban&quot;></i><span>分享到豆瓣</span></a></li>
      </ul>
    " data-original-title="" title="">更多分享</a>
  </div>
</div>
  <a id="web-note-ad-1" target="_blank" href="/apps/redirect?utm_source=note-bottom-click"><img src="//cdn2.jianshu.io/assets/web/web-note-ad-1-c2e1746859dbf03abe49248893c9bea4.png" alt="Web note ad 1"></a>
<div><div id="comment-list" class="comment-list"><div><form class="new-comment"><a class="avatar"><img src="//cdn2.jianshu.io/assets/default_avatar/avatar_default-78d4d1f68984cd6d4379508dd94b4210.png"></a> <div class="sign-container"><a href="/sign_in?utm_source=desktop&amp;utm_medium=not-signed-in-comment-form" class="btn btn-sign">登录</a> <span>后发表评论</span></div></form> <!----></div> <!----> <div class="comments-placeholder" style="display: none;"><div class="author"><div class="avatar"></div> <div class="info"><div class="name"></div> <div class="meta"></div></div></div> <div class="text"></div> <div class="text animation-delay"></div> <div class="tool-group"><i class="iconfont ic-zan-active"></i><div class="zan"></div> <i class="iconfont ic-list-comments"></i><div class="zan"></div></div></div> <div id="normal-comment-list" class="normal-comment-list"><div><!----> <div><div class="top-title"><span>评论</span> <a class="close-btn" style="display: none;">关闭评论</a></div> <div class="no-comment"></div> <div class="text">
        智慧如你,不想<a href="/sign_in?utm_source=desktop&amp;utm_medium=not-signed-in-nocomments-text">发表一点想法</a>咩~
      </div></div> <!----> <div class="comments-placeholder" style="display: none;"><div class="author"><div class="avatar"></div> <div class="info"><div class="name"></div> <div class="meta"></div></div></div> <div class="text"></div> <div class="text animation-delay"></div> <div class="tool-group"><i class="iconfont ic-zan-active"></i><div class="zan"></div> <i class="iconfont ic-list-comments"></i><div class="zan"></div></div></div> </div></div> <!----> <div><!----></div></div></div>

猜你喜欢

转载自blog.csdn.net/Jason_mmt/article/details/82588119