转载，2019年3月，目前最强性能的人脸检测算法（Wider Face Dataset）

 
                                          <p></p><section><section><section><section><p style="text-indent: 0em;line-height: 2em;"><img src="http://img.blog.itpub.net/blog/2019/02/19/b614769a0f5dc6f6.jpeg?x-oss-process=style/bb"></p></section></section></section></section><p></p><p style="white-space: normal;"><span style="font-size: 12px;color: rgb(136, 136, 136);">转载自：计算机视觉战队</span></p><p style="white-space: normal;"><span style="font-size: 12px;color: rgb(136, 136, 136);">未经允许不得二次转载</span></p><p style="margin-bottom: 15px;text-indent: 0em;line-height: 2em;"><span style="text-indent: 2em;font-size: 15px;">今天我们不说计算机视觉基础知识，接下来说说<span style="color: rgb(26, 26, 26);font-family: -apple-system, BlinkMacSystemFont, 'Helvetica Neue', 'PingFang SC', 'Microsoft YaHei', 'Source Han Sans SC', 'Noto Sans CJK SC', 'WenQuanYi Micro Hei', sans-serif;font-size: 15px;text-align: start;text-indent: 30px;background-color: rgb(255, 255, 255);">AAAI2019一篇比较新颖的Paper，其是<span style="color: rgb(26, 26, 26);font-family: -apple-system, BlinkMacSystemFont, 'Helvetica Neue', 'PingFang SC', 'Microsoft YaHei', 'Source Han Sans SC', 'Noto Sans CJK SC', 'WenQuanYi Micro Hei', sans-serif;font-size: 15px;text-align: start;text-indent: 30px;background-color: rgb(255, 255, 255);">中科院自动化所和京东AI研究院联合的结果，</span></span>在</span><span style="text-indent: 2em;font-size: 15px;color: rgb(0, 128, 255);"><strong>Wider Face</strong></span><span style="text-indent: 2em;font-size: 15px;">数据集中达到了较高的水准，比arxiv2019_</span><strong><span style="text-indent: 2em;font-size: 15px;color: rgb(0, 128, 255);">VIM-FD</span></strong><span style="text-indent: 2em;font-size: 15px;">的更好一些。今天要说的就是“</span><strong><span style="text-indent: 2em;font-size: 15px;color: rgb(0, 128, 255);">Improved SRN</span></strong><span style="text-indent: 2em;font-size: 15px;">”，现在开始一起学习吧！</span></p><section><section><section><section><section><span style="color: rgb(9, 122, 95);font-size: 16px;"><strong>Improved SRN</strong></span></section></section></section></section></section><section><p style="margin-top: 15px;text-indent: 0em;line-height: 2em;"><span style="font-size: 15px;text-indent: 2em;">人脸检测作为计算机视觉中的一个长期存在的问题，由于其实际应用，近几十年来一直受到人们的关注。</span><br></p><p style="text-indent: 0em;line-height: 2em;"><span style="font-size: 15px;text-indent: 30px;">随着人脸检测基准数据集的广泛应用，近年来各种算法都取得了很大的进展。其中，Selective Refinement Network（</span><strong><span style="font-size: 15px;text-indent: 2em;color: rgb(0, 128, 255);">SRN</span></strong><span style="font-size: 15px;text-indent: 30px;">）人脸检测器有选择地将分类和回归操作引入到</span><strong><span style="font-size: 15px;text-indent: 30px;color: rgb(0, 128, 255);">anchor-based</span></strong><span style="font-size: 15px;text-indent: 30px;">的人脸检测器中，以减少假阳性同时提高定位精度。此外，它还设计了一个感受野增强块，以提供更多样化的感受野。</span></p><p style="margin-top: 5px;margin-bottom: 5px;text-align: center;text-indent: 0em;line-height: 2em;"><span style="font-size: 15px;text-indent: 30px;"><img src="http://img.blog.itpub.net/blog/2019/02/19/25a43a964da08f1d.jpeg?x-oss-process=style/bb"></span></p><p style="text-indent: 0em;line-height: 2em;"><span style="font-size: 15px;text-indent: 30px;">为了进一步提高</span><strong><span style="font-size: 15px;text-indent: 30px;color: rgb(0, 128, 255);">SRN</span></strong><span style="font-size: 15px;text-indent: 30px;">的性能，通过大量的实验，开发了现有的一些技术，包括新的数据增强策略、改进的</span><strong><span style="font-size: 15px;text-indent: 30px;color: rgb(0, 128, 255);">backbone network</span></strong><span style="font-size: 15px;text-indent: 30px;">、</span><strong><span style="font-size: 15px;text-indent: 30px;color: rgb(0, 128, 255);">MS COCO</span></strong><span style="font-size: 15px;text-indent: 30px;">预训练、解耦分类模块（</span><strong><span style="font-size: 15px;text-indent: 30px;color: rgb(0, 128, 255);">decoupled classification module</span></strong><span style="font-size: 15px;text-indent: 30px;">）、分割分支和压缩激励块（</span><strong><span style="font-size: 15px;text-indent: 30px;color: rgb(0, 128, 255);">Squeeze-and-Excitation block</span></strong><span style="font-size: 15px;text-indent: 30px;">）。</span></p><p style="text-indent: 0em;line-height: 2em;"><span style="font-size: 15px;text-indent: 30px;">其中，一些技术带来了性能改进，因此，将这些有用的技术结合在一起，提出了一种改进的</span><strong><span style="font-size: 15px;text-indent: 30px;color: rgb(0, 128, 255);">SRN</span></strong><span style="font-size: 15px;text-indent: 30px;">人脸检测器，并在广泛使用的人脸检测基准的人脸数据集上获得了最佳的性能。</span></p><p style="text-indent: 0em;line-height: 2em;"><span style="font-size: 15px;text-indent: 30px;">人脸检测其实比较简单，就是将图像输入算法框架中，最终返回输入图像中目标人脸的bounding box。</span></p><p style="text-align: center;margin-top: 5px;margin-bottom: 15px;text-indent: 0em;line-height: 2em;"><img src="http://img.blog.itpub.net/blog/2019/02/19/107404524cd10a4f.jpeg?x-oss-process=style/bb"></p><section><p style="text-align: left; margin-top: 10px; margin-bottom: 10px; text-indent: 0em; line-height: 2em;"><strong style="letter-spacing: 2px;"><span style="color: rgb(10, 149, 17);">Review of Baseline</span></strong><br></p></section><section><p style="text-indent: 0em;line-height: 2em;"><span style="font-size: 15px;">接下来，我们先简要回顾<span style="text-indent: 30px;">Selective Refinement Network（</span><strong style="text-indent: 34px;white-space: normal;"><span style="text-indent: 2em;color: rgb(0, 128, 255);">SRN</span></strong><span style="text-indent: 30px;">）</span>。如下图1所示，它包括选择性两步分类(<strong><span style="color: rgb(0, 128, 255);">STC</span></strong>)、选择性两步回归(</span><strong><span style="font-size: 15px;color: rgb(0, 128, 255);">STR</span></strong><span style="font-size: 15px;">)和感受野增强(</span><strong><span style="font-size: 15px;color: rgb(0, 128, 255);">RFE</span></strong><span style="font-size: 15px;">)，这三个模块的详细说明如下。</span></p></section><p style="text-align: center;margin-top: 5px;margin-bottom: 5px;text-indent: 0em;line-height: 2em;"><img src="http://img.blog.itpub.net/blog/2019/02/19/255f4b534de18688.jpeg?x-oss-process=style/bb"></p><p style="text-indent: 0em;line-height: 2em;"><span style="font-size: 15px;text-indent: 2em;"></span><span style="font-size: 15px;">图1&nbsp;<strong style="font-size: 15px;text-indent: 34px;white-space: normal;"><span style="text-indent: 2em;color: rgb(0, 128, 255);">SRN</span></strong>。它包括选择性两步分类(</span><strong><span style="font-size: 15px;color: rgb(0, 128, 255);">STC</span></strong><span style="font-size: 15px;">)、选择性两步回归(</span><strong><span style="font-size: 15px;color: rgb(0, 128, 255);">STR</span></strong><span style="font-size: 15px;">)和感受野增强(</span><strong><span style="font-size: 15px;color: rgb(0, 128, 255);">RFE</span></strong><span style="font-size: 15px;">)。</span></p><section><section><section><section><p><span style="color: rgb(255, 255, 255);font-size: 16px;">01</span></p></section></section></section><section><p><strong><span style="color: rgb(128, 202, 188);font-size: 16px;">S T C</span></strong></p></section></section><p style="text-indent: 0em;line-height: 2em;"><span style="font-size: 15px;text-indent: 2em;">对于单级检测器，大量的正负样本比的anchor（例如，大约有300k个<span style="font-size: 15px;text-indent: 30px;">anchor</span>，SRN中的正/负比约为0.006%）导致了相当多的假阳性。因此，它需要另一个阶段，如RPN过</span><span style="font-size: 15px;text-indent: 2em;">滤掉一些</span><span style="font-size: 15px;text-indent: 2em;">负样本。选择性两步分类是</span><span style="font-size: 15px;text-indent: 2em;">从RefineDet继承而来的，有效地拒绝了大量的负样本<span style="font-size: 15px;text-indent: 30px;">anchor</span>，缓解了类不平衡问题。</span></p><p style="text-indent: 0em;line-height: 2em;"><span style="font-size: 15px;text-indent: 2em;">STC作用于浅层Feature M</span><span style="font-size: 15px;text-indent: 2em;">ap上</span><span style="font-size: 15px;text-indent: 2em;">，来过滤掉大部分比骄傲容易区分的负样本，来减少搜索空间。</span></p><section><p style="text-align: left; margin-top: 10px; margin-bottom: 10px; text-indent: 0em; line-height: 2em;"><strong><span style="color: rgb(128, 202, 188);">S T R</span></strong></p></section><section><p style="text-indent: 0em;line-height: 2em;"><span style="font-size: 15px;text-indent: 2em;">像Cascade RCNN这样的多步回归可以提高Bounding Box位置的准确性，特别是在一些具有挑战性的场景中，例如MS COCO风格的评估指标。然而，将多步回归应用于人脸检测任务中，如果不仔细考虑，可能会影响检测结果。</span></p><p style="text-indent: 0em;line-height: 2em;"><span style="font-size: 15px;text-indent: 2em;">STR作用于高层Feature Map上</span><span style="font-size: 15px;text-indent: 2em;">，用来粗略调整anchor的尺度、位置（类似于RefineDet中ARM的回归任务），并进一步</span><span style="font-size: 15px;text-indent: 2em;">为高层Feature Map上的回归器提供refined后的anchor位置初始化</span><span style="font-size: 15px;text-indent: 2em;">（类似于RefineDet中ODM的回归任务）。</span></p></section><section><section><section><section><p><span style="color: rgb(255, 255, 255);font-size: 16px;">03</span><strong><span style="color: rgb(128, 202, 188);">R F E</span></strong></p></section></section></section></section><p style="margin-bottom: 10px;text-indent: 0em;line-height: 2em;"><span style="font-size: 15px;text-indent: 2em;">当前网络通常都具有<span class="fontstyle0">square感受野</span>，这影响了对不同高宽比目标的检测。为了解决这个问题，</span><strong><span style="font-size: 15px;text-indent: 2em;color: rgb(0, 128, 255);">SRN</span></strong><span style="font-size: 15px;text-indent: 2em;">设计了一个感受野增强（RFE），在预测类和位置之前，将特征的感受野多样化，这有助于在某些极端姿势中很好地捕捉到人脸。</span></p></section><p></p>
                                                        <p class="translate">
                                                来自 “ ITPUB博客 ” ，链接：http://blog.itpub.net/31555081/viewspace-2636352/，如需转载，请注明出处，否则将追究法律责任。
                                        </p>

            </div>
转载，2019年3月，目前最强性能的人脸检测算法（Wider Face Dataset）

猜你喜欢