Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks 论文阅读 - 代码天地

Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks 论文阅读

其他 2018-12-30 10:51:14 阅读次数: 0

一、概述

本文提出了一个多任务的人脸检测模型，可以同时进行人脸检测和人脸特征点提取。这个框架主要由三个CNN级联的方式实现。

stage1：通过一个浅的CNN来产生一些候选框
stage2：通过一个较复杂的CNN，对候选框进一步删选得到更精细的区域
stage3：通过一个强大的CNN，对结果进一步处理，输出人脸边框和5个特征点位置

二、实现细节

具体的实现过程如上图所示，总的来说分为四个部分

给定一张图片，首先更改图片的大小以建立图像金字塔，这作为模型的数据输入
通过一个全卷积网络(P-Net)，生成候选框和bounding box regression vectors，使用bounding box regression vectors校准这些候选框，使用NMS合并重叠候选框
上层所有的候选框会被传给当前网络(R-Net)，这会进一步筛选，再使用bounding box regression vectors和NMS
和上面类似，使用O-Net输出最终的人脸框和特征点位置

CNN结构：

三、训练

算法需要实现三个任务，分别是人脸与非人脸分类，bounding box 回归，人脸特征点定位

1.人脸与非人脸分类

$L_i^{det}=-(y_i^{det}log(p_i)+(1-y_i^{det})(1-log(p_i)))$

类似于交叉熵损失函数，这里的y表示ground_truth label，p表示网络结果输出是人脸的概率。

2.bounding box回归

$L_i{box}=\left \| \tilde{y}_i^{box}-y_i^{box} \right \|}_2^2$

这里的 $y_i^{box}\epsilon R^4$ ，标志着左上角点的坐标，height和width

3.人脸特征点定位

$L_i^{landmark}=\left \| \tilde{y}_i^{landmark}-y_i^{landmark} \right \|_2^2$

这里的 $y_i^{landmark}\epsilon R^{10}$

4.多目标训练

$min\sum{_{i=1}^N} \sum{_{j\in {\{det,box,landmark\}}}}\alpha_j\beta_i^jL_i^j$

这里的 $\alpha_j$ 表示任务的重要性， $\beta_i^j$ 样本标签， L_i^j 代表上述三个损失函数。

这里在P-Net和R-Net使用了 $\alpha_{det}=1,\alpha_{box}=0.5,\alpha_{landmark}=0.5$ ，在O-Net使用了 $\alpha_{det}=1,\alpha_{box}=0.5,\alpha_{landmark}=1$

这里数据集分类以下四个类别

Positive
Negative
Part faces
Landmark face

其中Negative和Positive用于人脸分类，Positive和Part faces用于bounding box回归，Landmark用于特征点定位。

猜你喜欢

转载自blog.csdn.net/adorkable_thief/article/details/85270179

Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks 论文阅读

论文解读——Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks（一）论文解读——Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks（一）

Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks（MTCNN）论文和代码解读

论文解读——Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks（二）

论文解读——Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks（一）

论文笔记1.3——Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks

论文笔记1.2——Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks

论文笔记1.1——Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks

人脸关键点：MTCNN-Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks

Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks

Joint Face Detection and Alignment Using Multitask

CenterFace: Joint Face Detection and Alignment Using Face as Point

mtcnn(Multi-task Cascaded Convolutional Networks)理解(一）－－－－理论理解

Automatic Brain Tumor Segmentation using Cascaded Anisotropic Convolutional Neural Networks

《Deep Alignment Network: A convolutional neural network for robust face alignment》论文阅读

【论文阅读】【多传感器融合】LIDAR-Camera Fusion for Road Detection Using Fully Convolutional Neural Networks

论文阅读：Joint 3D Face Reconstruction and Dense Alignment with Position Map Regression Network

MTCNN（Multi-task convolutional neural networks）人脸对齐

MTCNN（Multi-task convolutional neural networks）人脸检测

阅读心得：GSTD:Joint Object Detection and Multi-Object Tracking with Graph Neural Networks

论文笔记：OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks

学习---论文笔记：OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks

【论文阅读笔记】Automatic Liver and Lesion Segmentation in CT Using Cascaded Fully Convolutional Neural Net

Fast Face-swap Using Convolutional Neural Networks

肝脏分割 Using Cascaded Fully Convolutional Neural Networks and 3D Conditional Random Fields

OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks

【OverFeat】《OverFeat：Integrated Recognition, Localization and Detection using Convolutional Networks》

《Joint 3D Face Reconstruction and Dense Alignment with Position Map Regression Network》论文学习笔记

CPU Real-time Face Detection and Alignment-68 using MTCNN

【视频异常检测-论文阅读】Anomaly Detection in Video via Self-Supervised and Multi-Task Learning

今日推荐

《美国对全球网络空间安全与发展的威胁和破坏》报告发布

火速冲上 GitHub 热榜 —— 开源编程语言、框架哪有这么可爱？

北京人形机器人创新中心发布全球首个纯电驱拟人奔跑的全尺寸人形机器人“天工”

LFOSSA 源来如此公开课 | 掌握云原生未来：CNCF 认证全面攻略与备考秘籍

周排行

循环神经网络（rnn）讲解

Tigao教程四：单独的关节运动

金蝶K3WISE15.0-注册套打教程

如何在Mac上配置Kubernetes

Android应用结束自身进程的方法

SpringMVC学习十三拦截器栈

中国驻洛杉矶总领馆举行新春招待会

HttpClient get post 发送

11 - three.js 笔记 - 绘制三维字体模型

Mysql递归获取某个父节点下面的所有子节点和子节点上的所有父节点

每日归档

更多

2024-05-01(4)

2024-04-30(1)

2024-04-29(40)

2024-04-28(0)

2024-04-27(56)

2024-04-26(39)

2024-04-25(22)

2024-04-24(36)

2024-04-23(26)

2024-04-22(39)