图像分割 | U-Net - 代码天地

kaggle上的常客（医学图像领域比较常用，也是car分割的冠军模型）：
https://www.kaggle.com/c/data-science-bowl-2018/discussion/54426
U-Net: Convolutional Networks for Biomedical Image Segmentation

这里写图片描述

VALID padding not SAME padding（因为边界用了镜像处理）
matched lower and upper features after cropping lower feature（The cropping is necessary due to the loss of border pixels in every convolution：注意看图1中的虚线框，就是与右分支对应的位置去crop左分支）
weighted cross entropy loss to separate instances
elastic deformations

边界的镜像处理：
这里写图片描述

加权处理：类别平衡和给两个细胞之间的空隙更大的权重
这里写图片描述
可见对于两个细胞之间的缝隙，它们获得的权重值很大，因此会将这部分分成背景（这样做有效的前提是训练数据就没有重叠在一起？）。

语义分割相关的design choices（后面再看看DeepLab系列）：

“DeepLab’s Four Alignment Rules”:
(1) Use of odd-sized kernels in all convolution and pooling ops.
(2) Use of SAME boundary conditions in all convolution and pooling ops.
(3) Use align_corners=True when upsampling feature maps with bilinear interpolation.
(4) Use of inputs with height/width equal to a multiple of the output_stride, plus one (for example, when the CNN output stride is 8, use height or width equal to 8 * n + 1, for some n, e.g., image HxW set to 321x513).

相关讨论：
Unet神经网络为什么会在医学图像分割表现好？
语义分割(semantic segmentation) 常用神经网络介绍对比-FCN SegNet U-net DeconvNet
（这几个网络其实大同小异，要点就是encode & decode，然后底层信息和高层信息融合，做upsampling的时候可以有unpooling或转置卷积等不同的做法）

图像分割 | U-Net

猜你喜欢