1、论文总述

在这里插入图片描述
【注】：上图的来源：https://blog.csdn.net/leviopku/article/details/82660381

这篇论文不能说是一篇paper，它是一篇技术报告，文笔很随意，还特别吐槽了coco数据集的mAP的metric，YOLOv3借鉴了resnet和FPN的思想，设计了新的backbone：Darknet53
，还用上多尺度的feature map，每个cell 3个prior（和anchor机制还是不一样的，v2用的其实不是anchor机制，只是宽高有个先验知识，具体看这个
YOLO系列之yolo v2），并且分类时没有用softmax损失函数，而是用了二分类交叉熵损失函数，这个可以用作多标签分类，比如某目标既属于women又属于person。

重点： 这里推荐一篇对YOLO系列讲解的特别清晰的博客，yolo系列之yolo v3【深度解析】，看这一篇就够了，我写这个也只是把看论文时自己的特有的标注记录一下。

2、Bounding Box Prediction的逻辑回归

在这里插入图片描述

YOLOv3 predicts an objectness score for each bounding
box using logistic regression. This should be 1 if the bounding box prior overlaps a ground truth object by more than
any other bounding box prior. If the bounding box prior
is not the best but does overlap a ground truth object by
more than some threshold we ignore the prediction, following [17]. We use the threshold of .5. Unlike [17] our system
only assigns one bounding box prior for each ground truth
object. If a bounding box prior is not assigned to a ground
truth object it incurs no loss for coordinate or class predictions, only objectness.

这个对每个box的objectness预测就相当于RPN网络在筛选anchors。logistic regression就是二值交叉熵，分类时也是用的这个二值交叉熵损失函数，只有x y w h是均方差损失函数。