引言：刚开始看深度学习的论文，会隐遇到有一些不常见的科研论文术语（例如 state-of-the-art）感到困惑，看得多了才能慢慢理解。

下面我列举的一些论文术语以及评估指标，我刚开始遇到是不太懂的。我结合网络和自己的理解给出合理解释，供参考~

附：

一、论文

1.A-G

Ablation study
直接翻译为消融实验。实际上是一种对比实验，控制变量法。研究哪些部分对网络性能产生，选择一些可能的因素，分别对比去除和加上的网络性能。

An ablation study typically refers to removing some “feature” of the model or algorithm, and seeing how that affects performance.

Anchor
字面意思是锚，CV 中是锚点或锚框，目标检测中常出现的 anchor box是锚框，表示固定的参考框。参考：目标检测中的Anchor

Baseline
翻译为基线，是研究的一个参照物。你以 baseline 为基础，在上面加了一些东西构成新模型，就要看看你的模型比 baseline 的性能提高了多少，你的改进有没有效果。（在 baseline 的基础上纵向比较）

a line serving as a basis

Benchmark
翻译为基准，在测试中可以比较计算机或程序的性能、可靠性的既定参考点，是评价算法好坏的一种规则和标准。（与比较牛的模型 benchmark 横向比较）

an established point of reference against which computers or programs can be measured in tests comparing their performance, reliability, etc.

Backbone
可翻译为骨干（eg. backbone network）

the most important part of something, providing support for everything else

Batch
可翻译为批。模型训练的一次迭代（即一次梯度更新）中使用的样本集。（一个批次的样本数是 batch size）拓展：epoch,batch,iteration

a group of jobs, data, or programs treated as a unit for computer processing

End-to-end
端到端，输入原始数据，输出得到想要的结果，使用一个单独的网络模型训练，不用考虑多阶段处理。（eg. end-to-end learning, end-to-end network）

End-to-end describes a process that takes a system or service from beginning to end and delivers a complete functional solution, usually without needing to obtain anything from a third party.

Ensemble model
可翻译为融合模型，是一种提高模型性能的方法。

multiple diverse models are created to predict an outcome, either by using many different modeling algorithms or using different training data sets. The ensemble model then aggregates the prediction of each base model and results in once final prediction for the unseen data.

Epoch
可翻译为代，使用训练集额全部数据对模型进行一次完整训练是一个 epoch。拓展：epoch,batch,iteration

a particular period of time marked by distinctive features, events

Fine tune
翻译为微调（eg. fine tune a network）

to make very small changes to something in order to make it work as well as possible.

Feature map
可翻译为特征图，指 CNN 中卷积层的输出。

The feature map is the output of one filter applied to the previous layer.

Ground truth
可理解为设定的一个正确的真实值。图像分类中，标签为 ground truth。

In machine learning, the term “ground truth” refers to the accuracy of the training set’s classification for supervised learning techniques.

2.H-N

Hyperparameter
超参数。模型参数是根据数据自动估算的，例如权重。但模型超参数是手动设置的，并且在过程中用于帮助估计模型参数，例如学习率。

3.O-T

Pipeline
直译是流水线，论文中会称一些框架为 pipeline，说明这个框架分为几个阶段。

State-of-the-art (SOTA)
可翻译为最先进的，最高水平。

the latest and most sophisticated or advanced stage of a technology, art, or science.

Region of interest(ROI)
可翻译为感兴趣区域，从图像中选择的一片图像区域，这是你要研究的目标。

samples within a data set identified for a particular purpose

Receptive field(RF)
翻译为感受野（感受域），指的是神经网络中神经元“看到的”输入区域。CNN 的 feature map上某个元素的计算受输入图像上某个区域的影响，这个区域即该元素的感受野。

The receptive field is defined as the region in the input space that a particular CNN’s feature is looking at.

4.U-Z

Without bells and whistles
没有花里胡哨的方法（不添加不必要，冗余的东西）

It refers to non-essential but often engaging features added to a piece of technical equipment or a computer program to make it seem more superficially attractive without enhancing its main function

二、指标

1. CV

FPS
Frames Per Second 的缩写，翻译为“每秒传输帧数”，也就是“每秒中填充图像的帧数(帧/秒)”

AP(Average Percision)
平均精度，指的是所有图片内的具体某一类的PR曲线下的面积。

mAP(Mean Average Precision)
均值平均精度，指的是所有图片内的所有类别的AP的平均值，目前，在目标检测类里用的最多的是 mAP。

IOU(Intersection over Union)
交并比，指的是 ground truth bbox与 predict bbox 的交集面积占两者并集面积的一个比率，IoU值越大说明预测检测框的模型算法性能越好.

Top-1 error
预测输出的概率最高的类别，是否和人工标注的类别一致，如果不一致，此时的概率。常用于 ImageNet 数据集。

Top-5 error
预测输出的概率最高的前5个类别当中，有没有和人工标注的类别不一致，当5个都不一致时错误次数占总次数的概率。

2. ML

Accuracy
准确率，正确预测的样本数占总预测样本数的比值

Precision
精确率，正确预测的正样本数占所有预测为正样本的数量的比值

Recall
召回率，正确预测的正样本数占真实正样本总数的比值

F1-Score
F1分数，是统计学中用来衡量二分类模型精确度的一种指标，它被定义为精确率和召回率的调和平均数

catOneTwo

原创文章 44 获赞 36 访问量 2万+

关注私信

面向小白的深度学习论文术语（持续更新）

Index