[Paper Notes] Smooth-AP: Smoothing the Path Towards Large-Scale Image Retrieval

title: Smooth-AP: Smoothing the Path Towards Large-Scale Image Retrieval
link: https://arxiv.org/abs/2007.12163
Presentation: Smooth-AP: Smoothing the Path Towards Large-Scale Image Retrieval, ECCV 2020
code: https://github.com/Andrew-Brown1/Smooth_AP
author: VGG

1. Problem / goal:

Unlike previous loss functions based on metric learning, the authors propose a loss function that directly optimises the ranking. The chosen optimisation target is Average Precision (AP), but AP is non-differentiable, so the paper proposes Smooth-AP.

2. How it is solved:

In the computation of the AP estimate, the non-differentiable part (the indicator function over rank differences) is replaced with a sigmoid function.
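A minimal PyTorch sketch of this idea follows (my own illustration, not the official code from the linked repo); the function name `smooth_ap_loss` and the temperature default are placeholders. The indicator over the score-difference matrix is replaced with a temperature-scaled sigmoid, which makes the ranks, and hence the AP estimate, differentiable.

```python
import torch

def smooth_ap_loss(scores: torch.Tensor, labels: torch.Tensor, tau: float = 0.01) -> torch.Tensor:
    """Differentiable AP estimate for a single query.

    scores: (N,) similarities between the query and N retrieval-set items.
    labels: (N,) 1 for positives, 0 for negatives.
    tau:    sigmoid temperature; smaller tau gives a tighter approximation of true AP.
    """
    # Difference matrix D[i, j] = s_j - s_i; the rank of item i counts how many j score higher.
    diff = scores.unsqueeze(0) - scores.unsqueeze(1)                   # (N, N)
    # Replace the non-differentiable indicator 1{D > 0} with a sigmoid.
    sig = torch.sigmoid(diff / tau)
    # Zero the diagonal so an item never counts against itself.
    sig = sig * (1 - torch.eye(scores.numel(), device=scores.device))

    pos = labels.float()
    rank_all = 1 + sig.sum(dim=1)                                      # rank within the whole set
    rank_pos = 1 + (sig * pos.unsqueeze(0)).sum(dim=1)                 # rank within the positives

    # AP averaged over the positive items; the training loss is 1 - AP.
    ap = (pos * rank_pos / rank_all).sum() / pos.sum().clamp(min=1)
    return 1 - ap
```

In a retrieval mini-batch, one such term per query can be averaged, matching the paper's loss of one minus the mean AP over the queries in the batch.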

3. Results:

Experiments are run on Stanford Online Products, VehicleID, INaturalist, VGGFace2 and IJB-C. On the face retrieval task, where systems are already deployed commercially, AP improves by roughly 2-4%; the effect on recall is less pronounced, with some gains and some drops.

4. Formulas:

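The original post embedded the formulas as images, which are missing here. As defined in the paper, AP for a query is written through a ranking function over a score-difference matrix ($S_P$ denotes the positive set and $S_\Omega$ the whole retrieval set):

$$
AP_q = \frac{1}{|S_P|}\sum_{i \in S_P}\frac{\mathcal{R}(i, S_P)}{\mathcal{R}(i, S_\Omega)},\qquad
\mathcal{R}(i, S) = 1 + \sum_{j \in S,\, j \neq i}\mathbb{1}\{D_{ij} > 0\},\qquad
D_{ij} = s_j - s_i,
$$

where $s_i$ is the similarity between the query and instance $i$. The indicator $\mathbb{1}\{\cdot\}$ over $D_{ij}$ is the non-differentiable part.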

4.1 Smooth AP

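The Smooth-AP formula image is likewise missing. In the paper, the indicator is replaced by a sigmoid with temperature $\tau$, $\mathcal{G}(x;\tau) = \frac{1}{1 + e^{-x/\tau}}$, giving the differentiable estimate

$$
AP_q \approx \frac{1}{|S_P|}\sum_{i \in S_P}
\frac{1 + \sum_{j \in S_P,\, j \neq i}\mathcal{G}(D_{ij};\tau)}
     {1 + \sum_{j \in S_\Omega,\, j \neq i}\mathcal{G}(D_{ij};\tau)},
\qquad
\mathcal{L}_{AP} = \frac{1}{m}\sum_{k=1}^{m}\bigl(1 - AP_k\bigr),
$$

with the loss averaged over the $m$ queries in a mini-batch.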

4.2 Three observations:

  • First, the smaller the smoothing parameter, the closer the AP estimate is to the true AP, while a larger smoothing parameter gives a larger operating region, namely the area under the derivative curve in Figure 2 of the paper, and can therefore provide more gradient information (I have not fully understood this point; the original passage is quoted below, and a small sketch of the effect of τ follows this list).

    Smoothing parameter τ governs the temperature of the sigmoid that replaces the Indicator function 1{·}. It defines an operating region, where terms of the difference matrix are given a gradient by the Smooth-AP loss. If the terms are mis-ranked, Smooth-AP will attempt to shift them to the correct order. Specifically, a small value of τ results in a small operating region (Figure 2 (b) – note the small region with gradient seen in the sigmoid derivative), and a tighter approximation of true AP. The strong acceleration in gradient around the zero point (Figure 2 (b)-(c) second row) is essential to replicating the desired qualities of AP, as it encourages the shifting of instances in the embedding space that result in a change of rank (and hence change in AP), rather than shifting instances by some large distance but not changing the rank. A large value of τ offers a large operating region, however, at the cost of a looser approximation to AP due to its divergence from the indicator function.

  • Second, triplet loss behaves more like a distance-metric loss than a loss that optimises the ranking.

  • Third, compared with other AP-optimisation methods such as FastAP and Blackbox AP, this method is simpler and approximates AP more tightly. Moreover, those two methods, like triplet loss, arguably behave more like metric losses.
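To make the first point concrete, the short sketch below (my own, not from the paper or the repo) compares the sigmoid relaxation with the true indicator for two temperatures: a small τ approximates the indicator closely but only produces a usable gradient in a narrow band of score differences around zero, while a large τ spreads gradient over a wider operating region at the price of a looser approximation to AP.

```python
import numpy as np

def sigmoid(x, tau):
    # Sigmoid relaxation G(x; tau) of the indicator 1{x > 0}.
    return 1.0 / (1.0 + np.exp(-x / tau))

# Score differences D_ij on a typical cosine-similarity scale.
d = np.linspace(-0.5, 0.5, 11)
indicator = (d > 0).astype(float)

for tau in (0.01, 0.5):
    g = sigmoid(d, tau)
    grad = g * (1 - g) / tau                 # derivative of the sigmoid w.r.t. D_ij
    approx_err = np.abs(g - indicator).mean()
    active = (grad > 0.1).mean()             # fraction of differences receiving noticeable gradient
    print(f"tau={tau}: mean |G - indicator| = {approx_err:.3f}, "
          f"fraction with gradient > 0.1 = {active:.2f}")
```

With τ = 0.01 the approximation error is small but almost no score differences fall inside the gradient region; with τ = 0.5 most differences receive gradient but the estimate diverges further from true AP, which is the trade-off the quoted passage describes.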

Reposted from blog.csdn.net/weixin_41521681/article/details/111804538