Why frustum-pointnets performance drops when using ground truth


2020-03-04 09:51:16

Official TensorFlow code: https://github.com/charlesq34/frustum-pointnets
Stars are welcome on my PyTorch reimplementation: https://github.com/simon3dv/frustum_pointnets_pytorch

Discovering the problem

With RGB 2D boxes: car_detection_3d AP: 85.09, 72.11, 64.25
With gt 2D boxes: car_detection_3d AP: 74.14, 68.03, 62.82
With gt 2D boxes the 2D AP is 100, 100, 100, yet the 3D AP drops sharply.

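Cause

Computing AP the PASCAL VOC way relies on the score of each prediction. If the score is fixed at 1 or is random, precision cannot be high at low recall, because at low recall precision is computed from only the few predictions with the highest scores.
However, f-pointnets does not output a good score.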
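
To make the effect concrete, here is a minimal sketch of 11-point interpolated AP in the PASCAL VOC 2007 style, run on a toy set of detections. The function name `voc_ap_11pt` and the toy data are assumptions for illustration, not code from either repository, and the KITTI evaluation differs in its details. The same ten detections are scored twice: once with scores that separate true from false positives, and once with a constant score, where the ranking falls back to an arbitrary order.

```python
import numpy as np

def voc_ap_11pt(scores, is_tp, num_gt):
    """11-point interpolated AP (PASCAL VOC 2007 style) for one class."""
    # Rank detections by descending score; a stable sort keeps ties in input order.
    order = np.argsort(-np.asarray(scores, dtype=float), kind="stable")
    tp = np.asarray(is_tp, dtype=float)[order]
    cum_tp = np.cumsum(tp)
    cum_fp = np.cumsum(1.0 - tp)
    recall = cum_tp / num_gt
    precision = cum_tp / (cum_tp + cum_fp)
    ap = 0.0
    for t in np.linspace(0.0, 1.0, 11):
        reached = recall >= t
        # Interpolated precision: best precision at any recall >= t.
        ap += precision[reached].max() if reached.any() else 0.0
    return ap / 11.0

# Toy data (hypothetical): 10 detections, 6 true positives, 6 gt objects.
# The false positives happen to come first in the list.
is_tp = np.array([0, 0, 0, 0, 1, 1, 1, 1, 1, 1])
num_gt = 6

informative = np.where(is_tp == 1, 0.9, 0.1)  # scores separate TP from FP
constant = np.ones(len(is_tp))                # every detection scored 1

ap_informative = voc_ap_11pt(informative, is_tp, num_gt)
ap_constant = voc_ap_11pt(constant, is_tp, num_gt)
print(ap_informative, ap_constant)
```

The set of boxes is identical in both runs; only the ranking changes. With constant scores the false positives are ranked ahead of the true positives in this toy ordering, so precision stays low across the whole recall range and AP falls even though every gt object is still found.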

In test.py:
With RGB, the score is computed as score_list.append(batch_rgb_prob[j]), i.e. the 2D detector's score is used.
With gt, the score is computed as:

mask_mean_prob = np.sum(batch_seg_prob * batch_seg_mask, 1) # B,
mask_mean_prob = mask_mean_prob / (np.sum(batch_seg_mask, 1))
batch_scores = np.log(mask_mean_prob) + np.log(heading_prob) + np.log(size_prob)
score_list.append(batch_scores[j])
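
The gt-mode score above can be tried on toy inputs with a self-contained sketch like the one below. The function name `frustum_log_score`, the array shapes (`(B, N)` for the segmentation arrays, `(B,)` for the heading and size probabilities), and the `eps` and empty-mask guards are assumptions added for illustration; they are not part of test.py.

```python
import numpy as np

def frustum_log_score(seg_prob, seg_mask, heading_prob, size_prob, eps=1e-8):
    """Sum of log-probabilities, following the gt-mode score excerpt.

    seg_prob:     (B, N) per-point foreground probability (assumed shape)
    seg_mask:     (B, N) 1 for points predicted as foreground, else 0
    heading_prob: (B,)   probability of the chosen heading bin
    size_prob:    (B,)   probability of the chosen size class
    """
    seg_prob = np.asarray(seg_prob, dtype=float)
    seg_mask = np.asarray(seg_mask, dtype=float)
    # Guard (an addition here): avoid dividing by zero when no point is foreground.
    n_fg = np.maximum(seg_mask.sum(axis=1), 1.0)
    mask_mean_prob = (seg_prob * seg_mask).sum(axis=1) / n_fg  # B,
    # eps (also an addition) keeps log() finite if a probability is exactly 0.
    return (np.log(mask_mean_prob + eps)
            + np.log(np.asarray(heading_prob, dtype=float) + eps)
            + np.log(np.asarray(size_prob, dtype=float) + eps))

# Toy batch of 2 frustums with 3 points each (hypothetical values).
seg_prob = np.array([[0.9, 0.8, 0.1],
                     [0.6, 0.2, 0.3]])
seg_mask = np.array([[1, 1, 0],
                     [1, 0, 0]])
heading_prob = np.array([0.5, 0.9])
size_prob = np.array([0.5, 0.9])

batch_scores = frustum_log_score(seg_prob, seg_mask, heading_prob, size_prob)
print(batch_scores)
```

Each score is the log of a product of three probabilities, so it is never positive, and how well it ranks detections depends on how well those three network outputs track the actual quality of the 3D box.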

Verification

If the RGB run also uses f-pointnet's way of computing the score, AP drops severely as well: 3D AP: 76.976532 69.313423 63.041763

Solution

The simplest fix is to change the gt score computation to:

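mask_max_prob = np.max(batch_seg_prob * batch_seg_mask, 1) # B, highest masked segmentation probability per sample
batch_scores = mask_max_prob # B, one score per sample (np.max(mask_max_prob) would collapse the batch to a single scalar)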

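This gives AP: car_detection_3d AP: 85.082458 74.658356 67.191765
If RGB also computes the score this way, AP is car_detection_3d AP: 81.558723 70.068161 63.369583, still a few points worse than using the 2D score, so this approach is not the best either.
My guess is that the best approach is still to train an additional classification branch; f-convnet, for example, does not have this problem.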
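
The max-probability score from the simple fix can be sketched in isolation as follows, assuming the intent is one score per frustum. The function name `max_seg_prob_score`, the `(B, N)` shapes, and the toy values are assumptions for illustration, not code from the repositories.

```python
import numpy as np

def max_seg_prob_score(seg_prob, seg_mask):
    """Per-frustum score: highest foreground probability among masked points.

    seg_prob: (B, N) per-point foreground probability (assumed shape)
    seg_mask: (B, N) 1 for points predicted as foreground, else 0
    """
    seg_prob = np.asarray(seg_prob, dtype=float)
    seg_mask = np.asarray(seg_mask, dtype=float)
    # Reduce over the point axis only, so the result keeps shape (B,).
    return np.max(seg_prob * seg_mask, axis=1)

# Toy batch of 2 frustums with 3 points each (hypothetical values).
seg_prob = np.array([[0.9, 0.8, 0.1],
                     [0.6, 0.2, 0.3]])
seg_mask = np.array([[1, 1, 0],
                     [1, 0, 0]])

batch_scores = max_seg_prob_score(seg_prob, seg_mask)
print(batch_scores)
```

Reducing over axis 1 only keeps a separate score for every sample in the batch, which is what the later `score_list.append(batch_scores[j])` indexing needs.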

Reposted from www.cnblogs.com/simingfan/p/12398432.html
