2021计算机视觉-包揽所有前沿论文源码 -上半年

大家是否遇到过这种情况,就是在工作或者学习的时候,想去找一些方向的网络,但是呢,尴尬的是,老旧的网络里不想要,前沿的网络又不知道有哪些。为了解决大家的这个困扰,本人决定收集2021年上半年大部分前沿的网络相关链接,之后我会( 文末附带 \color{blue}{文末附带} 文末附带 公众号 − \color{blue}{公众号 -} 公众号 海量资源。 \color{blue}{ 海量资源}。 海量资源):

每周一更新一次(下面是我多年年收集的链接地址) \color{red}{每周一更新一次(下面是我多年年收集的链接地址) } 每周一更新一次(下面是我多年年收集的链接地址)
计算机视觉-包揽所有前沿论文源码

有兴趣的朋友可以加微信:17575010159 相互讨论技术。若是帮助到了你什么,一定要记得点赞!因为这是对我最大的鼓励!

视觉工作项目-为后来的你,提供一份帮助!
上面这个链接是我所有工作项目的详细解。 \color{red}{上面这个链接是我所有工作项目的详细解。} 上面这个链接是我所有工作项目的详细解。

 

文章分类

(01)AAAI 2021 | 腾讯优图11篇论文入选,涵盖动作识别、人群密度估计、人脸安全等领域
(02)重磅!网易伏羲9篇论文入选AI顶会AAAI 2021
(03)CVPR2020 最全整理:论文汇总 / 代码 / 项目 / 论文解读(更新中)【计算机视觉】
(04)CVPR、ECCV 2020 两大会议论文分类索引
(05)人体姿态估计、识别与生成最新技术一览
(06)一文概览 CVPR2021 最新18篇 Oral 论文
(07)WACV 2021 论文大盘点-GAN 篇与行人监控篇
(08)近期必看的视觉综述,含 GAN、Transformer、人脸超分辨、遥感等
(09)CVPR2021 最全整理:论文汇总 / 代码 / 项目 / 论文解读(更新中)【计算机视觉】
(10)重磅!悉尼科大ReLER实验室13篇论文入选CVPR 2021
(21)WACV 2021 论文大盘点 目标检测与图像分割篇(持续更新)
(22)WACV 2021 论文大盘点-GAN 篇与行人监控篇
(23)近期必看的视觉综述,含图像检索、目标检测、人脸关键点检测、医学图像分割、遥感、模型优化等
(24)WACV 2021 Paper Inventory - Human Action Detection and Recognition & Image and Video Retrieval
(25) AAAI 2021 | Summary of excellent papers from Microsoft Research Asia!
(26) An overview of the latest 18 Oral papers of CVPR2021
(27) The most comprehensive arrangement of CVPR2021: paper summary / code / project / paper interpretation (updating) [computer vision]
(28) CVPR2021 latest collection of papers received! Summary of 100+ papers in 22 directions|Continuous update
(29) Recommend several must-read visual reviews in the near future, including GAN, Transformer , face super-resolution, remote sensing, etc. (
30) Recommend several recent must-read visual reviews, including image retrieval, target detection, face key point detection, medical image segmentation, remote sensing, model optimization, etc. ! Continuously updating! (33) https://github.com/52CV/CVPR-2021-Papers


(34)CVPR2021中的目标检测和语义分割论文汇总
(35)一文概览 CVPR2021 最新18篇 Oral 论文
(36)CVPR 2021 | 腾讯AI Lab入选论文解读
(37)顶会论文分类汇总,包含WACV21/CVPR19、20/ECCV20(附下载)
(38)2021 最新CV综述分类汇总(持续更新)
(39)CVPR 2021 论文/代码分类汇总!持续更新中!
(40)CVPR 2021 速览 | 旷视研究院22篇入选学术成果盘点
(41)一文概览 CVPR2021 最新18篇 Oral 论文
(42)CVPR 2021放榜,腾讯优图20篇论文都在这里了!
(43)CVPR 二十年,影响力最大的10篇论文!
(44)CVPR 2021公布最佳论文候选!华人占据半壁江山,何恺明、沈春华等人上榜
(45)添加链接描述CVPR 2021大奖出炉!何恺明获最佳论文提名,华人四篇“最佳”!第一届Thomas S. Huang 纪念奖颁发
(46)CVPR 二十年,影响力最大的10篇论文!
(47)Just now, CVPR 2021 Best Paper, Best Student Paper and other awards have been announced! (Paper download address is attached)
(48) CVPR 2021 awards released: Best paper goes to Max Planck Institute, He Yuming was nominated, and the first Huang Xutao Memorial Award was announced
(49) Recommended open source papers this week: including face recognition, instance segmentation, tracking, SR, etc.
(50) CVPR 2021 papers are open for download!
(51) [CVPR 2021 Best Paper Candidate] 32 best paper candidates have been announced. Guess which one won the CVPR 2021 Best Paper Candidate?
(52) [June 2] Code sharing of ten (to be) open source papers
(53) May 26] Code sharing of seven (to be) open source papers
(56) 2021 745 published papers are the most comprehensive classification summary!
(57) Recommend several new CVPR 2021 open source papers, including image segmentation, domain adaptation, image retrieval, line of sight estimation, etc.

 

face technology

(01) The best new framework of CVPR2020|Large-scale facial expression recognition (with source code)
(02) Inventory|Lightweight face detection algorithm implementation, almost no friends are here~
(03) 10 lightweight face detection algorithms big PK|Code open source
(04) The remaining problems of face recognition: from occlusion, age, posture, makeup to kinship, face attack
(05) Overview of liveness detection algorithms in face recognition
(0) 6) TinaFace: A new record for face detection!
(07) The accuracy rate exceeds 99.5%!







如何入门多视角人脸正面化生成?不得不看的超详细最新综述!
(07)WACV 2021 论文大盘点-人脸技术篇
(08)重要!分享几个业界新出人脸识别数据集
(09)Facebook等新提出的视听语音分离的方法VisualVoice,利用跨模态一致性
VisualVoice: Audio-Visual Speech Separation with Cross-Modal Consistency
单位 |德克萨斯大学奥斯汀分校,Facebook
论文 |https://arxiv.org/abs/2101.03149
代码 |https://github.com/facebookresearch/VisualVoice
主页 |http://vision.cs.utexas.edu/projects/VisualVoice/
(10)人脸超分辨率,基于迭代合作的方法
(11)不得不赞!京东开源FaceX-Zoo,一站式人脸识别研究平台
(12)跳过人脸检测和关键点定位,Facebook等提出实时3D人脸姿态估计新方法
(13)无需人脸检测和关键点定位,Facebook等提出实时3D人脸姿态估计新方法
(14)CVPR 2021 | 中科大联合快手,提出人脸伪造检测新方法
(15)Face Transformer for Recognition is used for face recognition
(16) The Chinese team won the world's first face recognition mask!
(17) Open source! Face detection algorithm with only 85K parameters
(18) Occlusion face problem | Detailed interpretation of Attention-Based method to solve occlusion face recognition problem (attached paper download)
(19) CVPR2021 (Oral) Shang Tang, Hong Kong Chinese to achieve a new breakthrough in monocular face reconstruction: a renderer based on a generative network! Geometry is more precise! The rendering effect is more realistic!
(20) OpenVINO™ realizes eye fatigue/drowsiness detection based on facial landmark detection
(21) Tencent Youtu TFace is officially open source, more reliable face recognition!
(22) CVPR2021 (Oral) Shangtang and Hong Kong Chinese achieve a new breakthrough in monocular face reconstruction: a renderer based on a generative network! Geometry is more precise! The rendering effect is more realistic!
(23) Monocular 3D face reconstruction, wrinkles can change naturally with the expression, more realistic.
(24) 3D face modeling Snap et al. proposed the first one-shot 3D face style migration framework, which only needs an image of any style, and it can generate a 3D face model with exaggerated geometric shape and texture stylization.

 

Target Detection

(01) Video target detection inventory
(02) R-CenterNet: Use CenterNet to detect rotating targets
(03) The University of Hong Kong proposed OneNet: a one-stage end-to-end target detection network without NMS! No binary matching required!
(04) Overview of Anomaly Detection
(05) Open Source Software | Deep Learning for Road Disease Detection
(06) Transformer has made another contribution! Fast (420 fps) and good lane line detection algorithm
(07) NanoDet: Lightweight (1.8MB), ultra-fast (97fps on mobile) object detection project
(08) Use your strength to justify yourself, YOLOv5: I am the strongest in road damage detection! GRDDC'2020 Contest Report
(09) An alternative to YOLO, the 97FPS Anchor-Free target detection model NanoDet on the mobile phone is now open source~
(10) Excellent! Tongji Berkeley of Hong Kong University proposed Sparse R-CNN: a new paradigm for target detection
(11) Generalized Focal Loss V2 for painless target detection
(12) Using CenterNet to detect rotating targets
(13) Crack detection scheme based on computer vision
(14) Multimedia Laboratory of Chinese University of Hong Kong | Open source video target detection & tracking platform (with source code download)
(15)基于密度图的航空物体检测:理论与代码实现
(16)目标检测的稀疏对抗攻击,代码已开源
(17)北亚利桑那大学等推出:航拍森林火情检测数据集 FLAME
(18)无需NMS的目标检测,OneNet
(19)NAS在检测中的应用
(20)NeurIPS 2020 | 微软亚洲研究院论文摘录之目标检测篇
(21)难以置信的目标检测小妙招:多训练几个epochs,平均一下就能获得更好的模型

(22) Sparse adversarial attack for object detection, the code has been open-sourced
(23) C++ implements OpenVINO deployment of yolov5
(24) Jishi live broadcast playback 丨 No. 75 - Fang Hao: New SOTA for lane line detection, RESA: cyclic feature displacement aggregator (AAAI2021) (25) One article sorts out defect detection methods (26) Open source project | Based on YOLO-V5 to realize pedestrian social distance risk warning (with complete source code
) (
27 )
Heavyweight ! 13 Anchor free-based target detection methods
(28) Interpretation of rotating target detection methods (DCL, CVPR2021)
(29) One article sorting out defect detection methods
(30) No NMS! Alibaba and Ada proposed PSS: a simpler and more effective end-to-end target detection
(14) point-up technique! Small target detection: data augmentation
(15) AAAI 2021 target detection paper inventory (YOLObile/R3Det/StarNet, etc.)
(16) Target detection competition ideas, tricks collection, data summary
(17) CVPR 2021 | GFLV2: Target detection conscience technology, no Cost increase point!
(18) Detailed practical tutorial: Deploying YOLOv5 target detection with OpenCV's DNN module
(19)Dry goods practice | Anchor optimization improves target detection so significantly
(20) Small object problem in object detection
(21) Big change to Yolo framework | New target detection framework with extremely low energy consumption (attached paper download) (
22) Small target detection: data enhancement
(23) Big inventory | 2 best reviews of abnormal algorithms in 2020 (
24) Small target detection: Feature Extraction
(25) Summary of the latest research on industrial image anomaly detection (2019-20) 20)
(26) Overview丨Research progress on surface defect detection of industrial metal flat materials
(27) Detailed explanation of camouflaged target detection based on deep learning
(28) Deployment of YOLOV5 model based on Caffe format
(29) #WACV 2021 FisheyeYOLO: Universal Object Detection on Fisheye Cameras for Autonomous Driving. For object detection in fisheye images, the author found better representation methods in different object representation methods, such as oriented bounding boxes, ellipses and general polygons. And a novel curved bounding box model is designed, which has the best properties of the fisheye distortion model. FisheyeYOLO: Generalized Object Detection on Fisheye Cameras for Autonomous Driving unit | University of Limerick, Valeo paper | https://www.researchgate.net/publication/346931586_FisheyeYOLO_Object_Detection_on_Fisheye_Cameras_for_Autonomous_Driving code |

(30) Aerial object detection based on density map: theory and code implementation
(31) WACV 2021 paper inventory - target detection
(32) Target detection in AAAI 2021 (detailed version with code)
(33) From L1 loss to EIoU loss, a list of loss functions for target detection border regression
(34) #城市天眼# Developed by Skylark Labs in the United States, drone security monitoring can be used in high altitude (3- 90 meters) to detect and analyze behaviors of crowds and find suspicious activities.
Source: https://twitter.com/i/status/1364086835266211843
(35) Fast and accurate without lidar! SMOKE for 3D target detection
(36) Understanding Objectness in object detection

(37)目标检测一卷到底之后,终于有人为它挖了个新坑|CVPR2021 Oral
(38)CVPR2021目标检测佳作 | Weighted boxes fusion(附github源码及论文下载)
(39)基于YOLOV4深度网络的车辆压实线检测算法
(40)56.4 AP!超越YOLOv4,更快更强的CenterNet2来了!
(41)CVPR2121目标检测 | 少见的知识蒸馏用于目标检测(附论文下载)
(42)用于自动驾驶的实时车道线检测和智能告警
(43)全新FPN!CE-FPN:通道增强特征金字塔网络,助力目标检测涨点!
(44)极市项目|未拴绳遛狗识别算法需求
(45)基于YOLOV5深度网络模型的火焰检测
(46)基于YOLOV5深度网络模型的交通标志设施的模型训练
(47)基于深度学习YOLOV5网络的道路状况检测
(48)基于YOLOV5深度网络的公路病害检测
(49)使用Disentangling形式的损失函数回归2D和3D目标框
(50)CVPR 2021 | 腾讯AI Lab入选论文解读
(51)实操教程:android camera nanodet 实时物体检测的高效实现总结
(52)CVPR2021 目标检测佳作 | Weighted boxes fusion(附 GitHub 源码及论文下载)
(53)我扔掉FPN来做目标检测,效果竟然这么强!YOLOF开源:你只需要看一层特征|CVPR2021
(54)【入门教程】异常检测(Anomaly Detection)到底是什么?
(55)最强检测 | YOLO V4?都是弟弟! CenterNet2以56.4mAP超越当前所有检测模型
(56)mmdetection性能简单优化方法
(57)目标检测一卷到底之后,终于有人为它挖了个新坑|CVPR2021 Oral
(58)轻量高速检测器LFFD升级版LFD发布!用Pytorch部署,支持多类检测
(59)船舶检测 | 计算机视觉来看苏伊士运河堵船
(60)基于YOLOV4的印刷电路板PCB目标检测
(61)INT4量化用于目标检测
(62)超越YOLOv5!PP-YOLOv2:更快更好的目标检测网络
(62)Hugging Face发布PyTorch新库「Accelerate」:适用于多GPU、TPU、混合精度训练
(63)超越YOLOv5还不够!这个目标检测开源项目又上新了
(64)60.6 AP!打破COCO记录!微软提出DyHead:将注意力与目标检测Heads统一
(65)当YOLOv5遇见OpenVINO!
(66)OpenVINO™ 头部姿态评估网络应用演示
(67)实操教程|YOLOv5实现自定义对象训练与OpenVINO部署全解析
(68)缺陷检测算法汇总(传统+深度学习方式)|综述、源码
(69)一文梳理水下目标检测方法
(70)不容忽视的问题:行人检测器的泛化能力
(71)让检测告别遮挡 | NMS-Loss是如何解决目标检测中的遮挡问题的?
(72)旋转目标检测 | 基于高斯 Wasserstein 距离损失的目标检测(附源代码)
(73)干货 | 利用像机图像通过卷积神经网络实时进行水稻检测(致敬袁老)
(74)MaskedFace-Net | 新冠疫情中的口罩检测(附论文及源代码)
(75)CVPR 2021 | 谷歌提出MobileDets:轻量化目标检测网络
(76)收藏 | 使用合成数据集做目标检测
(77)Moving target detection - ViBe algorithm
(78) Selected series of target detection, the most complete summary at present!












 

Classification, re-identification (backbone network)

(01) NanoDet, a 1.8M ultra-light object detection model, runs faster than YOLO, and has more than 200 Stars in two days. (
02) Sun Yat-sen University proposes a new pedestrian re-identification method and the largest evaluation benchmark in history
(03) ECCV 2020 paper inventory - remote sensing and aerial image processing and recognition
(04) Sun Yat-sen University proposes a new pedestrian re-identification method and the largest evaluation benchmark in history
(05) Video person re-identification: relationship guides spatial attention + temporal features Extraction model
(06) Wuhan University and others released the latest review of ReID! Including the three top visual conferences, a new benchmark method AGW|TPAMI2021
(07) Sun Yat-sen University proposed a new pedestrian re-identification method and the largest evaluation benchmark in history
(08) was fully upgraded! FastReID V1.0 is officially open source: Beyond reID
(09) the strongest ResNet variant! Goodbye normalization! DeepMind proposed NFNet, the code has been open source!
(10) Review and Prospect of Deep Learning Pedestrian Re-Identification, Latest Articles of TPAMI 2021
(11) Supervised Pedestrian Re-Identification Problem in Camera Domain
(12) WACV 2021 Paper Review - Image Classification
(13) WACV 2021 Paper Review - Image and Video Retrieval
(14) CVPR 2021 | Target-Guided Human Attention Estimation Improves Zero-Shot Learning
(15)From the road to simplicity! In-depth interpretation of CVPR2021 paper RepVGG!
(16) Propose an end-to-end prototypical cross-domain self-supervised learning (PCS) framework for Few-shot Unsupervised Domain Adaptation (FUDA).
(17) ResNet has been strongly upgraded, and it will compete with EfficientNets only by improving training and expansion strategies

(19) CVPR2021|ACNet evolved again, Tsinghua University & Megvii Technology proposed an Inception type DBB
(20) After two years, EfficientNet v2 is here! Faster, smaller and stronger!
(21) 89.77% accuracy rate! Google proposed CoAtNet: Combining convolution and self-attention
(22) CVPR 2021 Oral | A new self-attention model beyond convolution! Google proposes: HaloNet, another super-strong visual backbone...
(23) Research status and problems of remote sensing image classification of hyperspectral images
(24) Google proposes a new model of "convolution + attention", surpassing the strongest variant of ResNet!
(25) The world's first open source image recognition system is launched
(26) Bytedance won the double champion of CVPR2021 fine-grained image competition
(27) Dry goods | Ali's image search architecture
(28) EfficientNetV2

 

Semantic Object Segmentation

(01) NeurIPS 2020 Oral: Using Pixel-Level Cyclic Consistency to Solve Domain Adaptive Semantic Segmentation Problems
(02) Performance Improvement by More Than 30%! Industrial SOTA's real-time instance segmentation algorithm SOLOv2, faster and stronger!
(03) Inventory of CVPR 2020 papers - matting
(04) Real-time matting without green screen, SenseTime and others proposed a new method MODNet that only needs a single image and a single model (
05) performance improvement of more than 30%, real-time instance segmentation algorithm SOLOv2 realizes industry SOTA
(06) YolactEdge, the first real-time instance segmentation method on edge devices (Jetson AGX Xavier: 30 FPS
(07) medical image segmentation Comprehensive comparison of the best methods: U-Net and U-Net++
(06) MODNet is easy to train in an end-to-end fashion. It is much faster than contemporaneous matting methods, running at 63 frames per second.

(07) In this work, the author proposed BoxInst, which can only be labeled with instance bounding boxes (rather than instance mask labels)
(08) The author named this joint task as depth-aware video panorama segmentation, and proposed a new evaluation index and two derived datasets for it, and said that these datasets will be made public.












Transformer再突破!MedT:医学图像分割新网络
(22)CVPR 2021 | MSRA提出像素级别自监督预训练方法PixPro,大幅提升下游检测分割任务性能
(23)SG-net:一次视频实例分割的空间粒度网络
(24)Panoptic FCN:真正End-to-End的全景分割
(25)CVPR 2021 Oral | Transformer再突破!美团等提出VisTR:视频实例分割网络
(26)CVPR 2021 | 250 FPS!让实时语义分割飞起!重新思考BiSeNet
(27)顶刊TPAMI 2021!南开大学提出深度霍夫变换:语义线检测新方法
(28)Segmenter:基于纯Transformer的语义分割网络
(29)谷歌等新作:视觉Transformer的有趣特性
(30)视觉Transformer比CNN更鲁棒!IBM华人研究员新作
(31)更快更强!谷歌提出NesT:收敛更快、鲁棒更好的Transformer

(32) real-time, high-resolution background replacement techniques that run at 30fps in 4K resolution and 60fps in HD, and the code is open source!
(33) HKU & NVIDIA proposed SegFormer: a new idea for simple and effective Transformer semantic segmentation
(34) Practical tutorial|An example of using image segmentation for defect detection
(35) CVPR2021 masterpiece | One-Shot is too much, Zero-Shot instance sample segmentation
(36) Training data does not need to be manually labeled and segmented, but can image segmentation also be achieved?
(37) CVPR2021 double-layer instance segmentation, greatly improving occlusion processing performance
(38) Google releases a new dataset for semantic segmentation! By the way, a model slaughter list was developed, which has been accepted by CVPR2021

 

target tracking

(01) Favorites | Introduction to Multi-Target Tracking (MOT)
(02) Overview of Single-Target Tracking
(03) Simple and crude Multi-Target Tracking Artifact – DeepSort
(04) Long-term target tracking combined with re-detection
(05) Target tracking has added a heavy open source toolbox, MMTracking is here!
(06) Remote sensing image + CNN, predicting regional population income level
(07) Summary of target tracking
(08) WACV 2021 paper inventory - target tracking

(09) Inadvertently "walking two steps" can lock identity information, which is the black technology of gait recognition.
Recommend a new review that comprehensively introduces the development of gait recognition, including technology evolution, main data sets, and the current level of technology. It is a must-read paper for understanding deep learning gait recognition.
Deep Gait Recognition: A Survey https://arxiv.org/pdf/2102.09546.pdf

(10) #手身追输# Human hands are extremely flexible, and there are various complex self-contacts and occlusions, which bring difficulties to tracking. Facebook Reality Labs recently invented an extremely accurate method of hand tracking by adding physical constraints to the visual model. Highly accurate tracking is possible with one or two hands.
Constraining Dense Hand Surface Tracking with Elasticity
Home | https://research.fb.com/publications/constraining-dense-hand-surface-tracking-with-elasticity/

(11) TraDeS: CVPR 2021 multi-target tracking algorithm, which improves the current online method of joint detection and tracking, uses tracking clues to assist detection, and achieves a significant improvement in accuracy in multiple data sets. The author is from the State University of New York. The paper has not yet been published, and the code will be open source.
Track to Detect and Segment: An Online Multi-Object Tracker
project homepage: https://jialianwu.com/projects/TraDeS.html

(12) Multi-channel surveillance video stitching system based on scale-invariant feature transformation
(13) TCSVT2021: A pedestrian re-identification method combining global and local fine-grained features
(14) Image stitching algorithm based on SIFT scale-invariant feature transformation
(15) The latest open source! TransReID: The first ReID network based on Transformer, leading in all tasks!
(16) Interpretation of WACV2021 paper - Scale Equivariance Improves Siamese Tracking
(17) End-to-end multi-object tracking, the code will be open source, Looking Beyond Two Frames: End-to-End Multi-Object Tracking Using Spatial and Temporal Transformers (18) SiamGAT is proposed for
object tracking, and its performance is ahead of many current advanced trackers, reaching SOTA .
(19) TCSVT2021: A pedestrian re-identification method that combines global and local fine-grained features
(20) CVPR 2021 | The first pedestrian search framework without an anchor frame (Anchor-Free) (with code)
(21) Pedestrian multi-target tracking based on YOLOV3 and DeepSort
(22) From theory to actual combat! Video Streaming Vehicle Counting and Object Tracking
(23)TPAMI 2021 :基于 event stream 的步态识别,准确率高达90%
(24)极市直播回放丨第80期-张新宇:CVPR 2021-​Alpha Refine:通过精确的边界框估计提高跟踪性能
(25)目标跟踪入门篇-相关滤波

 

动作检测与识别

(01)MMAction2: 新一代视频理解工具箱
(02)WACV 2021 论文大盘点-人体动作检测与识别篇
(03)CVPR 2021 | 用于动作识别,即插即用、混合注意力机制的 ACTION 模块
(04)CVPR 2021 | 商汤提出最强时序动作提名修正网络:TCANet
(05)人体动作识别与生成:基于ST-GCN的方法
(06)刷爆HACS挑战赛时序动作检测榜单!TCANet:最强时序动作提名修正网络 CVPR 2021
(07)更快更强!视频理解模型PP-TSM重磅发布:速度比SlowFast快4.5倍
(08)视频异常行为检测算法MPN,在多个数据库上达到SOTA
(09)CVPR2021Oral #人体运动捕捉使用 4 个RGBD摄像头进行人体运动捕捉,在几何重建和纹理生成上效果都更好
(10)CVPR 2021 | 又好又快的视频异常检测,引入元学习的动态原型学习组件

 

姿态估算

(01)多人姿态识别框架——AlphaPose
(02)GitHub:人体姿态估计最全资料集锦
(03)人体姿态估计 (Human Pose Estimation) 常用方法总结
(04)CVPR2020 | 旷视研究院提出 PVN3D:基于 3D 关键点投票网络的单目 6DoF 位姿估计算法
(05)人体姿态估计、识别与生成最新技术一览
(06)深度学习人体姿态估计:2014-2020全面调研
(07)最新开源:端到端6D物体姿态跟踪,无需标注数据集!
(08)手势识别基础~手势骨架与关键点提取
(09)动物姿态估计!马、老虎、牛、鹿、狗狗的姿态都能搞定!斩获CVPR 2021 Oral
(10)OpenVINO™ 头部姿态评估网络应用演示
(11)CVPR 2021 | 微软提出"解构式关键点回归", 刷新COCO自底向上多人姿态检测记录!

 

OCR

(01)万字长文 | 图表示学习中的Encoder-Decoder框架
(02)霸榜Github:又一款OCR神器面世!
(03)新视角:用图像分类来建模文字识别也可以SOTA
(05)都2021了,别再堆砌网络了!10万奖金悬赏最强轻量化OCR模型
(06)顶刊TPAMI 2021!PAN++:精确高效的任意形状文本检测与识别
(07)最新!CVPR 2021 OCR领域论文大盘点(22篇)
(08)论文推荐|【KSII TIIS 2021】DP-LinkNet:一种用于古籍文档图像二值化的卷积网络(有源码)

 

3D,深度估算,点云,SLAM

(01)CVPR2020 | 3D 目标检测新框架:3DSSD
(02)CenterFusion:融合雷达与摄像头数据的高精度3D目标检测
(03)最佳论文!商汤提出手机端实时单目三维重建系统 | ISMAR 2020

(04)商汤提出手机端实时单目三维重建系统,实现逼真AR效果和交互
(05)基于深度学习的图像匹配技术一览
(06)极市直播|AAAI’21杰出论文许鸿斌:一个解决三维重建对数据依赖的新框架(已开源)
(07)OpenCV再升级!修改一行代码,将图像匹配效果提升14%!
(08)重磅!谷歌开源TensorFlow 3D场景理解库
(09)极市直播回放丨第76期-许鸿斌:AAAI’21杰出论文,一个解决三维重建对数据依赖的新框架(已开源)
(10)可用于大规模点云表面重建的深度学习算法
(11)可用于大规模点云表面重建的深度学习算法

(12)深度估计是机器人和自动驾驶研究的重要内容,而这往往需要特殊设备,如RGB-D相机或激光雷达,如何使用RGB相机感知深度呢?研究人员曾经做了很多的尝试。该视频是CVPR 2021论文Depth from Camera Motion and Object Detection结果,通过使用“普通手机摄像头运动+目标检测的包围框”数据,设计RNN网络实现了达到最先进精度的目标深度估计。单位 | 密歇根大学,史蒂文森理工学院论文 | https://arxiv.org/abs/2103.01468代码 | https://github.com/griffbr/ODMD
(13)CVPR 2021 | TPCN 点云就是这么美妙
(14)一文了解激光点云的组织形式
(15)基于YOLO的新型RGB-D融合方法对行人进行检测和3D定位
(16)ECCV2020 | 夜间图像的无监督单目深度估计
(17)MVSNeRF: Fast Generalizable Radiance Field Reconstruction from Multi-View Stereo
(18)在 KITTI 基准数据集上实现最先进的单目3D目标检测结果,表现与基于单目视频的方法相当。
(19)真正实用的退化模型:ETH开源业内首个广义盲图像超分退化模型,性能效果绝佳
(20)ResNet也能用在3D模型上了!清华计图首创三角网格面片上的卷积神经网络:SubdivNet
(21)开源|AAAI‘21杰出论文-三维重建新探索:解决数据依赖问题,让自监督信号更可靠!
(22)综述:基于点云的自动驾驶3D目标检测和分类方法
(23)PatchmatchNet:一种高效multi-view stereo框架 (CVPR2021 Oral)
(24)CVPR2021|神经网络如何进行深度估计?
(25)DXSLAM:一种基于深度特征的鲁棒且高效的视觉SLAM系统
(26)实时高分辨率 RGB-D表面重建(CVPR2021)
(27)Complexer-YOLO:基于语义点云的实时三维目标检测与跟踪
(28)HDRUNet | 深圳先进院董超团队提出带降噪与反量化功能的单帧HDR重建算法
(29)基于点云的3D障碍物检测
(30)极市直播丨朱思语:基于深度学习的视觉稠密建图和定位
(31)基于3D Surfel图的单目直接法稀疏定位
(32)将合成 3D 场景表示合并到生成模型中,从而实现更可控的图像合成。
(33)传统单图像深度估计往往只能给出低分辨率结果,细节也不够丰富,视觉上总给人模糊不清的感觉,来自SFU和Adobe的研究者通过合并不同分辨率生成高分辨率的深度估计,终于可以还原清晰的细节。
(34) OmniPhotos, currently the fastest 360° panoramic VR photography method. The code is open source.
(35) The author proposes and integrates GrooMeD-NMS – a novel grouped mathematically differentiable NMS for monocular 3D object detection, (
36) CVPR 2021 | Adaptive Activation Function ACON: A New Paradigm for Unifying ReLU and Swish

 

GUN (image generation, super resolution, motion transfer)

(01) CVPR 2020 paper inventory - image enhancement and image restoration
(02) Harbin Institute of Technology and others proposed a lightweight blind super-resolution model LESRCNN, the code has been open source
(03) Latest! Comprehensive comparative study of image denoising
(04) Without user input, Adobe proposes new method for automatic high-quality image synthesis
(05) Researchers at NVIDIA Research propose an adaptive discriminator enhancement mechanism that significantly stabilizes training in limited data environments.
(06) Photos become cartoon style in seconds! Teach you to use PaddleGAN to quickly generate your exclusive cartoon avatar
(07) and accurately generate Fake faces! Amazon's new GAN model gives you all-round beauty without dead ends
(08) The postdoctoral sister has upgraded the "two-dimensional wife generator"! AniGAN: This time you can specify the style of painting
(09) Training GANs 10 lessons I learned in a year
(10) Covering 18+ SOTA GAN implementations, this open source project PyTorch library fire
(11) 6ms EfficientDeRain: Inspiring simple and efficient rain removal algorithm
(12) PULSE: An image super-resolution algorithm based on implicit space
(13) Solve the problem that previous single-image super-resolution algorithms only work well on synthetic data and cannot be applied to real scenes. It can generalize to different cameras without training on images from a specific type of camera. Exploiting Raw Images for Real-Scene Super-Resolution Unit | Carnegie Mellon University, SenseTime, University of California Paper | https://arxiv.org/pdf/2102.01579.pdf Code | https://www.dropbox.com/s/a66iuwoswul65da/RawSR_PAMI20.zip?dl=0 Home | https://sites.google.com/ view/xiangyuxu/rawsr_pami (14) The next generation
locker room! A virtual fitting application made by a foreign designer. 2D joint point tracking based on OpenPose, rendered with Houdini special effects.
Source: https://80.lv/articles/next-gen-dressing-room-with-markerless-tracking-in-houdini/

(15) #GAN #WACV2021
SinGAN-GIF can generate samples of any aspect ratio, perform super-resolution, change the time frame rate, and can be used for video editing applications.
SinGAN-GIF: Learning a Generative Video Model From a Single GIF
Author | Rajat Arora, Yong Jae Lee
Unit | University of California, Davis Paper
|
https://openaccess.thecvf.com/content/WACV2021/papers/Arora_SinGAN-GIF_Learning_a_Generative_Video_Model_From_a_Single _GIF_WACV_2021_paper.pdf
home page | https://rajat95.github.io/singan-gif/

(16) WACV 2021 Paper Inventory - Image Quality

(17) Morph-UGATIT: An image translation method that supports progressive domain migration
(18) An extremely fast video frame interpolation method recently proposed by UC San Diego, CMU, and Facebook is 384 times faster than the previous most accurate method and 23 times faster than the previous fastest 8-fold interpolation method. This video is a slow motion image obtained by using this method. The code will be open source.
FLAVR: Flow-Agnostic Video Representations for Fast Frame Interpolation
units | University of California, San Diego; Carnegie Mellon University; Facebook AI
paper | https://arxiv.org/abs/2012.08512
github | coming
home | https://tarun005.github.io/FLAVR/

(19) The Neural Body algorithm invented by Zhejiang University and other scholars can output 3D human body and new angle views by inputting multi-angle videos. Paper | https://arxiv.org/pdf/2012.15838.pdf Code | https://github.com/zju3dv/neuralbody (coming soon)

(20) CVPR 2021 Accepted Paper: AdCo Contrastive Learning Based on Adversarial

(21) The first DNN solution to employ both sensor data and images for video stabilization. Deep Online Fused Video Stabilization Unit | University of Wisconsin-Madison, Google Paper | https://arxiv.org/abs/2102.01279 Code | Coming Home | https://zhmeishi.github.io/dvs/

(22) The Neural Body algorithm invented by Zhejiang University and other scholars can output 3D human body and new angle views by inputting multi-angle videos. Paper | https://arxiv.org/pdf/2012.15838.pdf Code | https://github.com/zju3dv/neuralbody (coming soon)

(23) TIP 2021 paper: Joint realization of multi-exposure image fusion and super-resolution
(24) CVPR 2021 Oral|Silky 3D effects can be rendered in real time with only static images
(25) Intellectual interest丨Real-time style migration, running on mobile terminals, and face special effects have a new way
(26) CVPR 2021 | HKUST: How to use flash images to remove reflections?
(27) CVPR 2021 | Neighbor2Neighbor: A method for training arbitrary noise reduction networks with only noisy images
(28) Multifunctional image super-resolution model: Asymmetric convolutional neural network for blind image super-resolution
(29) A breakthrough in deep image restoration
(30) GANSpace: Discovering Interpretable GAN Controls
(31) TIP2021 | Multi-level feature fusion network in video super-resolution
(32) rt Flow: Unbiased Image Style Transfer via Reversible Neural Flows proposes ArtFlow to prevent content leaks in the general style transfer process. ArtFlow consists of a reversible neural flow and an unbiased feature transfer module. It supports both forward and backward reasoning, and operates using a projection-transfer-restore scheme. ArtFlow achieves comparable performance to state-of-the-art style transfer methods while avoiding content leaks.
(33)极市直播丨邓欣:TIP 2021-多曝光图像融合及超分辨的联合实现方法
(34)Weather GAN:实现晴、阴、雾、雨、雪之间的天气状况自由迁移
(35)CVPR 2021 | 五官画风都能改,用无监督方法控制 GAN (附源码) -周博磊团队
(36)CVPR 2021|Neighbor2Neighbor:无需干净图像的自监督图像降噪
(37)图像反光能被一键去除了?港科大开源RFC,仅用一个操作,强反光也能完美去除|CVPR2021
(38)你好,这是微视AI还原的李焕英
(39)有限数据来训练GAN的一种思路
(40)揭秘腾讯微视人脸技术「黑科技」,基于GAN的人脸魔法特效
(41)添加链接描述
(42)CVPR2021|超分性能不降低,计算量降低50%,董超等人提出加速图像超分的ClassSR
(43)RealSR性能大幅提升!旷视+快手+电子科大联合提出“先发散再收敛”的D2CSR
(44)仅需2张图!AI便可生成完整运动过程
(45)PornHub 用独家数据集!修复了百年前的电影…
(46)新垣结衣夫妇的孩子会长啥样?我用BabyGAN预测试试…
(47)just! AMD releases new super-resolution technology FSR: N cards can also be used
(48) Transformer makes another breakthrough! ETH proposes: Video super-resolution Transformer
(49) Cai Xukun x special xxx dream linkage! This artifact allows images to imitate human movements in real time
(50) Everyone can use the second dimension! This GAN network allows Miss Sister to generate different styles of anime images! Changeable skin color and hairstyle
(51) This AI artifact brings father back to 18 years old! GAN model generates anime portraits in 130 milliseconds! (60)








Deepfake text version turned out: AI high imitation of your handwriting only needs 1 word!
(61) [Open source] Discussion on font generation based on image background, human body pose prediction, key point detection, super-resolution, etc. (62
) https://intel-isl.github.io/PhotorealismEnhancement/
(63) Image filling is not afraid of large areas! MSRA et al. proposed co-modulation generative confrontation network
(54) CVPR 2021 Oral | GLEAN: High-magnification image super-resolution based on implicit generation library
(55) beats the pack! 2021 NTIRE @CVPR 2021 Triple Crown and One Sub-Video Super Score Solution: BasicVSR++

 

GNN (Graph Neural Correlation)

(01) ECCV 2020 Paper Inventory - Image and Video Restoration
(02) Detailed Explanation: Types of Multimodal Knowledge Graphs and Their Applications
(03) Facebook@ICLR2021: Adding label propagation to GNN, the training time is reduced by 100 times

 

Transformer

(01) Transformer is the next step, Facebook and others propose a multi-target tracking algorithm TrackFormer
(02) Full text translation | Huawei, Peking University, University of Sydney: Overview of the latest visual Transformer (2017-2020) (
03) Transformer in computer vision
(04) The latest application of Transformer, 3D point cloud processing, achieves the first breakthrough of 70% in S3DIS dataset scene segmentation mIoU!

(05)用Pytorch轻松实现28个视觉Transformer,开源库 timm 了解一下!(附代码解读)
(06)一文看懂9种Transformer结构
(07)更深、更轻量级的Transformer!Facebook提出:DeLighT
(08)刷爆AI圈!基于Transformer的DALL-E代码刚刚开源了
(09)Transformer又来了!这个谷歌3D大法闻歌起舞,流畅且自然!
(10)视觉Transformer之简单总结
(11)效果远超Transformer!AAAI 2021最佳论文Informer:最强最快的序列预测神器
(12)Transformer携手Evolving Attention在CV与NLP领域全面涨点!
(13)无卷积!金字塔视觉Transformer(PVT):用于密集预测的多功能backbone
(14)CVPR 2021 | Transformer进军low-level视觉!北大华为等提出预训练模型IPT
(15)CVPR 2021 Oral | Transformer再发力!华南理工和微信提出UP-DETR:无监督预训练检测器
(16)搞懂 Vision Transformer 原理和代码,看这篇技术综述就够了(二)
(17) The powerful combination of CNN and Transformer! Google's latest open source BoTNet, ImageNet has an accuracy rate of 84.7%
(18) ResNet has been completely surpassed, and it is done by Transformer: YITU Technology open source "large or small" T2T-ViT, the lightweight version is better than MobileNet (19) Dimensionality reduction blow from Transformer: ReID takes the lead in all tasks, Ali & Zhejiang University proposed TransReID (20) Paper Express: Pyramid Transformer, Transformer backbone architecture more suitable for dense prediction tasks (21) ) Visualization of Visual Transformer|CVPR2021 (22) Meituan proposed a Transformer with "position
encoding "
, which
is better
than ViT and DeiT
(23) Maxing out the AI ​​​​circle! The DALL-E code based on Transformer has just been open sourced
(24) CVPR2021 | Target detection with Transformers unsupervised pre-training
(25) CVPR2021 | Target detection with Transformers unsupervised pre-training
(26) Facebook's first spatio-temporal Transformer training speed far exceeds 3D CNN!
(27) Add link description
(28) CVPR 2021 | Transformer makes another breakthrough! Fudan et al. proposed SETR: Semantic Segmentation Network
(29)Dominating the list of major CV tasks, Swin Transformer was born!
(30) https://arxiv.org/abs/2103.14803
(31) On the Adversarial Robustness of Visual Transformers
(32) Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers
(33) CrossViT: Cross- Attention Multi-Scale Vision Transformer for Image Classification is used for image classification, the code will be open source (34) HiT:
Hierarchical Transformer with Momentum Contrast for Video-Text Retrieval for video text retrieval
(35) TransCenter: Transformers with Dense Queries for Multiple-Object Tracking for multi-object tracking, the code will be open source (36
) TFPose: Direct Human Pose Est imation with Transformers is used for human body pose estimation, the code is open source
(37)Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding
(38) https://zhuanlan.zhihu.com/p/361092528
(39) https://zhuanlan.zhihu.com/p/361059921
(40) New paper Stone Hammer Transformer: Don't just look at attention, there is no residual and MLP, It is nothing
(41) CNN helps again! CoaT: Co-Scale convolution-attention image Transformer
(42) Fudan proposed M2TR: the first multi-modal multi-scale Transformer
(43) Wu Enda is really top-notch! New Transformers! The deep learning course was updated, nearly 600,000 people signed up...
(44) Twins: Rethinking the spatial attention mechanism in the visual Transformer
(45) Slaughtering major CV tasks! "Baidu Dinghui Thesis Reappearance Camp" comes with Swin Transformer!
(46) Heavy open source! Twins: More efficient visual Transformer backbone network, perfect for downstream detection and segmentation tasks
(47) Transformer's mid-life crisis
(48) Transformer makes another breakthrough! Xiamen University and others proposed ISTR: end-to-end instance segmentation
(49)Transformer makes another breakthrough! Swin-Unet: The first pure Transformer medical image segmentation network
(50) Google replaced the Transformer self-attention layer with Fourier transform! 7 times faster on GPU, 2 times faster on TPU...
(51) DeepViT: Towards a deeper visual Transformer
(52) Beyond PVT! Nantah University proposed ResT: an efficient multi-scale visual Transformer
(53) surpassing PVT! Nantah University proposes ResT: Efficient Multi-Scale Visual Transformer
(54) Transformer makes another breakthrough! DeepMind's new model automatically generates CAD sketches. Netizens: Architectural design is about to take off
(55) Rethinking: Jump connections applicable to both ResNet and Transformer
(56) Latest! CVPR 2021 Visual Transformer Papers Inventory (43)
(57) Transformer makes another victory! The top of the low-level tasks is occupied, and the University of Science and Technology of China and others jointly proposed: Uformer
(58) Tsinghua University proposed DynamicViT: Efficient Visual Transformer for Dynamic Token Sparse
(59) Surpassing StyleGAN! TransGAN update! Building High Resolution GAN with Pure Transformer
(60) Not all images are worth 16x16 words! Tsinghua & Huawei proposed DVT: Dynamic Vision Transformer
(61)Tencent proposes Shuffle Transformer: Rethinking the Space Shuffle of Vision Transformer
(62) Transformer is crazy! Actually won the ImageNet competition of the graph neural network, beating DeepMind, Baidu...
(63) Google Brain's new work: teach you to train your own visual Transformer model
(64) Nankai & Ali proposed P2T: a visual Transformer based on pyramid pooling! Can be used for various downstream scene understanding tasks!
(65) Google's new work: Visual Transformer surpasses ResNet! ! ! Train from scratch!
(66) Apple's new work: Transformer without attention is still top-notch! ! !
(67) BEYOND PVT! NTU proposed ResT: an efficient multi-scale visual Transformer
(68) Zhejiang University & Huawei proposed VTP: the first pruning method for visual Transformer
(69) Transformer for end-to-end target detection and tracking (with source code)
(70) How do you view the application prospects of unsupervised learning on vision transformers?
(71) You only need to watch one sequence! YOLOS: Rethink Transformer's generalization performance
(72) Universal Transformer's cornerstone visual architecture, bringing a wide range of performance improvements! (with project address)
(73)The PVT based on detectron2 is open-sourced; it can be used for Backbone’s pyramid vision transformer for intensive tasks
(74) To understand the principle and code of Vision Transformer, it is enough to read this technical overview (9)
(75) Jishi Live丨Chen Xin: CVPR 2021-​TransT: High-performance single-target tracking algorithm based on Transformer
(76) Breaking the fate of Transformer, rookie VOLO open source! Sweeping multiple CV records, the first model that exceeds 87%
(77) OpenVINO™ realizes eye fatigue/drowsiness detection based on face landmark detection
(78) Various Transformers are slightly inferior, LV-ViT: Exploring multiple efficient tricks for improving ViT performance
(79) Using TRansformer for end-to-end target detection and tracking (with source code)
(80) COTR An image matching network based on Transformer
(81) Google AI uses 30 100 million data to train a 2 billion parameter Vision Transformer model, reaching a new SOTA on ImageNet!
(82) FcaNet: Rethinking the attention mechanism from the perspective of the frequency domain
(83) Heavy open source! Twins: More efficient visual Transformer backbone network, perfect for downstream detection and segmentation tasks
(84)Introducing Transformer to Facebook in the CV session has a new discovery this time: self-supervised learning + Vision Transformers are more suitable!

 

Semi-supervised, unsupervised, reinforcement learning

(01) Can pseudo-labels still be used in this way? Semi-supervised masterpiece UPS (ICLR 2021) revealed!
(02) 3 secrets in deep learning: integration, knowledge distillation and self-distillation
(03) Can pseudo-labels be used like this? Semi-supervised masterpiece UPS (ICLR 2021) revealed!
(04) From SimCLR to BarLow Twins, an article to understand the cognitive development history of self-supervised learning that keeps hitting faces
(05) From 4 top conference papers to see the latest research progress of Self-training
(06) Inventory | He Yuming's team's work in the field of self-supervision: MoCo Trilogy
(07) CVPR 2021 Oral | Wonderful! Image line segments that are not afraid of occlusion can be matched with SOLD2, and can also be combined with self-supervised line segment detection
(08) CVPR 2021 | New work by He Yuming and others! In-depth exploration of unsupervised spatiotemporal representation learning
(09) beyond SEED! Tencent Youtu Proposed DisCo: Rescue the Effect of Small Models in Self-Supervised Learning
(10) CVPR 2021 | Breakthrough Research! Applying self-supervised learning to automatic driving
(11) LeCun teamed up with Chinese postdoctoral fellows to propose a new self-supervised learning work! But Reddit netizens questioned: The first picture is wrong...
(12) CVPR 2021 | Peking University & MSRA proposed CPS: Semi-supervised semantic segmentation based on cross pseudo-supervision
(13) In-depth understanding of self-supervised learning, just read this interpretation! Hinton team masterpiece: SimCLR series

 

Model optimization, compression, acceleration, NAS (network search), attention mechanism

(01)超越MobileNetv3!Facebook提出FP-NAS:搜索速度快,精度更高添加链接描述
(02)如何简单有效地实现迁移学习?ECCV 2020 论文介绍
(03)超越 EfficientNet与MobileNetV3,NeurIPS 2020 微软NAS方向最新研究
(04)模型压缩新突破,刷新滤波器剪枝的SOTA效果,优图NeurIPS 2020论文
(05)从频域角度重新思考注意力机制——FcaNet
(06)即插即用!视频超分中的涨点神器:iSeeBetter
(07)可变形卷积的深度思考
(08)真正的即插即用!盘点11种CNN网络设计中精巧通用的“小”插件
(10)深度学习模型压缩与加速综述
(11)量化新方:模型压缩 6 倍,无需重训练
(12)用20篇论文走完知识蒸馏在 2014-2020 年的技术进展
(13)基于TensorRT量化部署YOLOV5s 4.0模型
(14)推理实践落地 | 最详细的 Pytorch 底层算子扩展总结(文末附源码)
(15)教程:基于TensorRT完成NanoDet模型部署
(16) Image and video compression based on deep learning
(17) Three tips for improving the accuracy of deep learning: model integration, knowledge distillation, and self-distillation
(18) CVPR2021 deep framework training | Not all data enhancements can improve the final accuracy
(19) CVPR 2021 |
(20) CVPR Oral: I will show you a show made out of nothing|Beihang Shangtang Yale
(21) CVPR 2021 | Adaptive activation function ACON: A new paradigm for unifying ReLU and Swish
(22) New height of dynamic filter convolution! DDF: Simultaneously solve the two major defects of content agnostic and computational complexity|CVPR 2021
(23) Attention Nine-story Pagoda: Nine-fold Understanding of Attention Mechanism
(24) CNN visualization has added a new work! Nantah University proposed Group-CAM: an efficient saliency map generation method
(25) surpassing self-attention! Tsinghua proposed EA and EAMLP: a new attention mechanism using two linear layers
(26) plug and play! Zhejiang University & Hong Kong Chinese proposed CompConv convolution: let the model not lose accuracy and speed up
(27) Summary and code implementation of Attention mechanism in deep learning (2017-2021)
(28) Overview of image enhancement based on deep learning
(29)Complete analysis of RNN, Seq2Seq, Attention Mechanism
(30) Overview | Attention Mechanism
(31) CVPR 2021 | Beyond Convolution, Self-Attention Model HaloNet
(32) CVPR 2021 | Neural Architecture Search Based on Random Labels
(33) This may be the strongest AI algorithm visualization artifact!

 

datasets, competitions, annotation tools, utilities

(01) Inventory of CVPR 2020 Algorithm Competition
(02) Annotation tool for trapezoidal coordinates (can be used for license plate/OCR/face key points)
(03) From 3D face to automatic driving, ten top open source datasets of CVPR2020
(04) 80GB medical imaging dataset released! OCTA-500 public download
(05) hot GitHub! The visualization artifact of 3.2k Star is open source!
(06) Watch CNN training up close! 360-degree visualization, netizens: the beauty is unreal
(07) Tianchi complete open source dataset!
(08) RTX 3090 deep learning environment configuration guide: Pytorch, TensorFlow, Keras
(09) Erasure: 3 important means to improve CNN feature visualization
(10) 500,000 bonus, 1 billion pixels, this target detection and tracking is not easy
(11) Over 20 million pictures, the world's largest human eye image dataset is
open source Download, the preliminary competition officially starts today!
A total of 7G data sets and related annotations, PANDA-Image consists of 555 static gigapixel images, containing a total of 21 different scenes, of which the Training set includes 390 images.
Registration link: https://tianchi.aliyun.com/s/be6691073b92dc4f2c2f230db97af7f5
Technical interpretation: https://mp.weixin.qq.com/s/AYW7_yJjKv3dmkYJEJDJNg

(13)终于来了!我们发布了 PAKDD 2021 智能运维大赛 baseline
(14)RankDataset:超大规模数据集加载利器
(15)史上最全RGB-D数据集在这里!附详细对比下载文档!
(16)10万奖励+10万数据集!垃圾分类/表情识别等赛事全面启动!2021高通人工智能应用创新大赛来了
(17)ImageNet验证集6%的标签都是错的,MIT:十大常用数据集没那么靠谱
(18)PANDA行人和车辆多目标检测方案及baseline代码
(19)10万+数据集,表情识别/农作物病虫害识别/垃圾分类识别/手绘图像识别四大赛题等你挑战
(20)CVPR 2021 | Short-video Face Parsing Challenge 开赛,数据集已开放!
(21)54万奖金!目标检测新赛事!百度发起"智能交通检测"大赛
(22)CVPR 2021商品识别竞赛来了!阿里达摩院主办
(23)31万奖金!目标检测新赛事!第六届信也科技杯智能零售算法大赛来了
(24)ICCV 2021 | 规模最大的戴口罩人脸识别比赛启动!
(25)ICCV 2021 | 首个大规模视频语义分割比赛启动!
(26) 100,000 prize pool! The OpenMMLab Algorithm Ecological Contest is officially launched!
(27) Three tracks of motion detection/location/analysis! ICCV 2021 DeeperAction Challenge is here

(28) Fighting Immortals丨NTIRE2021 Video Super Score Challenge Dual-Track Program
(29) 3D Human Object Detection and Behavior Analysis Competition kicks off, with a prize pool of 70,000+ and a data set of 16,671 pieces!
 

miscellaneous

(01) MultiPoseNet: human detection, pose estimation, and semantic segmentation, all in one "net"
(02) 10 open source Python OpenCV small projects, popular on YouTube
(03) Image algorithms can process videos stably! Hong Kong University of Science and Technology open-sourced a general algorithm to solve the problem of temporal instability in video processing|NeurIPS 2020
(04) performance SOTA, applicable to various types of objects, National University of Defense Technology single RGB-D image to predict object symmetry
(05) Remote sensing image + CNN, predict regional population income level
(06) Tsinghua & Megvii proposed RepVGG: Let your CNN roll to the end!
(07) RepVGG: Minimalist architecture, SOTA performance, making the VGG model great again!
(08) ICLR 2021 | SEED: Self-supervised distillation learning, significantly improving the performance of small models!
(09) MIT new framework | MIT open source high-performance automatic differentiation framework, speed up by 4.5 times (with framework source code)
(10) Multi-modal deep learning: use deep learning to integrate various information
(11) AdvProp: two sets of Batch Normalization to help you efficiently increase points in CNN confrontation training
(12) Top publication TPAMI 2021 | Can data amplification be achieved by changing the loss function?
(13) Megvii proposed MomentumBN: Alleviating the large batch requirements of self-supervised learning, the increase point is obvious!
(14) Raise some artifacts! Nanjing University proposed IC Networks: remodeling the basic unit of CNN
(15) a magic weapon! Re-label ImageNet, let CNN increase significantly! The code has been open sourced
(16) DeepMind redesigned high-performance ResNet! No need to activate the normalization layer
(17) generalization artifact! Li Mu and others proposed two regularization techniques: both CV and NLP have been greatly improved
(18) CVPR 2021 | RepVGG: minimalist architecture, SOTA performance, making the VGG model great again!
(19) CVPR 2021 | A magic weapon for rising points! IC-Conv: Inception convolution using efficient hole search, all-round improvement!
(12) CVPR 2021 | Plug and Play! CA: New attention mechanism, help classification/detection/segmentation rise!
(13) ICLR2021 Oral|9 lines of code to improve the generalization ability of few-shot learning, the code has been open-sourced
(24) Overview: lightweight CNN architecture design
(25) Embedding position information into channel attention! NUS proposed a new mechanism to significantly improve the expression of convolutional features|CVPR2021
(26) Add a link to describe CVPR2021 masterpiece | Relabeling ImageNet: From global labels to local labels (with GitHub code and papers)
(27) Relabeling ImageNet: Multi-label, comprehensively improve model performance
(28)just! The frequency domain channel attention network FcaNet is open source!
(29) I roll myself - cvpr2021: Involution
(30) Don't you want to give performance for free? cvpr2021-Diverse branch block
(31) Jishi Salon Review|CVPR2021-Li Duo: Visual Recognition by Inverting the Intrinsic Properties of Convolution
(32) DO-Conv Painless Growth Point: Using over-parameterized convolutional layers to improve CNN performance
(33) Dynamic convolution super evolution! Channel fusion replaces attention, reducing the number of parameters by 75% and significantly improving performance ICLR 2021
(34) CVPR'21 | Involution: A new neural network operator that surpasses convolution and self-attention
(35) [New Attention] The strongest Attention function is born, bringing you unexpected huge improvements!
(36) ICML 2021 (Long Oral) | In-depth study of imbalanced regression problems
(37) Google Brain new work: focus on MLP!
(38) ICML 2021 | A new method of sparse training: In-Time Over-Parameterization
(40) Tsinghua University proposed RepMLP: FC "involve", roll out performance!
(41) Turing Award winner Bengio has published a new paper: Using reinforcement learning to improve model generalization! Netizens crash: idea crashed...
(42)Read all 20 kinds of convolutions in deep learning in one article (with source code collation and paper interpretation)
(43) Dry goods|Heavy parameter skills in deep learning
(44) Climb to a higher peak! The team of Yan Shuicheng and Cheng Mingming open-sourced ViP, introduced a three-dimensional information coding mechanism, and did not require convolution and attention
(45) Selected latest video anti-shake papers + open source code summary
(46) CVPR 2021 | Following Google, scholars such as Tsinghua University and Oxford have published three MLP-related papers, and LeCun is also speaking (48) The latest overview of field generalization (49) Practical tutorial | Using CNN to detect fake images (50) Introducing a new activation function family ACON (51) CVPR 2021 Image Compression Latest Progress (52) Google releases a new dataset for semantic segmentation! By the way, a model was developed, which has been accepted by CVPR2021 (53) CVPR 2021 | Adaptive activation function ACON: A new paradigm for unifying ReLU and Swish






 

article reading

(01) LS-Net: Non-linear least squares learning algorithm for single and binocular vision
(02) GNN and RL are rising strongly, and CNN is beginning to show signs of fatigue? This is ICLR 2021's most comprehensive paper topic analysis
(03) SimSiam, the latest masterpiece of He Yuming's team: Eliminate the "collapse" of representation learning, and explore the root of the success of comparative expression learning
(04) Can a simple structure be efficient and accurate? Tsinghua & Huawei proposed a new residual cycle super-resolution model: RRN!
(05) The Transformer jointly built by Huawei Peking University and others surpassed CNN in the field of CV: multiple underlying visual tasks reached SOTA
(06) Tencent micro-vision model | the best result in the history of a single model, (BLENDer) topped the authoritative list of VCR
(07) "Thinking carefully" Faster-R-CNN
(08) ACCV 2020 Top 10 most concerned open source code papers!

(09) Detailed Explanation: Types of Multimodal Knowledge Graphs and Their Applications
(10) Summary of Excellent Papers on Noise Samples (2017-2020)
(11) When Frequency Domain (DCT) Meets CNN
(11) My brother questioned that the CV paper of Google Top Conference is wrong! And took out the reproducible code to prove
(12) In-depth research on model compression classic Ghostnet: How to generate a large number of feature maps with a small amount of calculation?
(13) AAAI21 Best Paper Informer: A long-sequence prediction artifact that far outperforms Transformer!
(14) How does the target detection algorithm of deep learning solve the scale problem?
(15) Diagram RepVGG
(16) Detailed Explanation: Types of Multimodal Knowledge Graphs and Their Applications
(17) Activate or not? CVPR2021-Activate Or Not: Learning Customized Activation
(18) When CV meets federated learning! FedVision: The first lightweight and scalable vision federation open source framework
(19) Deep learning predecessors have high precision, how to innovate?
(20) Google AI Research Institute: Underestimated data! Overrated model...
(21) New pits are coming! Google proposed MLP-Mixer: a visual architecture composed of pure MLP
(22) I did fisheye correction for the first photo of Mars in China
(23) PEER REVIEW IS A JOKE! Nature broke the news: Computer-generated spam articles can still be accepted, 64% come from China
(24) The visual architecture is unified! Hong Kong Chinese proposed: Container, which unifies CNN, Transformer and MLP-Mixer
(25) True Bicycle! Huawei's genius boy just "released" an unmanned bicycle. Netizens: Isn't this TM more flammable than Tesla?
(26) Papers cannot be reproduced! Really public execution! PapersWithCode launches "Thesis Reproduction Report"
(27) In 2021, what are the unsaturated, potential and rising research directions of deep learning?
(28) Boston Dynamics robot dog working this year
(29) Introduction to research on unmanned vehicle tracking technology

insert image description here

Guess you like

Origin blog.csdn.net/weixin_43013761/article/details/111400851