CV Computer Vision Daily: Quick Overview of Open-Source Papers with Code (2023.11.21)

1.【Basic Network Architecture: Transformer】Multi-entity Video Transformers for Fine-Grained Video Representation Learning

2.【Anomaly Detection】NNG-Mix: Improving Semi-supervised Anomaly Detection with Pseudo-anomaly Generation

3.【Semantic Segmentation】Generalized Category Discovery in Semantic Segmentation

4.【3D Object Detection】Sparse4D v3: Advancing End-to-End 3D Detection and Tracking

5.【Point Cloud】Point Cloud Self-supervised Learning via 3D to Multi-view Masked Autoencoder

6.【Point Cloud 3D Object Detection】Domain Generalization of 3D Object Detection by Density-Resampling

7.【Medical Image Segmentation】SA-Med2D-20M Dataset: Segment Anything in 2D Medical Imaging with 20 Million masks

8.【Multimodal】VLM-Eval: A General Evaluation on Video Large Language Models

9.【Multimodal】LION: Empowering Multimodal Large Language Model with Dual-Level Visual Knowledge

10.【Multimodal】CORE-MM: Complex Open-Ended Reasoning Evaluation For Multi-Modal Large Language Models

11.【Multimodal】GPT-4V(ision) for Robotics: Multimodal Task Planning from Human Demonstration

12.【Digital Human】Semantic-Preserved Point-based Human Avatar

13.【Autonomous Driving】A Language Agent for Autonomous Driving

14.【Diffusion】Reti-Diff: Illumination Degradation Image Restoration with Retinex-based Latent Diffusion Model

15.【Human Pose Estimation】Multiple View Geometry Transformers for 3D Human Pose Estimation

16.【Crowd Counting】Evaluating Supervision Levels Trade-Offs for Infrared-Based People Counting

17.【Image Restoration】Deep Equilibrium Diffusion Restoration with Parallel Sampling

18.【NeRF】Entangled View-Epipolar Information Aggregation for Generalizable Neural Radiance Fields

19.【3D Reconstruction】LiDAR-HMR: 3D Human Mesh Recovery from LiDAR

CV Computer Vision Discussion Group

The group covers object detection, image segmentation, object tracking, Transformer, multimodal learning, NeRF, GAN, defect detection, salient object detection, keypoint detection, super-resolution reconstruction, SLAM, face, OCR, biomedical imaging, 3D reconstruction, pose estimation, autonomous driving perception, depth estimation, video understanding, action recognition, image dehazing, image deraining, image restoration, image retrieval, lane detection, point cloud object detection, point cloud segmentation, image compression, motion prediction, neural network quantization, network deployment, and other fields. Experts in these areas share technical knowledge, interview tips, and internal referral job postings from time to time.

To join the group, add the administrator on WeChat (ID: PingShanHai666). When sending the friend request, include a note in the form: school/company + research direction + nickname.

Recommended reading:

CV computer vision daily open source code Paper with code quick overview-2023.11.20

CV computer vision daily open source code Paper with code quick overview-2023.11.17

CV computer vision daily open source code Paper with code quick overview-2023.11.16

Origin blog.csdn.net/zhangkai950121/article/details/134630475