CV computer vision daily open source code Paper with code quick overview-2023.11.22

Click @CVComputer Vision to follow more CV information

The paper has been packaged, click to enter -> download interface

Click to join—>CV computer vision exchange group

1.【语义分割】Mobile-Seed: Joint Semantic Segmentation and Boundary Detection for Mobile Robots

2.【Medical Image Segmentation】Semi-supervised Medical Image Segmentation via Query Distribution Consistency

3.【Super-resolution reconstruction】Swift Parameter-free Attention Network for Efficient Super-Resolution

4.【域自适应】(WACV2024)GLAD: Global-Local View Alignment and Background Debiasing for Unsupervised Video Domain Adaptation with Large Domain Gap

5.【Multi-Modal】ShareGPT4V: Improving Large Multi-Modal Models with Better Captions

6.【多模态】GPT4Motion: Scripting Physical Motions in Text-to-Video Generation via Blender-Oriented GPT Planning

7.【多模态】From Wrong To Right: A Recursive Approach Towards Vision-Language Explanation

8.【多模态】ViLaM: A Vision-Language Model with Enhanced Visual Grounding and Generalization Capability

9.【多模态】Boosting Audio-visual Zero-shot Learning with Large Language Models

10.【Multi-modal】Enhancing Novel Object Detection via Cooperative Foundational Models

11.【自动驾驶:Occupancy Prediction】SelfOcc: Self-Supervised Vision-Based 3D Occupancy Prediction

12.【Diffusion】Concept Sliders: LoRA Adaptors for Precise Control in Diffusion Models

13.【Object Count】Point, Segment and Count: A Generalized Framework for Object Counting

14.【视频生成】MagicDance: Realistic Human Dance Video Generation with Motions & Facial Expressions Transfer

15.【3D Reconstruction】TouchSDF: A DeepSDF Approach for 3D Shape Reconstruction using Vision-Based Tactile Sensing

The paper has been packaged , download link

CV computer vision communication group

The group includes target detection, image segmentation, target tracking, Transformer, multi-modality, NeRF, GAN, defect detection, salient target detection, key point detection, super-resolution reconstruction, SLAM, face, OCR, biomedical images, 3D reconstruction, attitude estimation, autonomous driving perception, depth estimation, video understanding, behavior recognition, image dehazing, image deraining, image restoration, image retrieval, lane line detection, point cloud target detection, point cloud segmentation, image compression, motion Leaders in prediction, neural network quantification, network deployment and other fields share technical knowledge, interview skills and internally recommended recruitment information from time to time .

Students who want to join the group please add WeChat ID to contact the administrator: PingShanHai666 . When adding friends, please note: school/company + research direction + nickname .

Recommended reading:

CV computer vision daily open source code Paper with code quick overview-2023.11.21

CV computer vision daily open source code Paper with code quick overview-2023.11.20

CV computer vision daily open source code Paper with code quick overview-2023.11.17

CV computer vision daily open source code Paper with code quick overview-2023.11.16

Guess you like

Origin blog.csdn.net/zhangkai950121/article/details/134657529