CV computer vision daily open source code Paper with code quick overview-2023.11.23

Click @CVComputer Vision to follow more CV information

The paper has been packaged, click to enter -> download interface

Click to join—>CV computer vision exchange group

1. [Basic network architecture: Transformer] White-Box Transformers via Sparse Rate Reduction: Compression Is All There Is?

2.【Rotating Object Detection】Innovative Horizons in Aerial Imagery: LSKNet Meets DiffusionDet for Advanced Object Detection

3.【Image Segmentation】Visual In-Context Prompting

4.【Medical Image Segmentation】SegVol: Universal and Interactive Volumetric Medical Image Segmentation

5.【Domain Adaptive】DA-STC: Domain Adaptive Video Semantic Segmentation via Spatio-Temporal Consistency

6.【多模态】Soulstyler: Using Large Language Model to Guide Image Style Transfer for Target Object

7.【Multi-modal】PG-Video-LLaVA: Pixel Grounding Large Video-Language Models

8.【多模态】FusionFrames: Efficient Architectural Aspects for Text-to-Video Generation Pipeline

9.【Multimodal】LiveChat: Video Comment Generation from Audio-Visual Multimodal Contexts

10. [Number People] XAGen: 3D Expressive Human Avatars Generation

11.【Depth Estimation】Camera-Independent Single Image Depth Estimation from Defocus Blur

12.【Diffusion】DiffusionMat: Alpha Matting as Sequential Refinement Learning

13.【Target Counting】T-Rex: Counting by Visual Prompting

14.【NeRF】PIE-NeRF: Physics-based Interactive Elastodynamics with NeRF

15.【图像合成】Diffusion360: Seamless 360 Degree Panoramic Image Generation based on Diffusion Models

The paper has been packaged , download link

CV computer vision communication group

The group includes target detection, image segmentation, target tracking, Transformer, multi-modality, NeRF, GAN, defect detection, salient target detection, key point detection, super-resolution reconstruction, SLAM, face, OCR, biomedical images, 3D reconstruction, attitude estimation, autonomous driving perception, depth estimation, video understanding, behavior recognition, image dehazing, image deraining, image restoration, image retrieval, lane line detection, point cloud target detection, point cloud segmentation, image compression, motion Leaders in prediction, neural network quantification, network deployment and other fields share technical knowledge, interview skills and internally recommended recruitment information from time to time .

Students who want to join the group please add WeChat ID to contact the administrator: PingShanHai666 . When adding friends, please note: school/company + research direction + nickname .

Recommended reading:

CV computer vision daily open source code Paper with code quick overview-2023.11.22

CV computer vision daily open source code Paper with code quick overview-2023.11.21

CV computer vision daily open source code Paper with code quick overview-2023.11.20

Guess you like

Origin blog.csdn.net/zhangkai950121/article/details/134693457