Computer Vision Research Institute column
Author: Edison_G
Object detection is a fundamental task in computer vision. Unlike ordinary image classification or recognition, object detection requires the model to report not only the category of each object but also its position and size. Among the three major CV tasks (recognition, detection, segmentation), it occupies the pivotal middle position, building on recognition and feeding into segmentation.
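To make the difference concrete, here is a minimal sketch of what a classifier returns versus what a detector returns. The class names, field names, and the `(x_min, y_min, x_max, y_max)` pixel box convention are assumptions chosen for illustration; they are not taken from any particular library.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass
class Classification:
    """Output of an image classifier: one label for the whole image."""
    label: str
    score: float


@dataclass
class Detection:
    """Output of a detector: a label plus where the object is and how big."""
    label: str
    score: float
    # Assumed convention: (x_min, y_min, x_max, y_max) in pixels.
    box: Tuple[float, float, float, float]

    @property
    def center(self) -> Tuple[float, float]:
        # Position of the object: midpoint of the box.
        x0, y0, x1, y1 = self.box
        return ((x0 + x1) / 2, (y0 + y1) / 2)

    @property
    def size(self) -> Tuple[float, float]:
        # Size of the object: (width, height) of the box.
        x0, y0, x1, y1 = self.box
        return (x1 - x0, y1 - y0)


cls = Classification(label="dog", score=0.9)
det = Detection(label="dog", score=0.9, box=(10.0, 20.0, 110.0, 220.0))
print(cls.label)              # category only
print(det.label, det.center, det.size)  # category plus position and size
```

A detector typically returns a list of such records per image, one per object found.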
Reposted from "360AI Research Institute"
Figure: Basic pipeline of open-vocabulary detection (OVD)
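As a rough illustration of the classification step in open-vocabulary detection, the sketch below labels a region by comparing its embedding with text embeddings of arbitrary category names and picking the most similar one. It assumes region and text embeddings already live in a shared space; the 3-dimensional vectors are made-up toy values, and the function names `cosine` and `classify_region` are hypothetical. It is not the method of any specific paper listed here, only the general idea under those assumptions.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def classify_region(region_emb, text_embs):
    """Return (best_name, similarity) for one region embedding.

    text_embs maps a category name to its text embedding. Because the
    vocabulary is just a dict of names, new categories can be added at
    inference time without retraining a fixed classifier head.
    """
    scores = {name: cosine(region_emb, emb) for name, emb in text_embs.items()}
    best = max(scores, key=scores.get)
    return best, scores[best]


# Toy embeddings (assumed values; real ones would come from image and text encoders).
text_embs = {
    "cat": [1.0, 0.0, 0.0],
    "dog": [0.0, 1.0, 0.0],
    "zebra": [0.0, 0.0, 1.0],
}
region_emb = [0.1, 0.2, 0.9]

label, sim = classify_region(region_emb, text_embs)
print(label, round(sim, 3))
```

Adding a new category only requires a new entry in `text_embs`, which is what makes the vocabulary "open" in this sketch.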
Paper 1: Open-Vocabulary Object Detection Using Captions
Paper address: https://arxiv.org/pdf/2011.10678.pdf
Code address: https://github.com/alirezazareian/ovr-cnn
Paper 2: RegionCLIP: Region-based Language-Image Pretraining
Paper address: https://arxiv.org/abs/2112.09106
Code address: https://github.com/microsoft/RegionCLIP
Paper 3: CORA: Adapting CLIP for Open-Vocabulary Detection with Region Prompting and Anchor Pre-Matching
Paper address: https://arxiv.org/abs/2303.13076
Code address: https://github.com/tgxs002/CORA