No need to label massive data, the new paradigm of object detection OVD makes multi-modal AGI a step forward

Follow and star

never get lost

Institute of Computer Vision

19c26b68d733695e2e6a0add96b2ffa8.gif

ff307c2d270b467408271b584d2a71f0.gif

Public Account IDComputerVisionGzq

Learning groupScan the QR code to get the joining method on the homepage

Computer Vision Research Institute column

Author: Edison_G

Target detection is a very important basic task in computer vision. Different from common image classification/recognition tasks, target detection requires the model to further give the position and size information of the target above the category of the target. In CV 3 It is in a key position connecting the preceding and the following in large tasks (identification, detection, segmentation).

Transferred from "360AI Research Institute"

51141745913822d55e3b7bba8a7a6be3.jpeg

4b9a4ec86ca6e5971f1a3656163a80d2.png

12e5004451eaa794514b2a7640664085.png

OVD basic flow diagram

30fafb8a0af94a28fcb23a1c852047e0.png

论文1:Open-Vocabulary Object Detection Using Captions

d0800aee8fb11ccf5f2ca006a37091b8.png

  • Paper address: https://arxiv.org/pdf/2011.10678.pdf

  • Code address: https://github.com/alirezazareian/ovr-cnn

860645e914f0bb294f63fbdf13aa967e.png

339850a3e629b873b2c5650562a9dd14.png

9afe5937da92e707505c3222085f85e9.png

64f52f73f3287550fba224f63b33355a.png

  • Paper address: https://arxiv.org/abs/2112.09106

  • Code address: https://github.com/microsoft/RegionCLIP

ed82845e43488eeda428e945550bbdac.png

4ac775931c25b14573350a904db3fd44.png

f7e7623335e9bd7c8fbe159041659f64.png

论文3:CORA: Adapting CLIP for Open-Vocabulary Detection with Region Prompting and Anchor Pre-Matching

568518a164753ef7d8907c1d6d8497fd.png

  • Paper address: https://arxiv.org/abs/2303.13076

  • Code address: https://github.com/tgxs002/CORA

4f5deb9bc887a4d4148fe8ecbe38032e.png

a1999b6fc9e51aada5dace74544ceb7f.png

754bcdd115cd5947a2dca227d06bc8a9.png

71bf44a204a7281f14acaf9e8a227ff7.png

© THE END 

For reprinting, please contact this official account for authorization

03beb3e0116fd037b97008c92e2e3f68.gif

The Computer Vision Research Institute study group is waiting for you to join!

ABOUT

Institute of Computer Vision

The Institute of Computer Vision is mainly involved in the field of deep learning, and is mainly committed to research directions such as face detection, face recognition, multi-target detection, target tracking, and image segmentation. The research institute will continue to share the latest new paper algorithm framework. The difference in our reform this time is that we need to focus on "research". Afterwards, we will share the practical process for the corresponding fields, so that everyone can truly experience the real scene of getting rid of the theory, and cultivate the habit of loving programming and thinking with your brain!

VX:2311123606

638544c89adef253dc2ab82eabb597e3.png

Past recommendation

Guess you like

Origin blog.csdn.net/gzq0723/article/details/131078621