Main tasks of computer vision

Computer vision: the simulation of biological vision using computers and related equipment.
It mainly covers six major tasks: image classification, object detection, target tracking, semantic segmentation, instance segmentation, and image reconstruction.
Image classification: an image processing method that distinguishes targets of different categories based on the features reflected in the image information. Classical approaches include indexing based on color features and classification based on texture, shape, or spatial relationships.
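
As a rough illustration of classification from color features, the sketch below builds a per-channel color histogram and assigns the label of the nearest stored histogram. The function names, the bin count, and the toy "images" (flat lists of RGB pixels) are assumptions made for this example, not part of the original article.

```python
from typing import Dict, List, Tuple

Pixel = Tuple[int, int, int]  # (R, G, B), each in 0-255


def color_histogram(pixels: List[Pixel], bins: int = 4) -> List[float]:
    """Normalized per-channel histogram used as a simple color feature."""
    hist = [0.0] * (3 * bins)
    for pixel in pixels:
        for channel, value in enumerate(pixel):
            bin_index = min(value * bins // 256, bins - 1)
            hist[channel * bins + bin_index] += 1
    total = float(len(pixels)) or 1.0  # avoid dividing by zero on empty input
    return [count / total for count in hist]


def classify(pixels: List[Pixel], prototypes: Dict[str, List[float]], bins: int = 4) -> str:
    """Return the label whose prototype histogram is closest in L1 distance."""
    feature = color_histogram(pixels, bins)
    return min(
        prototypes,
        key=lambda label: sum(abs(a - b) for a, b in zip(feature, prototypes[label])),
    )


# Hypothetical toy data: flat pixel lists standing in for real images.
sky = [(30, 90, 220)] * 16
grass = [(40, 200, 50)] * 16
prototypes = {"sky": color_histogram(sky), "grass": color_histogram(grass)}

query = [(35, 100, 210)] * 12 + [(40, 200, 50)] * 4  # mostly blue pixels
print(classify(query, prototypes))  # sky
```

Texture, shape, or spatial features would slot into the same pattern by swapping out the feature function.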
Object detection: locating targets in an image based on the geometric and statistical characteristics of objects. It combines the segmentation of targets with their recognition, and its accuracy and real-time performance are important capabilities of the whole system.
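
Detection accuracy is commonly scored by how well a predicted bounding box overlaps the true one. The sketch below computes intersection over union (IoU) for two axis-aligned boxes. The box layout `(x_min, y_min, x_max, y_max)` and the sample coordinates are assumptions chosen for this example.

```python
from typing import Tuple

Box = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max)


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    # Width and height of the overlap, clamped to zero when the boxes are disjoint.
    overlap_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    overlap_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    intersection = overlap_w * overlap_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - intersection
    return intersection / union if union > 0 else 0.0


predicted = (10, 10, 50, 50)      # hypothetical detector output
ground_truth = (30, 30, 70, 70)   # hypothetical annotation
print(iou(predicted, ground_truth))  # about 0.143 (400 / 2800)
```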
Target tracking: the position of the target in the first frame of a video is given as a bounding box, and the task is to predict the target's bounding box in the other frames.
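
One very simple way to follow a box from the first frame into a later one is template matching: crop the target from the first frame, then search the next frame for the window that differs least from it. The sketch below does this with a brute-force sum of absolute differences on tiny grayscale grids. It is a toy stand-in under these assumptions, not a description of how modern trackers work, and all names are made up for the example.

```python
from typing import List, Tuple

Frame = List[List[int]]          # grayscale image as rows of intensities
Box = Tuple[int, int, int, int]  # (row, col, height, width)


def crop(frame: Frame, box: Box) -> Frame:
    r, c, h, w = box
    return [row[c:c + w] for row in frame[r:r + h]]


def sad(a: Frame, b: Frame) -> int:
    """Sum of absolute differences between two equally sized patches."""
    return sum(abs(x - y) for row_a, row_b in zip(a, b) for x, y in zip(row_a, row_b))


def track(first_frame: Frame, first_box: Box, next_frame: Frame) -> Box:
    """Return the box in next_frame that best matches the first-frame target."""
    template = crop(first_frame, first_box)
    _, _, h, w = first_box
    candidates = [
        (r, c, h, w)
        for r in range(len(next_frame) - h + 1)
        for c in range(len(next_frame[0]) - w + 1)
    ]
    return min(candidates, key=lambda box: sad(template, crop(next_frame, box)))


def make_frame(row: int, col: int) -> Frame:
    """Hypothetical 6x6 frame with a 2x2 bright target at (row, col)."""
    frame = [[0] * 6 for _ in range(6)]
    frame[row][col:col + 2] = [9, 8]
    frame[row + 1][col:col + 2] = [7, 6]
    return frame


frame_1 = make_frame(1, 1)
frame_2 = make_frame(3, 2)  # the target has moved
print(track(frame_1, (1, 1, 2, 2), frame_2))  # (3, 2, 2, 2)
```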
Semantic segmentation: classifying every pixel in the image. It is widely used in medical imaging and autonomous driving. Objects of the same category are not distinguished from one another, so sheep 1 and sheep 2 receive the same label.
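
Per-pixel classification can be pictured as picking, for each pixel, the class with the highest score. The sketch below assumes a model has already produced a list of class scores for every pixel; the scores and class names are invented for the example.

```python
from typing import List


def segment(scores: List[List[List[float]]], class_names: List[str]) -> List[List[str]]:
    """Turn per-pixel class scores into a label map by taking the arg max."""
    label_map = []
    for row in scores:
        label_row = []
        for pixel_scores in row:
            best = max(range(len(pixel_scores)), key=lambda k: pixel_scores[k])
            label_row.append(class_names[best])
        label_map.append(label_row)
    return label_map


CLASSES = ["background", "sheep"]  # hypothetical class list
scores = [
    [[0.9, 0.1], [0.2, 0.8], [0.7, 0.3]],
    [[0.8, 0.2], [0.1, 0.9], [0.4, 0.6]],
]
label_map = segment(scores, CLASSES)
for row in label_map:
    print(row)
```

Every "sheep" pixel gets the same label here, no matter which animal it belongs to.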
Instance segmentation: separating the individual instances within each category, for example marking ten cars with ten different colors (car 1 and car 2 must be distinguished by different colors).
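
To see what "one id per object" looks like in data, the sketch below splits a binary mask for one class into instances by connected-component labeling. This is a deliberately naive stand-in: it assumes objects do not touch, whereas real instance segmentation models predict a separate mask per object. The function name and the toy mask are assumptions for the example.

```python
from collections import deque
from typing import List


def label_instances(mask: List[List[int]]) -> List[List[int]]:
    """Give each 4-connected region of 1s its own id (1, 2, ...); 0 stays 0."""
    rows, cols = len(mask), len(mask[0])
    labels = [[0] * cols for _ in range(rows)]
    next_id = 0
    for r in range(rows):
        for c in range(cols):
            if mask[r][c] and labels[r][c] == 0:
                next_id += 1
                labels[r][c] = next_id
                queue = deque([(r, c)])
                while queue:  # breadth-first flood fill of this region
                    y, x = queue.popleft()
                    for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                        if (0 <= ny < rows and 0 <= nx < cols
                                and mask[ny][nx] and labels[ny][nx] == 0):
                            labels[ny][nx] = next_id
                            queue.append((ny, nx))
    return labels


# Hypothetical "car" mask containing three separate cars.
car_mask = [
    [1, 1, 0, 0, 1],
    [1, 1, 0, 0, 1],
    [0, 0, 0, 0, 0],
    [0, 1, 1, 0, 0],
]
instance_map = label_instances(car_mask)
for row in instance_map:
    print(row)
```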
Panoptic segmentation: a combination of semantic segmentation and instance segmentation. The difference from instance segmentation is that instance segmentation only detects the objects in an image and segments those detected objects, while panoptic segmentation detects and segments everything in the image, including the background.
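
The combination can be sketched as pairing each pixel's class with an instance id: countable objects keep their own id, while background-like classes share id 0. The class names, the `THINGS` set, and the two input maps below are assumptions made up for this example.

```python
from typing import List, Tuple

THINGS = {"car"}  # hypothetical set of countable classes; everything else is background-like


def panoptic(semantic: List[List[str]], instances: List[List[int]]) -> List[List[Tuple[str, int]]]:
    """Merge a class map and an instance-id map into (class, instance_id) pairs."""
    return [
        [(cls, inst if cls in THINGS else 0) for cls, inst in zip(sem_row, inst_row)]
        for sem_row, inst_row in zip(semantic, instances)
    ]


semantic_map = [
    ["sky", "sky", "sky"],
    ["car", "road", "car"],
]
instance_ids = [
    [0, 0, 0],
    [1, 0, 2],
]
result = panoptic(semantic_map, instance_ids)
segments = {pixel for row in result for pixel in row}
print(sorted(segments))  # [('car', 1), ('car', 2), ('road', 0), ('sky', 0)]
```

Every pixel ends up with a label, and the two cars remain separate segments.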
Image reconstruction: in layman's terms, if you have an old photo and want to restore the damaged parts of the scene in it, that is image reconstruction.
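
As a toy picture of filling in damaged pixels, the sketch below repeatedly replaces each damaged pixel with the average of its four neighbors, so that surrounding intensities diffuse into the hole. This is only one simple, classical-style fill under assumed inputs (a grayscale grid plus a damage mask); it is not the method of any particular reconstruction system.

```python
from typing import List


def inpaint(image: List[List[float]], damaged: List[List[bool]], iterations: int = 50) -> List[List[float]]:
    """Fill damaged pixels by iteratively averaging their 4-neighbors."""
    rows, cols = len(image), len(image[0])
    # Start damaged pixels at 0 so their corrupted values are never used.
    result = [
        [0.0 if damaged[r][c] else float(image[r][c]) for c in range(cols)]
        for r in range(rows)
    ]
    for _ in range(iterations):
        updated = [row[:] for row in result]
        for r in range(rows):
            for c in range(cols):
                if not damaged[r][c]:
                    continue
                neighbors = [
                    result[ny][nx]
                    for ny, nx in ((r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1))
                    if 0 <= ny < rows and 0 <= nx < cols
                ]
                if neighbors:
                    updated[r][c] = sum(neighbors) / len(neighbors)
        result = updated
    return result


# Hypothetical 3x3 horizontal gradient whose center pixel is scratched (255).
photo = [
    [0, 50, 100],
    [0, 255, 100],
    [0, 50, 100],
]
scratch = [
    [False, False, False],
    [False, True, False],
    [False, False, False],
]
restored = inpaint(photo, scratch)
print(restored[1][1])  # 50.0
```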

Origin blog.csdn.net/weixin_42620513/article/details/132558164