This week’s post revisits how we can learn Computer Vision with the Vector robot. Computer Vision is a subset of Machine Learning that allows us to understand the details of an image. For robots, Computer Vision lends the robot the ability to perceive what it sees with its camera. This could be either using Object Detection (e.g. identifying an object in an image by placing a bounding box around the object), or Instance Segmentation (classifying the shape of an object in an image by drawing a contour around it). Prior to the exploding popularity of Large Language Models (LLMs) (triggered by ChatGPT introduced by OpenAI), Computer Vision dominated the progress of Machine Learning. While multi-model models (which can understand an image and reply with text) are growing in popularity, they don’t match the performance and accuracy of a state of the art Object Detection or Image Segmentation model. So let’s try to understand these concepts in this post.
Try Roboflow with Vector
One of our po…