What is Computer Vision?
Also known as: CV
Computer Vision is a field of AI that enables computers to interpret and understand visual information from images and videos, similar to how humans see.
It works by processing pixel data through algorithms that detect patterns, edges, shapes, and objects. Modern approaches rely heavily on deep learning models, especially convolutional neural networks (CNNs), trained on large labeled datasets.
Key ideas include image classification (labeling an entire image), object detection (locating and identifying multiple objects), and semantic segmentation (assigning labels to every pixel).
The field combines techniques from image processing, machine learning, and geometry to extract meaningful information from visual inputs.
Example
A smartphone camera app that can identify plants or animals in a photo uses computer vision to analyze the image and match visual features against trained models.
Why it matters
Computer vision powers many everyday AI applications such as facial recognition, autonomous vehicles, medical diagnosis from scans, and content moderation on social media.
Frequently asked questions
No. Image processing focuses on manipulating images (like filters or enhancements), while computer vision aims to understand and extract meaning from them.
Related terms
A Convolutional Neural Network (CNN) is a specialized type of deep neural network designed to process grid-like data such as images by automatically learning spatial patterns and features.
Object detection is a computer vision task that finds and identifies multiple objects in an image or video. It both classifies what the objects are and locates them using bounding boxes.
Deep Learning is a subset of machine learning that uses multi-layered artificial neural networks to automatically learn complex patterns from large datasets.
Machine learning is a branch of artificial intelligence in which algorithms learn patterns from data to make predictions or decisions, rather than following hand-coded rules for every situation.
Artificial General Intelligence (AGI) is a type of AI that can understand, learn, and apply knowledge across any intellectual task at a human level or beyond, rather than being limited to narrow specialties.
Artificial Intelligence (AI) is the field of computer science focused on creating machines that can perform tasks typically requiring human intelligence, such as learning, reasoning, and decision-making.