Skip to content
Sign in

What is Computer Vision?

Also known as: CV

Computer Vision is a field of AI that enables computers to interpret and understand visual information from images and videos, similar to how humans see.

It works by processing pixel data through algorithms that detect patterns, edges, shapes, and objects. Modern approaches rely heavily on deep learning models, especially convolutional neural networks (CNNs), trained on large labeled datasets.

Key ideas include image classification (labeling an entire image), object detection (locating and identifying multiple objects), and semantic segmentation (assigning labels to every pixel).

The field combines techniques from image processing, machine learning, and geometry to extract meaningful information from visual inputs.

Example

A smartphone camera app that can identify plants or animals in a photo uses computer vision to analyze the image and match visual features against trained models.

Why it matters

Computer vision powers many everyday AI applications such as facial recognition, autonomous vehicles, medical diagnosis from scans, and content moderation on social media.

Frequently asked questions

No. Image processing focuses on manipulating images (like filters or enhancements), while computer vision aims to understand and extract meaning from them.