Skip to content
Sign in

What is CUDA?

CUDA is NVIDIA's platform and programming model that lets developers run general-purpose computations on NVIDIA GPUs instead of just CPUs.

It exposes the GPU's thousands of cores for parallel tasks by extending languages like C++ and providing an API to launch kernels that execute across many threads simultaneously.

CUDA includes optimized libraries such as cuBLAS and cuDNN that accelerate common linear-algebra and deep-learning operations without requiring low-level GPU coding.

Frameworks like PyTorch and TensorFlow automatically use CUDA when an NVIDIA GPU is present, so most users benefit from GPU speedups without writing CUDA code directly.

Example

A researcher training a ResNet model on ImageNet can switch from CPU to a CUDA-enabled GPU and see training time drop from weeks to a few days because matrix multiplications run in parallel on the GPU cores.

Why it matters

Virtually all large-scale AI training and much of inference today depends on CUDA-enabled GPUs, making it the de-facto standard infrastructure layer for modern deep learning.

Frequently asked questions

No. Popular frameworks handle CUDA calls automatically; you only need CUDA installed and a compatible NVIDIA GPU.