Serving, hardware and MLOps.
Throughput measures how much work an AI system completes in a given time, such as the number of model inferences or training examples processed per second.