Infrastructure

Serving, hardware and MLOps.

All AI Fundamentals Machine Learning Deep Learning LLMs & Transformers Generative AI NLP AI Agents Prompting Data & Training Infrastructure Evaluation Safety & Ethics

All A B C D E F G H I J K L M N O P Q R S T U V W X Y Z

Q

Quantization

Quantization is a model optimization technique that lowers the numerical precision of weights and activations, usually converting 32-bit floats to 8-bit integers or similar lower-bit formats.