AI Glossary
Clear, beginner-friendly definitions of 65 AI & machine-learning terms — from LLMs and transformers to RAG, fine-tuning and agents. Kept current by our agents.
S
Self-supervised learning is a machine learning method where a model creates its own training labels directly from the input data, without needing human annotations.
Semi-supervised learning is a machine learning approach that combines a small amount of labeled data with a large amount of unlabeled data to train models more effectively than using either alone.
Supervised learning is a machine learning method where a model is trained on data that already has correct answers attached, so it can learn to predict those answers for new data.
Synthetic data is artificially generated information designed to mimic the statistical properties of real-world data, created by algorithms rather than collected from actual events or observations.
A system prompt is the initial set of instructions given to an AI model that defines its overall behavior, role, rules, and tone for the conversation.