What is Zero-Shot Learning?
Also known as: Zero-Shot
Zero-shot learning (or zero-shot prompting) is when an AI model completes a task it has never seen examples of during training, relying only on a natural language description of what to do.
The model draws on knowledge acquired during pre-training to interpret the task instructions and generate an appropriate response without any task-specific examples or fine-tuning.
It works by framing the request as a clear prompt that describes the desired input-output behavior, allowing the model to generalize from its broad understanding of language and concepts.
This approach contrasts with supervised learning, which requires labeled data, and few-shot learning, which supplies a handful of examples in the prompt.
Example
A user asks a language model to classify movie reviews as positive or negative by simply writing 'Classify the sentiment of this review as positive or negative' followed by the review text, without showing any labeled examples.
Why it matters
Zero-shot capabilities let users apply powerful pretrained models to new tasks instantly, reducing the need for data collection and retraining and making advanced AI more accessible and flexible.
Frequently asked questions
Zero-shot uses only instructions with no examples, while few-shot includes a small number of labeled examples in the prompt to guide the model.
Related terms
Few-shot learning (in prompting) is a technique where a language model is given a handful of input-output examples directly in the prompt to guide it on a new task.
Prompt engineering is the practice of designing and refining text inputs (prompts) to guide AI models like large language models toward producing accurate, relevant, or creative outputs.
Transfer learning is a machine learning method that reuses a model trained on one task as the starting point for a different but related task.
Fine-tuning is the process of taking a pre-trained AI model and continuing its training on a smaller, task-specific dataset to adapt it for a particular use case.
Chain-of-Thought (CoT) is a prompting technique that asks an AI model to generate intermediate reasoning steps before giving a final answer, helping it solve complex problems more reliably.
A prompt is the input text, question, or instruction given to an AI model (especially a large language model) to guide what it should generate or how it should respond.