We use only essential, cookie‑free logs by default. Turn on analytics to help us improve. Read our Privacy Policy.
Kenaz
← Back to Glossary

Context Window

Definition

A context window is the maximum amount of information, measured in tokens, that a language model can consider at once during a single inference step.

Purpose

The purpose of the context window is to bound the information available to the model at inference time, directly influencing what the model can attend to, reason over, and respond based on.

Key Characteristics

  • Fixed or model-dependent maximum token capacity
  • Includes system instructions, user input, retrieved data, and intermediate context
  • Limits the amount of information a model can directly reason over in a single inference
  • Requires prioritization, summarization, or pruning of context in complex systems
  • Distinct from long-term storage or agent memory mechanisms

Usage in Practice

In practice, the context window is managed by selecting, ordering, and compressing inputs so that the most relevant information fits within the model's token limits during each inference call.

One implementation of this concept is offered by Kenaz through the Semantic Engineering service.

Related Terms