What is Token Limit?
AI EngineeringLast updated:
The maximum number of tokens an LLM can process in a single request, combining both input and output.
Token limits constrain how much text you can send and receive in one API call. Exceeding the limit causes errors or truncation. Managing token budgets across system prompts, context, and responses is key to production AI design.