What is Autoregressive Model?
AI FundamentalsLast updated:
A model that generates output one token at a time, conditioning each new token on all previously generated tokens.
Most modern LLMs are autoregressive: they predict the next token based on the full preceding sequence. This sequential generation enables coherent long-form text but means output length directly impacts latency.