What is Throughput?
AI Engineering
The number of AI requests or tokens a system can process per unit of time.
Throughput measures a system's capacity to handle concurrent AI requests, and is usually expressed as requests per second or tokens per second. Optimization strategies include batching, model parallelism, and efficient infrastructure scaling.
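As a rough illustration of the definition, the sketch below computes both forms of throughput over a measurement window. The `CompletedRequest` type, the `throughput` helper, and the sample numbers are hypothetical, chosen only to show the arithmetic; a real system would take these counts from its own logs or metrics.

```python
from dataclasses import dataclass


@dataclass
class CompletedRequest:
    # Hypothetical record: tokens generated for one finished request.
    output_tokens: int


def throughput(requests, elapsed_seconds):
    """Return (requests per second, tokens per second) for a measurement window."""
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    total_tokens = sum(r.output_tokens for r in requests)
    return len(requests) / elapsed_seconds, total_tokens / elapsed_seconds


# Illustrative window: 4 requests completed within 2 seconds.
window = [
    CompletedRequest(100),
    CompletedRequest(250),
    CompletedRequest(50),
    CompletedRequest(200),
]
rps, tps = throughput(window, 2.0)
print(f"{rps:.1f} requests/s, {tps:.1f} tokens/s")  # prints "2.0 requests/s, 300.0 tokens/s"
```

The two figures can diverge: a system serving a few long responses may show low requests per second but high tokens per second, so it helps to state which unit is being reported.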