Inference Space

Performance

Latency, throughput, recent traffic, and availability from real calls — not eval scores.

Model                      Latency (p50)  Throughput  7d volume  7d uptime
GPT-5.4 (GPT)              883 ms         350 tok/s   1          100.00%
GPT-5.5 (GPT)              1,989 ms       158 tok/s   2          100.00%
Claude Sonnet 4.6 (Claude) 3,046 ms       11 tok/s    1          100.00%
Claude Opus 4.8 (Claude)   4,674 ms       3 tok/s     2          100.00%
GPT Image 2 (GPT Image 2)  26,556 ms      9 tok/s     3          100.00%
Claude Haiku 4.5 (Claude)  needs metrics                         100.00%
GPT-5.3 Codex (GPT)        needs metrics                         100.00%
GPT-5.4 Mini (GPT)         needs metrics                         100.00%
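
As a rough illustration of how figures like these could be derived from real calls, the sketch below aggregates a list of logged requests into a p50 latency, a throughput, a call volume, and an uptime percentage. The page does not say how its numbers are computed, so everything here is an assumption: the `Call` record shape, the `summarize` helper, treating uptime as the share of successful calls, and treating throughput as output tokens divided by total call time.

```python
from dataclasses import dataclass
from statistics import median


@dataclass
class Call:
    """One logged request (hypothetical record shape, not from the page)."""
    latency_ms: float   # wall-clock time for the whole call
    output_tokens: int  # tokens generated by the call
    ok: bool            # True if the call succeeded


def summarize(calls: list[Call]) -> dict:
    """Aggregate calls into dashboard-style metrics (assumed definitions)."""
    if not calls:
        # No traffic in the window: nothing to measure yet.
        return {"status": "needs metrics"}

    succeeded = [c for c in calls if c.ok]
    summary = {
        "volume": len(calls),
        # Assumption: uptime is the percentage of calls that succeeded.
        "uptime_pct": 100.0 * len(succeeded) / len(calls),
    }
    if succeeded:
        total_s = sum(c.latency_ms for c in succeeded) / 1000.0
        total_tokens = sum(c.output_tokens for c in succeeded)
        # p50 is the median latency over successful calls.
        summary["latency_p50_ms"] = median(c.latency_ms for c in succeeded)
        # Assumption: throughput is tokens per second of total call time.
        summary["throughput_tok_s"] = total_tokens / total_s if total_s > 0 else 0.0
    return summary


if __name__ == "__main__":
    sample = [
        Call(latency_ms=1000, output_tokens=100, ok=True),
        Call(latency_ms=3000, output_tokens=300, ok=True),
        Call(latency_ms=2000, output_tokens=200, ok=True),
        Call(latency_ms=500, output_tokens=0, ok=False),
    ]
    print(summarize(sample))
```

With only one to three calls per model in the 7-day window, as in the table above, a median computed this way rests on very few samples and is best read as indicative.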