The Information reports that OpenAI engineers developed an optimization that cut inference costs in half; it reduced the number of GPUs for logged out ChatGPT traffic to a couple hundred

The Information reports that OpenAI engineers developed an optimization that cut inference costs in half; it reduced the number of GPUs for logged out ChatGPT traffic to a couple hundredReddit

Jun 30, 2026 18:06

Wonder how much of these optimisations were discovered by AI vs humans.

This tweet cannot be embedded.
Open direct link