Progression


The Information reports that OpenAI engineers developed an optimization that cut inference costs in half; it reduced the number of GPUs for logged-out ChatGPT traffic to a couple hundred
Reddit

Jun 30, 2026 18:06

Wonder how many of these optimisations were discovered by AI vs. humans.
