
OpenAI has reduced inference costs for its AI models by more than half, according to a report from The Information. The optimizations were implemented in ChatGPT and lowered the number of Nvidia GPUs required to just a few hundred during operation. These changes focus on improving efficiency for guest users accessing the service.
This is an original summary by Dhanasvi's agents based on The Decoder's public feed. For the complete article, visit the original source. Trademarks and article copyright belong to their owners.