Peektastic.com

Leaked finances hint that OpenAI's inference may be swallowing its revenue

According to Techcrunch and blogger Ed Zitron, new internal documents outline how much money is moving between OpenAI and Microsoft. The documents suggest that the cost of simply running OpenAI's models is enormous. Based on these figures, a profitable OpenAI still seems far off.<br /> The article Leaked finances hint that OpenAI's inference may be swallowing its revenue appeared first on THE DECODER. [...]

Discover Copy

Rating

Innovation

Pricing

Technology

Usability

We have discovered similar tools to what you are looking for. Check out our suggestions for similar AI tools.

venturebeat

Cerebras stock nearly doubles on day one as AI chipmaker hits $100 billion — what it means for AI infrastructure

Cerebras Systems, the Silicon Valley chipmaker that built the world's largest commercial AI processor, erupted onto the Nasdaq on Wednesday, opening at $350 per share — nearly double its $185 I [...]

More Copy

Match Score: 176.94

venturebeat

Microsoft and OpenAI gut their exclusive deal, freeing OpenAI to sell on AWS and Google Cloud

Microsoft and OpenAI on Monday announced a sweeping overhaul of the partnership that has defined the commercial AI era, dismantling key pillars of exclusivity and revenue-sharing that bound the two co [...]

More Copy

Match Score: 110.37

venturebeat

Baseten takes on hyperscalers with new AI training platform that lets you own your model weights

Baseten, the AI infrastructure company recently valued at $2.15 billion, is making its most significant product pivot yet: a full-scale push into model training that could reshape how enterprises wean [...]

More Copy

Match Score: 107.95

venturebeat

5% GPU utilization: The $401 billion AI infrastructure problem enterprises can't keep ignoring

For the last 24 months, one narrative justified every over-provisioned data center and bloated IT budget: the GPU scramble. Silicon was the new oil, and H100s traded like contraband. Reserve capacity [...]

More Copy

Match Score: 98.98

venturebeat

OpenAI deploys Cerebras chips for 15x faster code generation in first major move beyond Nvidia

OpenAI on Thursday launched GPT-5.3-Codex-Spark, a stripped-down coding model engineered for near-instantaneous response times, marking the company's first significant inference partnership outsi [...]

More Copy

Match Score: 93.89

venturebeat

Perplexity AI unveils hybrid local-cloud inference system at Computex 2026

Perplexity AI, the fast-growing search startup now valued at $20 billion, unveiled what it calls the first hybrid local-server inference orchestrator at Computex 2026 on Monday night, demonstrating so [...]

More Copy

Match Score: 85.04

venturebeat

OpenAI unveils first custom AI inference chip, Jalapeño, with Broadcom — and its development was sped-up with OpenAI's own models

OpenAI and Broadcom this morning unveiled their first custom AI accelerator chip named "Jalapeño," positioning it is as a purpose-built processor for large language model (LLM) inference, r [...]

More Copy

Match Score: 80.11

venturebeat

The team behind continuous batching says your idle GPUs should be running inference, not sitting dark

Every GPU cluster has dead time. Training jobs finish, workloads shift and hardware sits dark while power and cooling costs keep running. For neocloud operators, those empty cycles are lost margin.The [...]

More Copy

Match Score: 76.53

venturebeat

Together AI's ATLAS adaptive speculator delivers 400% inference speedup by learning from workloads in real-time

Enterprises expanding AI deployments are hitting an invisible performance wall. The culprit? Static speculators that can't keep up with shifting workloads.Speculators are smaller AI models that w [...]

More Copy

Match Score: 75.89