AI Computing Power Costs Soar, GPU Prices Fluctuate Like Commodities

Stock News07-04 19:41

The cost of AI infrastructure is undergoing significant volatility, with the unpredictable pricing of GPU servers becoming a central challenge for cloud providers and AI developers. According to a report, driven by tight supply of memory chips and other key components, the price of Nvidia AI servers has been rising over the past several months, with costs for some components fluctuating by as much as 40% in a single week. This situation has forced multiple cloud service providers to increase their rental prices for AI developers. GPU cloud service provider Nebius raised its on-demand computing power rental prices by approximately 30% on June 1, followed by Amazon AWS announcing a roughly 20% price increase for its EC2 capacity blocks effective July 1. This intense price volatility is reshaping the cost structure of the entire AI computing power market.

Carmen Li, CEO of price data provider Silicon Data, noted that the GPU rental prices charged by cloud providers to customers are beginning to exhibit characteristics similar to commodities like oil, driven by supply and demand dynamics. Small and medium-sized customers relying on on-demand rentals are bearing the brunt of this shift, and a lack of transparency in market pricing mechanisms is further exacerbating the information disadvantage for buyers.

Component Cost Swings and Narrow Pricing Windows

The instability in GPU server pricing stems from severe tightness in the upstream component supply chain. A source selling Nvidia servers to cloud providers revealed that the cost of components required for server racks can fluctuate by up to 40% weekly. These components include input wafers from TSMC, co-packaging, networking, cooling, and most significantly, memory. This source stated that GPU server rack prices are "fluctuating wildly," with everything potentially changing completely within two to three weeks, making price trends impossible to predict. Prices can only be locked in within an extremely short window, preventing longer-term cost planning.

An executive at a GPU cloud service provider noted that the server racks they procure have recently been increasing in price by about 2% to 3% weekly. An executive at a competing firm pointed out that the NVMe storage drives in Nvidia's Grace Blackwell 300 racks are a primary source of price volatility. Fluctuations were "very severe" several months ago, and current rack costs are 10% to 15% above what they consider a "baseline price." The upward trend for GB300 racks now appears to be stabilizing, with monthly increases around 1%.

The impact of these price swings is magnified by the sheer monetary amounts involved. A single rack fully loaded with Grace Blackwell 300 chip systems costs approximately $70,000 per system. A fully configured 72-system rack totals around $5 million, with some clients making one-time purchases of thousands of units. A client executive procuring Vera Rubin racks revealed that these racks are expected to cost about $7 million each.

Pricing Power Concentrated Upstream

Behind this cost surge is a high concentration of pricing power along the supply chain. The aforementioned server sales source stated that Nvidia "can pretty much ask for any price." A Nvidia spokesperson responded that pricing depends on server rack component costs, and the company collaborates with server providers on pricing, which may vary between different providers. Data shows Nvidia's gross margin has increased by 15 to 20 percentage points over the past few years, underscoring its strong market pricing power.

Simultaneously, memory chip manufacturers like Micron are exerting similar pricing pressure on Nvidia and other customers, driving price increases across a range of products from Apple Macs to Nvidia GPUs. Carmen Li pointed out that once chips leave Nvidia, the rental prices charged by cloud providers begin to follow the supply-and-demand logic typical of commodity markets. Her data indicates that the rental price for Blackwell 200 chips has risen about 20% this year. Rental prices for older Nvidia chip models, after accumulating over 20% increases in the past year, have largely stabilized over the last 30 days.

Small Clients Bear Greatest Pressure Amid Opaque Market

In this wave of price increases, customers relying on on-demand compute rentals are in the most vulnerable position. Cloud providers are testing the upper limits of pricing in the current tight GPU supply environment and may prioritize server resources for large clients, reducing the computing power available for smaller customers. However, the price trend is not one-directional. An executive at an AI model development firm noted that after prices doubled over the previous one to two months, they have actually decreased in the last two weeks. This divergence reflects that the GPU cloud service market is still in a relatively early stage, compounded by a surge in the number of GPU cloud providers and a market structure that is not yet solidified.

A lack of pricing transparency further increases uncertainty for buyers. GPU cloud service providers typically do not publicly disclose actual prices, which effectively places pricing power in the hands of the providers rather than the customers. An investor in a GPU cloud service provider expressed concern: "For our core clients, there is a tipping point—once the economics no longer make sense, their businesses become unsustainable, and we absolutely do not want to hit that red line." This statement reveals that the continuous rise in computing power costs will ultimately impose a substantial constraint on the commercial viability of the AI application layer.

Disclaimer: Investing carries risk. This is not financial advice. The above content should not be regarded as an offer, recommendation, or solicitation on acquiring or disposing of any financial products, any associated discussions, comments, or posts by author or other users should not be considered as such either. It is solely for general information purpose only, which does not consider your own investment objectives, financial situations or needs. TTM assumes no responsibility or warranty for the accuracy and completeness of the information, investors should do their own research and may seek professional advice before investing.

Comments

We need your insight to fill this gap
Leave a comment