Get in the Token Path
As always, these posts are more of a brain dump of “what I’m thinking” about…And lately I’ve been thinking about a pattern that keeps showing up when I study the biggest infrastructure winners of the cloud era, and what it means for AI companies today.
Here’s the general idea: the biggest infrastructure winners of the cloud era monetized the core consumption primitive of the platform. In the cloud era, that primitive was compute, storage, and network I/O. In the AI era, it increasingly looks like tokens.
Let’s unpack.
When cloud computing first started taking off, the core primitive of the platform was very clear: compute. Everything that happened in the cloud ultimately boiled down to compute cycles running somewhere inside a data center. Storage, networking, and databases all mattered of course, but the engine driving the system was compute.
That had an interesting downstream effect. The companies that ultimately became the biggest infrastructure businesses of the cloud era found ways to align their revenue directly with compute activity (or they charged directly for compute). They owned the meter.
$Amazon.com(AMZN)$ AWS and the other hyperscalers obviously did this, their business’ are literally selling compute hours. The more workloads move to the cloud, the more compute gets consumed, the more AWS and the hyperscalers made. But it wasn’t just the cloud providers.
Let’s look at some of the infra leaders of the cloud buildout.
Databricks monetizes job compute. Every time a customer runs a data pipeline, trains a model, or processes a workload, Databricks' revenue grows automatically. $Snowflake(SNOW)$ monetizes query compute, similar story. Every new query, every new dataset, every new workload meant more revenue without having to sell a single new seat. $Datadog(DDOG)$ monetizes telemetry generated by compute workloads. Every new microservice, every new container, every new cloud instance generates incremental Datadog revenue. More cloud compute = more Datadog revenue. Cloudflare monetizes requests generated by applications running on compute. MongoDB charges based on storage and compute consumed through Atlas.
The details vary, but the pattern is remarkably consistent. The biggest companies ended up sitting directly in the execution path of workloads, with pricing models that scaled automatically as compute activity increased (and compute was one of the core primitives of the cloud buildout).
And this is the key insight - these companies didn’t just have “consumption pricing.” Lots of companies have consumption pricing and grow slowly. What made these companies special is that their consumption unit was the same unit the entire ecosystem was scaling. When the world spun up more compute, these companies grew without doing anything. Their revenue was structurally coupled to the platform’s growth vector.
That might seem obvious today, but during the early years of the cloud buildout many infrastructure companies were still trying to monetize software the old way. Perpetual licenses, term licenses, maintenance contracts, support subscriptions layered on top of open source software. These models worked well in the on-premise world where infrastructure growth was slow, predictable, and tightly controlled.
But cloud computing fundamentally changed the underlying economics. Workloads could scale instantly. Compute consumption could grow by orders of magnitude. The companies that adapted to this new world quickly built the biggest outcomes. The ones that didn’t…well, the contrast is striking.
Docker might be the most instructive example. Docker was containerization. They were one in the same. It was the technology that made cloud-native development possible. Millions of developers used it. Arguably the most important developer tool of the cloud era. But Docker never figured out how to monetize the primitive. They couldn’t connect their massive developer adoption to the underlying compute spend that containers enabled. Kubernetes (open sourced by Google) ate their orchestration business, and every hyperscaler ended up monetizing Docker’s own innovation through managed container services. Docker enabled billions of dollars in compute spend…they just never captured any of it (they have been doing a much better job over the last few years, however. This commentary is more geared towards their origins).
The common thread across Docker and others who gained massive adoption but ran into some sort of business model wall was similar. Most were deeply embedded in the cloud infrastructure stack and were critical tooling. But when they failed, it was often because they didn’t figure out how to make their revenue a derivative of the core consumption primitive. They monetized adjacently, through seats, support contracts, consulting, and the market rewarded them accordingly. Or rather, didn’t.
Compare these outcomes to Snowflake, Datadog, and Cloudflare. Same era. Same underlying platform. But radically different business models, and radically different outcomes. The difference? The winners owned the meter. They put themselves directly in the path of the underlying compute primitive.
Now Map this to AI
If cloud infrastructure was built on the primitive of compute, AI infrastructure is being built on a different primitive: tokens.
Every AI workload ultimately boils down to tokens being generated, processed, and consumed by models. Prompts become tokens. Context becomes tokens. Responses become tokens. Agents running multi-step workflows can generate enormous volumes of tokens as they reason through tasks. Tokens are the atomic unit of work in modern AI systems.
And once you start looking at the ecosystem through this lens, a very familiar pattern emerges.
The model providers like OpenAI and Anthropic - they literally are the token primitive (like the hyperscalers were the compute / storage primitive for the cloud buildout). They charge per token in, per token out.
But it’s not just the model providers. The fastest-ramping AI companies today are the ones sitting directly in the token path.
Coding agents are the standout. Cursor reportedly hit $2B ARR recently according to online reports. Every keystroke, every code completion, every agent action triggers inference, and their business model has evolved from simply charging per seats (these seats now come with usage limits!). Their revenue is structurally coupled to token consumption.
Inference cloud companies (Inferact, Baseten, Fireworks, Together, Baseten etc) are essentially building the “AWS of tokens.” They’re selling the raw primitive.
The companies that sit closest to the generation and consumption of tokens are the ones whose revenue naturally scales with AI activity. Meanwhile, other parts of the AI ecosystem are experimenting with more traditional SaaS pricing models, seat-based developer tools, platform subscriptions, enterprise licenses layered on top of open source frameworks.
Those businesses may still succeed. Many developer tools companies built valuable franchises during the cloud era. But if history is any guide, the biggest infrastructure companies tend to emerge where the core unit of platform activity is measured and monetized. And we already have a pretty clear set of counterexamples from the cloud era showing what happens when you don’t.
Now, being in the token path is necessary but not sufficient. This is an important nuance.
The pure-play CDN companies of the cloud era were technically “in the compute path.” They charged based on bandwidth and requests. Traffic was exploding. They should have been massive winners. But bandwidth turned out to be a commodity. Prices compressed relentlessly. Limelight Networks had record traffic during the streaming boom of 2020-2021 and declining revenue at the same time. They eventually rebranded (to Edgio) and went bankrupt. Meanwhile Cloudflare, which started in a similar spot, layered on security, developer tools, and edge compute, building real differentiation and switching costs on top of the primitive. Same starting point, radically different outcomes.
The lesson for AI founders: get in the token path, but build something differentiated on top of it. Don’t just be a pipe that tokens flow through. Be the layer that makes those tokens more valuable, whether that’s through better developer experience (Cursor), specialized vertical models, security and compliance tooling, or proprietary data moats.
There’s also a timing dimension. In the cloud era, the companies that established themselves as defaults early in the compute path captured the most value. Datadog, Snowflake, and Cloudflare all got to scale before the primitives became fully commoditized. The implication: the window to get into the token path is now. Inference costs are dropping rapidly (which is great, it means more tokens consumed, but it also means per-unit economics compress over time). You need to be in the path and build a moat before that happens.
The biggest infrastructure winners of the cloud era monetized the core consumption primitive. The ones that didn’t, even the ones with massive adoption, beloved products, and deep integration into the stack, produced dramatically smaller outcomes.
The primitive has changed. It’s no longer compute cycles. It’s tokens. And if the pattern holds, the most valuable AI infrastructure companies of the next decade will be the ones that find a way to get themselves directly into the token path.
If you own the meter, growth takes care of itself.
For SG users only, Welcome to open a CBA today and enjoy access to a trading limit of up to SGD 20,000 with unlimited trading on SG, HK, and US stocks, as well as ETFs.
🎉Cash Boost Account Now Supports 35,000+ Stocks & ETFs – Greater Flexibility Now
Find out more here.
Complete your first Cash Boost Account trade with a trade amount of ≥ SGD1000* to get SGD 688 stock vouchers*! The trade can be executed using any payment type available under the Cash Boost Account: Cash, CPF, SRS, or CDP.
Other helpful links:
-
💰Join the TB Contra Telegram Group to Get $10 Trading Vouchers Now🎉
-
How to open a CBA. How to link your CDP account. Other FAQs on CBA. Cash Boost Account Website.
Disclaimer: Investing carries risk. This is not financial advice. The above content should not be regarded as an offer, recommendation, or solicitation on acquiring or disposing of any financial products, any associated discussions, comments, or posts by author or other users should not be considered as such either. It is solely for general information purpose only, which does not consider your own investment objectives, financial situations or needs. TTM assumes no responsibility or warranty for the accuracy and completeness of the information, investors should do their own research and may seek professional advice before investing.

