Chinese Models Surpass US Ones in Token Consumption

A developer in San Francisco selects MiniMax as her inference backend. The request crosses the Pacific, gets processed in a Chinese data center, returns in a second. She pays $0.30 per million tokens. Claude Opus would cost $5.00 for the same volume. No customs declaration. No tariffs. No entry in any trade database. Chinese tech media calls this Token 出海: Token Export. This month’s OpenRouter data showed it crossing a threshold: Chinese models surpassed American ones in total token consumption for the first time. This did not happen overnight. I spent February tracing the structural forces behind it: efficiency architectures, energy cost gaps, and a competitive intensity that pushes pricing below rational margins. Nvidia’s record quarter and falling stock price are part of the same story. China’s Compute Bet explains the system. https://lnkd.in/ghEBwk-s The Export That Tariffs Can’t Touch explains what happens when that system meets the global market. https://lnkd.in/gh_ajmmZ

To view or add a comment, sign in

Explore content categories