Vast Hits $1B as Tripo AI Wins China's 3D Model Race
Funding

Vast Hits $1B as Tripo AI Wins China's 3D Model Race

Beijing startup Vast raised nearly $200M, crossed a $1B valuation, and now powers 10M users on Tripo AI, China's leading text-to-3D engine.

Share:XLinkedIn

Key Takeaways

  • Vast raised nearly $200 million across A+ and A++ rounds, crossing a $1 billion valuation to become China's latest AI unicorn on June 1, 2026.
  • Its Tripo AI platform turns text and image prompts into production-ready 3D assets and claims nearly 10 million users and about 90,000 studios.
  • Named clients include NetEase and Sony, signaling output good enough for commercial gaming and entertainment pipelines.
  • Founder Simon Song previously co-founded MiniMax, part of a pattern of veteran Chinese AI founders moving from text models to 3D tooling.
  • The strategic thesis is that generative 3D is core infrastructure for robotics, world models, and AR, not just a creative tool.

China just minted another AI unicorn, and it has nothing to do with chatbots. Vast, a three-year-old Beijing startup, crossed a $1 billion valuation on the back of a product that turns a sentence into a finished 3D object. While the industry argues about text and video models, the race for the third dimension just produced its first giant.

What Actually Happened

On June 1, 2026, Beijing-based Vast announced it had raised nearly $200 million across A+ and A++ financing rounds, pushing its valuation past $1 billion and making it China's latest AI unicorn. The rounds were led by Ince Capital and a China Life-backed venture fund, with Genesis Capital, Eminence Ventures, and Primavera Venture Partners also participating. The capital is earmarked for advancing the company's AI 3D and world-model roadmap.

Vast's flagship product is Tripo AI, a platform that converts text and image prompts into detailed, production-ready 3D assets for creators, game developers, and studios. The traction numbers are what justified the unicorn label: Tripo claims nearly 10 million individual users and roughly 90,000 studios and companies, with named clients including NetEase and Sony. For a category that barely existed two years ago, that is a base most generative tools never reach.

The founder story adds weight. Vast was founded in 2023 by CEO Simon Song, who previously co-founded MiniMax, one of China's earlier large-model unicorns. That pedigree helped Vast raise quickly, and it signals that experienced Chinese AI founders are now deliberately pivoting away from crowded text-model territory toward 3D and spatial tooling, where the field is younger and the moats are less contested.

Stay Ahead

Get daily AI signals before the market moves.

Join founders, investors, and operators reading TechFastForward.

Why This Matters More Than People Think

Generative AI conquered text, then images, then video. 3D has stayed stubbornly hard because a 3D asset is not a flat output, it is geometry, texture, topology, and rig, all of which have to be correct enough to drop into a game engine or a simulation. A tool that produces usable 3D from a prompt collapses a workflow that used to take a skilled artist hours or days. That is why 10 million users showed up: the alternative is expensive, slow, and scarce talent.

The strategic stakes are bigger than gaming art. 3D assets are the raw material for the fastest-growing frontiers in technology: game development, augmented and virtual reality, digital twins, and crucially the simulated environments used to train robots and world models. Every humanoid robot company and every autonomous-systems lab needs vast quantities of 3D scenes to train in simulation. A cheap, scalable 3D generation engine is not a creative toy, it is infrastructure for the embodied-AI and spatial-computing stack that everyone from Nvidia to the robotics labs is racing to build.

That reframes what Vast actually is. On the surface it is a 3D design tool with a freemium user base. Underneath, it is a bet that whoever can mass-produce 3D worlds becomes a supplier to the entire physical-AI economy. The world-model language in the funding announcement is not marketing garnish, it is the actual thesis: 3D generation and world simulation are converging, and Vast wants to own the input layer.

The Competitive Landscape

The race for generative 3D is global but lopsided, and China is currently ahead on tooling. Vast and Tripo compete with a cluster of Western efforts, from research projects at the large labs to startups building text-to-3D pipelines, but few have reached Tripo's scale of real users. The advantage compounds: more users generate more usage data, which improves the models, which attracts more users. In a category where high-quality 3D training data is scarce, a 10 million user funnel is a genuine moat.

The named clients matter as proof points. NetEase is one of the world's largest game publishers and Sony spans gaming, film, and consumer electronics. Enterprise adoption by companies of that caliber signals the output is good enough for commercial pipelines, not just hobbyist experiments. It also positions Vast as a supplier to exactly the industries, gaming and entertainment, that will drive the first wave of mass-market AR and immersive content.

There is a geopolitical contour as well. Vast is a Chinese company supplying 3D infrastructure to global clients at a moment when AI supply chains are fracturing along national lines. That is simultaneously a strength, access to China's enormous creator market and capital, and a vulnerability, exposure to export controls, data-sovereignty rules, and the same cross-border scrutiny that has complicated other Chinese AI firms with international ambitions.

Hidden Insight: 3D Is the Missing Input for Physical AI

The story everyone will tell is that Vast is the latest Chinese AI success and a sign that generative media has a new frontier. The more interesting read is what the round reveals about where AI value is migrating. The first generation of generative AI sold outputs people consume: text to read, images to look at, video to watch. 3D is different because its primary long-term customer is not a human viewer, it is another AI system that needs an environment to learn in. Vast is selling the training ground for the next wave of models.

The bear case, however, is that the gap between impressive demos and production assets is still wide. Skeptics point out that generative 3D output often needs heavy cleanup before it is usable in a AAA game or a precise simulation, that topology and rigging remain weak spots, and that 10 million users counts mostly free hobbyists whose conversion to paying customers is unproven. The risk is that Vast is being valued as infrastructure while its revenue still looks like a freemium creative tool. A $1 billion valuation on a largely free user base assumes a monetization curve that has not yet been demonstrated at scale.

The deeper signal is about which geography is building the spatial-AI supply chain. While Western attention is fixed on frontier text models and their trillion-dollar valuations, a Chinese founder from the MiniMax lineage just raised $200 million to own the 3D input layer for robotics, gaming, and world models. If physical AI and simulation are the next platform, the company supplying the worlds may end up as strategically important as the companies supplying the models. Over the next 12 to 24 months, watch whether the 3D layer gets treated as critical infrastructure, and who controls it.

The uncomfortable truth this challenges is the assumption that the AI value chain runs through a handful of Western frontier labs. The physical-AI stack has inputs those labs do not own, and 3D generation is one of them. A world where robots and world models are trained on simulated environments is a world where the supplier of those environments has leverage, and right now the early leader in that supply is sitting in Beijing.

What to Watch Next

In the next 30 to 90 days, watch how Vast deploys the capital and whether it ships world-model features that move it from asset generation toward full scene and environment simulation. Track the conversion story: any disclosure of paying-customer counts or enterprise revenue will reveal whether the $1 billion valuation rests on monetization or on user-growth optimism. Watch for new enterprise clients beyond NetEase and Sony, especially any robotics or simulation companies, because that would confirm the infrastructure thesis rather than the creative-tool one.

Over the next 180 days, watch the competitive response from Western labs and whether any of them move to acquire or out-build a text-to-3D leader. Watch the geopolitics too: any export-control or data-sovereignty action affecting Chinese AI tooling would land directly on a company like Vast with global clients. The single most important indicator is whether the robotics and simulation industry starts treating generative 3D as a required input, because the day that happens, Vast stops being a design startup and becomes critical infrastructure with pricing power to match.

The next generation of AI will not be trained on text. It will be trained in 3D worlds, and the company that mass-produces those worlds may matter as much as the labs that build the models.


Key Takeaways

  • Nearly $200 million raised across A+ and A++ rounds, pushing Vast past a $1 billion valuation as China's latest AI unicorn on June 1, 2026.
  • Tripo AI turns text and image prompts into production-ready 3D assets and claims nearly 10 million users and about 90,000 studios.
  • NetEase and Sony are named clients, signaling output good enough for commercial gaming and entertainment pipelines.
  • Founder Simon Song previously co-founded MiniMax, part of a pattern of veteran Chinese AI founders pivoting from text models to 3D and spatial tooling.
  • The thesis is infrastructure: generative 3D as core input for robotics, world models, and AR, not just a creative tool.

Questions Worth Asking

  1. If 3D worlds become the training ground for robots and world models, does the supplier of those worlds gain as much leverage as the model labs?
  2. A $1 billion valuation rests on roughly 10 million mostly free users, so what conversion rate would justify it, and is that realistic?
  3. If the spatial-AI supply chain's early leader is in Beijing, how should Western robotics and gaming companies think about that dependency?
Newsletter

Enjoyed this analysis? Get the next one in your inbox.

Daily AI signals. No noise. Built for founders, investors, and operators.

Share:XLinkedIn
</> Embed this article

Copy the iframe code below to embed on your site:

<iframe src="https://techfastforward.com/embed/vast-hits-1b-as-tripo-ai-wins-chinas-3d-model-race" width="480" height="260" frameborder="0" style="border-radius:16px;max-width:100%;" loading="lazy"></iframe>