
Pinterest Slashes AI Costs by 90% via Deep Customization of Qwen3-VL 📉

Pinterest has achieved a major breakthrough in operational efficiency, slashing AI infrastructure costs by 90% and boosting accuracy by 30% by restructuring the vision processing layer of the Qwen3-VL model.

Sources venturebeat.com

Pinterest, the image-sharing platform, has announced a significant breakthrough in optimizing the cost of running AI at scale, cutting its AI infrastructure costs by 90%.

Background

With over 620 million monthly active users, Pinterest faced a hard economic problem: relying on expensive frontier models to serve billions of image recommendations a day had become a heavy financial burden rather than a viable strategy. Pinterest's CTO, Matt Madrigal, noted that calling a large model's API for every user interaction was a costly mistake.

To tackle this issue, Pinterest's engineering team took a bold approach: "gutting" the Qwen3-VL model. Rather than running the original architecture as is, they stripped out its vision layer entirely. The engineers then rebuilt that layer around proprietary embeddings trained on Pinterest's own massive image dataset.
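The article does not describe how Pinterest wired its embeddings into the model, so the following is only an illustrative sketch of the general idea: a precomputed image embedding is mapped by a small projection into a few "visual tokens" in the language model's hidden size, which are then placed in front of the text token vectors. Every name and dimension here (`make_projection`, `project_embedding`, `build_model_input`, the sizes 4, 6, and 2) is hypothetical, and the random weights stand in for a projection that would be trained in practice.

```python
import random

def make_projection(embed_dim, hidden_dim, num_visual_tokens, seed=0):
    """Create one (hidden_dim x embed_dim) weight matrix per visual token.

    Hypothetical stand-in for a trained adapter; weights are random here.
    """
    rng = random.Random(seed)
    return [
        [[rng.uniform(-0.1, 0.1) for _ in range(embed_dim)]
         for _ in range(hidden_dim)]
        for _ in range(num_visual_tokens)
    ]

def project_embedding(image_embedding, weights):
    """Map one image embedding to a list of hidden-size visual token vectors."""
    return [
        [sum(w * x for w, x in zip(row, image_embedding)) for row in matrix]
        for matrix in weights
    ]

def build_model_input(image_embedding, text_token_vectors, weights):
    """Prepend the projected visual tokens to the text token vectors."""
    return project_embedding(image_embedding, weights) + text_token_vectors

# Toy sizes, assumed for illustration only.
weights = make_projection(embed_dim=4, hidden_dim=6, num_visual_tokens=2)
image_embedding = [0.1, 0.2, 0.3, 0.4]          # would come from an in-house embedding model
text_tokens = [[0.0] * 6 for _ in range(3)]     # placeholder text token vectors
sequence = build_model_input(image_embedding, text_tokens, weights)
print(len(sequence), len(sequence[0]))
```

In a design like this, the expensive vision encoder never runs at request time: the image embedding can be computed once and reused, and only the small projection and the language model are evaluated per request. Whether Pinterest's system works this way is an assumption, not something the source confirms.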

Why It Matters

The results were striking: inference costs fell by 90%, while the accuracy of image recommendations rose by 30%. This is strong evidence that "model gutting" and customizing models to enterprise-specific data are becoming the key to unlocking AI profitability.

For the tech community and startups in Vietnam, Pinterest's story offers a valuable lesson about not relying entirely on "off-the-shelf" models. Owning proprietary data and possessing the capability to customize AI model layers will serve as a core competitive advantage. This enables businesses to optimize their economics while enhancing user experience amid an increasingly expensive AI race.