TLDR AI 2025-09-10
Claude docs π, Microsoft & Anthropic π€, MCP Registry π
How Snowflake is adapting usage-based pricing to the AI era (Sponsor)
Snowflake's usage-based pricing has been one key driver of its market leadership. In this
webinar, you'll get an insider view of how the team managed its pricing and billing, and how they're preparing for the Al era of monetization.
Join Ryan Campbell, Director of Product Finance at Snowflake, and Scott Woody, Metronome CEO, to learn how Snowflake aligns finance, product, GTM, and engineering around pricing decisions. On the agenda:
- Reducing friction and accelerating pricing decisions
- Applying rigor and experimentation to your monetization model
- Lessons from Snowflake on designing pricing for predictability, visibility, and control
Join live: How Snowflake built its Monetization Operating Model
π§
Deep Dives & Analysis
The whole point of OpenAI's Responses API is to help them hide reasoning traces (5 minute read)
OpenAI's Responses API replaces the previous /chat/completions API for inference. The new API has a lot more features, but the main difference is that it is stateful. Users no longer have to pass the entire conversation history with each request, they just have to pass an ID representing the state of the conversation and the provider keeps models up to date. This allows OpenAI to keep its reasoning traces secret.
The Training Imperative (6 minute read)
Every serious AI company will eventually train its own models. The barrier to doing so is collapsing. Distillation, fine-tuning, and post-containing get easier every month. Soon, the only way to stay relevant will be to own your own models.
Thoughts on Evals (14 minute read)
Production monitoring reveals real issues that pre-deployment evals will inevitably miss, especially as AI products become more unpredictable and personalized. Evals are collections of already-known failure cases, but agents can and often do fail in ways that produce no error codes.
Gemini rolls out βTools' redesign on web as Nano Banana growth continues (3 minute read)
Gemini Web has rolled out its Tools redesign. The prompt bar is now centered with a new 'Tools' menu that lets users access Deep Research, Veo, image creation, Canvas, Guided Learning, and Deep Think. Gemini now groups suggested prompts together for a cleaner look. Screenshots of the new interface are available in the article.
The Gross Margin Debate in AI (8 minute read)
AI companies face varied gross margins across sectors. Chips maintain around 70% margins, while cloud services see margins pressured by AI investments, estimated between 50-55%. Application-level margins range widely, with AI "Supernovas" starting at 25%, potentially negative, while others hit 60%, highlighting pricing strategies and diversified revenue models to improve margins over time.
Get the most interesting AI stories and breakthroughs delivered in a free daily email.
Join 920,000 readers for
one daily email