Gemini Extended Thinking ✨, ChatGPT finance 📱, Claude Code at scale 👨‍💻

Your agent needs a harness, not a framework. 69% of engineers building in prod agree (Sponsor)

Inngest asked 130 engineers about running AI in production—only 19% were very confident their stack could scale, with gaps in tracing being a key issue. 1 in 5 now spend up to half their time on reliability work just piecing together context.

Read the full benchmark report to see what's working, what's just marketing (respectfully), and what teams your size actually use to ship production-ready apps and agents.

Or...just add the #1 thing the most confident teams use, for free 🤠

TLDR AI 2026-05-18

Gemini Extended Thinking ✨, ChatGPT finance 📱, Claude Code at scale 👨‍💻

Your agent needs a harness, not a framework. 69% of engineers building in prod agree (Sponsor)

Headlines & Launches

ChatGPT Personal Finance (6 minute read)

Gemini app rolling out ‘Extended' thinking level, new 3rd-party app integrations (3 minute read)

Codex will soon be able to control other desktop devices via Computer Use (2 minute read)

Deep Dives & Analysis

AI economics part 2 (11 minute read)

Portability Is a Myth: Why the Best AI Stacks Will Never Be Hardware-Agnostic (15 minute read)

Tokenomics: the 62.5-minute rule for Claude's cache (8 minute read)

Engineering & Research

May 26 workshop: Agent orchestration on AWS (Sponsor)

How Claude Code works in large codebases: Best practices and where to start (5 minute read)

Recent Developments in LLM Architectures: KV Sharing, mHC, and Compressed Attention (33 minute read)

Lighthouse Attention (11 minute read)

Notes on pretraining parallelisms and failed training runs (12 minute read)

Miscellaneous

The haves and have nots of the AI gold rush (1 minute read)

Runway started by helping filmmakers — now it wants to beat Google at AI (11 minute read)

Quick Links

Headroom (GitHub Repo)

Apple Silicon costs more than OpenRouter (3 minute read)

DeepSeek-V4-Flash means LLM steering is interesting again (9 minute read)

OpenAI Quietly Bought Voice-Cloning Startup Weights.gg, Then Folded the Team (3 minute read)

TLDR is hiring a Senior Software Engineer, Applied AI ($250k-$350k, Fully Remote)

Get the most interesting AI stories and breakthroughs delivered in a free daily email.