TLDR AI 2026-05-25
Mythos 1 🤖, neocloud boom 📈, MCP goes stateless 💻
)
Anthropic prepares Mythos 1 for Claude Code and Claude Security (2 minute read)
Anthropic appears to be moving Claude Mythos to broader availability, with the model now helping protect a wider range of organizations. Traces of the model have already surfaced on Google Cloud and AWS through vulnerability discovery programs. A release for Mythos 1 seems imminent. Claude Opus 4.8 is also rumored to be in the works for release.
👨💻
Engineering & Research
Take 15 things off your to-do list today (Sponsor)
Use
these 15 agentic workflows in Notion to automate, connect, and repeat processes. Use cases span ops, prod, support, and recruiting - including real examples used by Ramp, Vercel, and Clay. The playbook is free, detailed, and full of steps you can implement today. Start running agents 24/7 with Notion.
Get the guideThe 2026-07-28 MCP Specification Release Candidate (9 minute read)
The release candidate for the next Model Context Protocol (MCP) specification is now available. It is the largest revision of the protocol since launch. The candidate introduces a stateless core that scales on ordinary HTTP infrastructure, extensions, authorization that aligns more closely with OAuth and OpenID Connect deployments, a formal deprecation policy, and many other changes. It contains breaking changes. The final specification will ship on July 28.
Evaluating Multi-Agent Systems at Scale (48 minute read)
OpenAI outlined a macro-evaluation workflow for agentic systems that analyzes patterns across entire populations of traces rather than isolated failures.
Lance (Hugging Face Repo)
Lance is a lightweight native unified multimodal model that supports image and video understanding, generation, and editing. It delivers strong performance across image generation, image editing, and video generation benchmarks with just 3B active parameters. The model was trained entirely from scratch within a 128-A100-GPU budget. Examples of clips generated by the model are available in the repository.
Reasonix (Website)
Reasonix is a DeepSeek-native coding agent for the terminal. It is engineered around prefix-cache stability and designed to be left running. Token costs stay low across long sessions.
David Sacks's 11th-Hour Plea Led to Trump's Backtrack on AI Executive Order (9 minute read)
David Sacks, a venture Capitalist, warned President Trump on a call that the long-awaited executive order on the dangers posed by artificial intelligence that Trump was deliberating on could lead to mandatory regulations that slow down the industry in its race with Chinese competitors. Trump responded that he shared concerns about China and was worried about hindering AI investment. He then postponed the signing and told reporters he wouldn't sign the order. The incident shows how powerful Sacks' influence is and marks a win for those against strong guardrails to limit the risks posed by the technology.
Anthropic's march to profitability (3 minute read)
Anthropic is on track to do $10.9 billion in Q2 revenue, up from $4.8 billion in Q1, growing faster right now than Zoom did at the peak of the pandemic. The thing that flipped them to profit is compute getting cheaper, 71 cents of compute per revenue dollar in Q1, 56 cents in Q2. Claude Code on its own is at $2.5 billion in revenue, and the company expects to clear $559 million in profit just in time for its October IPO. The "AI labs burn money forever" story finally has a hole in it.
Clerk CLI: a scriptable interface to auth for developers and agents (Sponsor)
clerk init scaffolds auth into your project. clerk config manages settings in code. clerk api fetches users, orgs, and sessions — no dashboard required. Open source and available for your agentic harness today.
TLDR is hiring a Senior Software Engineer, Applied AI ($250k-$350k, Fully Remote)
TLDR's Applied AI team is tasked with making every process at TLDR legible to code, runnable by anyone, and composable into larger workflows. Join a small, fast moving team using the latest AI tools with an unlimited token budget.
Learn more.
Bumblebee Goes Open Source (1 minute read)
Perplexity open-sourced Bumblebee, a read-only security scanner that identifies risky packages, extensions, and AI tool configurations on developer machines.
Anthropic plans Claude memory update with new Memory Files (2 minute read)
Memory Files distributes notes across multiple structured documents organized by topic, project, or context.
Gemini 3.5 Flash (Low) (1 minute read)
Gemini 3.5 Flash (Low) generates around 45% fewer tokens than Gemini 3.5 Flash (Medium) and generally outperforms Gemini 3.5 Flash (High) on SWE tasks.
Paperwork is better when you can just talk through it (1 minute read)
ChatGPT users can now upload a form image, add the details they want included, and the chatbot can fill it out for them.
Get the most interesting AI stories and breakthroughs delivered in a free daily email.
Join 920,000 readers for
one daily email