Mistral Medium 3 ⚙️, Fidji Simo joins OpenAI 🤝, Anthropic Web Search API 🔍

Google launches 'implicit caching' to make accessing its latest AI models cheaper (3 minute read)

Google's new "implicit caching" feature in its Gemini API reportedly offers 75% cost savings on repetitive context for its Gemini 2.5 models. Unlike the previous manual explicit caching method, this automatic feature could ease developer concerns about unexpected API costs. However, developers should monitor its effectiveness, as Google hasn't provided third-party verification for the cost savings claims.

TLDR AI 2025-05-09

Mistral Medium 3 ⚙️, Fidji Simo joins OpenAI 🤝, Anthropic Web Search API 🔍

Headlines & Launches

Mistral Medium 3 (3 minute read)

Fidji Simo Joins OpenAI as CEO of Applications (2 minute read)

Anthropic launches Web Search API for Claude (2 minute read)

Deep Dives & Analysis

Separating Fact from Fiction: Here's How AI Is Transforming Cybercrime (6 minute read)

Is there a Half-Life for the Success Rates of AI Agents? (1 minute read)

The Leaderboard Illusion (1 minute read)

Engineering & Research

Osmosis self-improvement via real-time reinforcement learning (4 minute read)

Actor-Critic Learning with Offline Data (29 minute read)

AMIE gains vision: A research AI agent for multimodal diagnostic dialogue (1 minute read)

Miscellaneous

Google launches 'implicit caching' to make accessing its latest AI models cheaper (3 minute read)

Hugging Face releases a free Operator-like agentic AI tool (2 minute read)

AI-generated code could be a disaster for the software supply chain. Here's why (4 minute read)

Quick Links

Freepik releases an 'open' AI image generator trained on licensed data (2 minute read)

Meta Enters The Token Business, Powered By Nvidia, Cerebras And Groq (2 minute read)

Perplexity Expanding AI-Powered Learning (2 minute read)

Get the most interesting AI stories and breakthroughs delivered in a free daily email.