TLDR AI 2025-09-04
Mistral $14B valuation π°, xAI CFO quits βοΈβπ₯, Apple AI search π
Mistral, the French AI giant, is reportedly on the cusp of securing a $14B valuation (1 minute read)
Mistral AI is finalizing a β¬2 billion investment that will boost its valuation to $14 billion. Founded by ex-DeepMind and Meta researchers, it develops open-source language models and the European AI chatbot Le Chat. This marks Mistral's first major fundraising since June 2024.
Apple's rumored AI search tool for Siri could rely on Google (2 minute read)
Apple plans to enhance Siri with an AI search feature, potentially relying on Google's Gemini AI model. The feature, dubbed βWorld Knowledge Answers,β will generate summaries using web results, integrating various media formats. Apple has agreed with Google to test Gemini, while also considering its own and Anthropic's models for Siri's development.
xAI's CFO Steps Down, the Latest in a String of Executive Departures (4 minute read)
xAI has seen a number of high-profile departures in recent weeks. Mike Liberatore, xAI's chief financial officer, left the company at the end of July. Robert Keele, xAI's general counsel, announced he was leaving on August 7. Raghu Rao, a senior lawyer who oversaw commercial legal affairs for xAI, left around the same time as Liberatore and Keele. Igor Babuschkin, a veteran of Google's DeepMind AI research lab and OpenAI, announced on August 13 that he was leaving to start his own venture-capital firm focused on AI safety.
π§
Deep Dives & Analysis
Open Global Investment as a Governance Model for AGI (26 minute read)
Rather than nationalizing AI companies or creating new international bodies, we should create publicly-traded AGI corporations with stricter government oversight. Foreign countries and citizens would purchase voting stock in companies like OpenAI, giving them financial stakes and board influence instead of racing to build competing systems.
The Bitter Lesson is Misunderstood (15 minute read)
While general methods that leverage computation seem more effective, data still plays a very important role. You need 40% more data when doubling GPUs, otherwise you're just lighting cash on fire. AI companies have already consumed the internet. The leaders who win the next era of AI will be scaling data itself. This could be done by architecting better use of limited data or alchemizing entirely new ultimate data universes.
Trust me bro, just one more RL scale up, this one will be the real scale up with the good environments, the actually legit one, trust me bro (15 minute read)
AI companies haven't actually done a good job of scaling up RL - they've scaled up to compute, but with low-quality data. While some argue that once these companies actually scale RL up, there will be a big jump in AI capabilities, it is more likely that progress will be the result of numerous hard-won advances. Higher quality RL environments will be a substantial part of what drives progress over the next year or two, and maybe beyond that, but significant improvements will come from several advances, not just scaling RL.
π¨βπ»
Engineering & Research
Getting Started with AI Observability (Sponsor)
Teams can't see into their AI models - and many can't even answer basic questions like "which prompts are costing us the most money?"
AI observability is the way to stop guessing and starting getting answers - with automated dashboards for prompt frequency, response times, drift indicators, and cost impact.
Try it out in the Dynatrace PlaygroundSpeeding up PyTorch inference by 87% on Apple devices with AI-generated Metal kernels (16 minute read)
Frontier models can write optimized GPU kernels for Apple devices to speed up inference. This post discusses how researchers generated Metal kernels that were 1.87 times faster across 215 PyTorch mod duties, with some workloads running hundreds of times faster than baseline. The results show that AI can take on portions of optimizations, leaving human engineers to focus on the most complex optimizations.
Awesome Agentic LLM+RL Papers (GitHub Repo)
A curated list of papers on agentic reinforcement learning with large language models.
Agentic Design Patterns (Textbook)
In this 400 page preprint, a senior Google exec lays out the 21 fundamental design patterns for AI agents. They provide concrete code examples of prompt chaining, memory implementation, multi-agent coordination, and more from production applications. While it's still very early into agentic tooling, the core engineering foundations for the post-chatbot generation of AI are taking shape.
OpenAI boosts size of secondary share sale to $10.3 billion (2 minute read)
OpenAI is allowing current and former employees to sell more than $10 billion worth of stock in a secondary share sale. The sale will be at a $500 billion valuation, which is in line with expectations. OpenAI was valued at $300 billion in a fundraising round earlier in the year. Staff who have held shares for more than two years have until the end of September to decide whether to participate.
Palantir launches Working Intelligence (2 minute read)
Palantir has introduced Working Intelligence: The AI Optimism Project, positioning AI as a tool for American workers rather than a replacement. The campaign emphasizes practical applications across industries, from hospitals to factories, showing AI as a force for solving real problems and driving prosperity. The effort is part of a broader movement to empower workers and ensure they lead the next technological revolution.
Want more news from TLDR? (Sponsor)
You'll probably like our flagship newsletter. It's all about tech, science, and programming.
Same quick format. Still free.
Subscribe now.
Alex, the "Cursor for Xcode," Joins OpenAI's Codex Team (1 minute read)
Xcode, an iOS/macOS coding IDE, has lagged in AI adoption compared to its Windows-based competitors.
ElevenLabs Launches Free Text-to-Sound Effects Generator (2 minute read)
The AI voice company now generates custom sound effects from text prompts, offering 50 free generations monthly.
Entering the DOS era of AI (3 minute read)
Everyone's waiting for the friendly version of AI, while developers are already building in terminals.
Translation startup DeepL launches an AI βagent' in challenge to players like OpenAI (3 minute read)
The AI DeepL Agent, aimed at enterprises, is designed to complete repetitive, time-intensive tasks across a wide variety of functions.
Scale AI is suing a former employee and rival Mercor, alleging they tried to steal its biggest customers (3 minute read)
Scale AI is suing former employee Eugene Ling and competitor Mercor, alleging theft of over 100 confidential documents.
NotebookLM Adds Audio Overview Formats (1 minute read)
NotebookLM introduced four new audio overview types: Deep Dive for detailed analysis, Brief for quick recaps, Critique for expert feedback, and Debate featuring host-driven discussions.
Get the most interesting AI stories and breakthroughs delivered in a free daily email.
Join 920,000 readers for
one daily email