TLDR AI 2025-07-14
Google “acquires” Windsurf team 🔍, OpenAI delays open model 🤖, Kimi K2 tops DeepSeek 2️⃣
Moonshot AI's Kimi K2 outperforms GPT-4 in key benchmarks (5 minute read)
Chinese startup Moonshot AI released Kimi K2, a 1 trillion parameter open-source model that matches proprietary models on complex agentic tasks. The model was trained using a novel MuonClip optimizer that prevents training crashes that plague model development, potentially saving millions in computational costs.
OpenAI delays the release of its open model, again (2 minute read)
OpenAI is again delaying the release of its open model. The company had planned to release the model next week. The release has been pushed back indefinitely for further safety testing. The release of OpenAI's open model is one of the most highly anticipated AI events of the summer.
Windsurf's CEO goes to Google; OpenAI's acquisition falls apart (3 minute read)
Google DeepMind hired Windsurf CEO Varun Mohan, co-founder Douglas Chen, and key researchers after OpenAI's $3 billion acquisition attempt failed. Google gains a nonexclusive license to Windsurf's technology, enhancing its AI coding capabilities without direct control. Windsurf faces uncertainty as it loses top talent, continuing to offer AI coding tools with Jeff Wang as interim CEO.
Apple Will Seriously Consider Buying Mistral (2 minute read)
Apple is seriously considering acquiring Mistral, a French AI startup that has raised a total of €1.1 billion over seven funding rounds. Mistral has launched a range of large and small language models over the years and achieved notable success with its optical character recognition features. It is currently Europe's biggest AI startup. The acquisition of an AI startup would provide a much-needed boost to Apple's AI ecosystem.
👨💻
Engineering & Research
Why Your AI POCs Never Launch (Sponsor)
After helping hundreds of companies deploy AI solutions on AWS, Mission identified the 4 critical gaps that separate successful AI implementations from the 80% that never see real business impact. This
10-minute AI Readiness Assessment reveals exactly what's blocking your progress and gives you a roadmap with personalized recommendations, plus a strategy session to discuss your results.
Take the assessmentScaling up RL is all the rage right now (2 minute read)
Reinforcement learning (RL) will continue to lead to more gains because when done well, it is a lot more leveraged, more responsive to feedback, and superior to supervised fine-tuning. Researchers are likely to discover more about RL as rollout lengths continue to expand. There are a lot more S curves to discover, possibly specific to large language models and without analogues in game/robotics-like environments.
How to scale RL to 10^26 FLOPs (18 minute read)
Reinforcement learning is the next training technique for building frontier-level AI models. Training it on more data will make it better. The current approach of scaling is messy and complicated. Finding a way to do next-token prediction on the web using RL would enable models to reason from general web data instead of just math and code.
The upcoming GPT-3 moment for RL (7 minute read)
GPT-3 showed that scaling up language models unlocks powerful performance that often outperforms carefully fine-tuned models. Before GPT-3, achieving state-of-the-art performance meant first pretraining models, then fine-tuning them on specific tasks. Today's reinforcement learning is stuck in a similar pre-GPT paradigm. The approach suffers from the fundamental limitation where the resulting capabilities generalize poorly, leading to brittle performance that rapidly deteriorates outside of the precise contexts seen during training. The RL field will soon shift toward massive-scale training across thousands of diverse environments. Doing this effectively will produce RL models with strong abilities capable of quickly adapting to entirely new tasks - achieving this will require training environments at a scale and diversity that dwarfs anything currently available.
Get the most interesting AI stories and breakthroughs delivered in a free daily email.
Join 920,000 readers for
one daily email