TLDR AI 2026-01-21
Thinking Machines implosion 🤖, Gemini Chrome Skills ⚡, Humans& $480M seed 💰
Gemini in Chrome is getting “Skills” as it moves toward becoming a full AI agent (2 minute read)
Gemini in Chrome is being upgraded into a proactive agent capable of performing complex tasks on users' behalf. A hidden internal page for 'skills' has been spotted in testing. The page features a dedicated interface where users can define specific capabilities for the AI. Skills will effectively allow users to 'teach' Gemini how to handle specific or repetitive workflows within the browser.
AI startup Humans& raises $480 million at $4.5 billion valuation in seed round (2 minute read)
AI startup Humans& has secured $480 million in seed funding, reaching a $4.5 billion valuation. Key participants included Nvidia, Jeff Bezos, and GV. The company aims to develop human-centric AI for improved communication and collaboration. A product launch is expected early this year. The founding team consists of experts from OpenAI, Google DeepMind, and Meta.
The Messy Human Drama That Dealt a Blow to One of AI's Hottest Startups (5 minute read)
Mira Murati says that Barret Zoph was fired from Thinking Machines Lab after repeated concerns about his conduct and lack of productivity. Zoph claims he was fired for expressing an intent to take a job elsewhere. Within hours of Zoph's firing, Zoph, another co-founder, and a third employee had signed offers to rejoin OpenAI. The departures are a sign of how the AI race is as much a battle for talent as technology.
👨💻
Engineering & Research
Inworld releases new TTS model designed for the next wave of consumer AI applications (Sponsor)
Inworld released TTS-1.5, building on their #1-ranked position on the Artificial Analysis leaderboard. The new model achieves sub-250ms P90 latency, enabling natural conversation at thousands of queries per second. With 30% greater expressiveness, 40% lower word error rates, and 25x lower cost than alternatives, TTS-1.5 is designed for developers building high-demand, realtime applications at consumer scale.
Try TTS-1.5Learn More
Introducing FastMCP 3.0 🚀 (11 minute read)
FastMCP 3.0 was built to be as durable as it is future-proof. It moves beyond simple tool servers and enters the era of Context Applications: rich, adaptive systems that manage the information flow to agents. FastMCP 3.0 can source components from anywhere, compose and transform them freely, personalize what each user sees, track state across sessions, and more. The first beta is available now.
Differential Transformer V2 (12 minute read)
DIFF V2 features key upgrades over DIFF V1: faster inference via compatibility with FlashAttention, better training stability by removing per-head RMSNorm, and a simplified λ parameterization that eliminates complex initialization schemes.
RePo: Content-Aware Attention Reordering (18 minute read)
RePo is a module that repositions tokens based on contextual relevance rather than fixed order, improving long-range attention and robustness to noisy or structured inputs. The method draws on Cognitive Load Theory to justify reorganizing input structure for more efficient model processing.
Waypoint-1: Real-time Interactive Video Diffusion from Overworld (5 minute read)
Waypoint-1 is a real-time-interactive video diffusion model. It can be controlled and prompted via text, mouse, and keyboard. Users can give the model some frames, run it, then have it create a world that they can interact with. They can move the camera freely with the mouse and input any key on the keyboard, all with zero latency. Each frame is generated with the controls as context. The model runs fast enough to provide a seamless experience, even on consumer hardware.
AI Gains Starting to Show in the Real Economy (6 minute read)
The introduction of AI tools like Claude Cowork and Alibaba's Qwen Assistant marks a shift towards AI's real-world applications, similar to ChatGPT's accessibility impact. Recent data from Q3 2025 shows a 4.9% productivity jump, with minimal change in hours worked, indicating potential initial AI-driven efficiency gains. Despite early-stage adoption, these developments could significantly boost economic productivity and eventually increase living standards.
Anthropic CEO: Selling H200s to China is like giving nukes to North Korea (4 minute read)
Anthropic CEO Dario Amodei has likened the decision to allow Nvidia to sell GPUs to Chinese companies to giving nuclear weapons to an adversary. The Trump administration announced just over a month ago that it would allow shipments of Nvidia H200 accelerators to Chinese customers as long as the US received 25% of the revenues. Chinese authorities have yet to allow local buyers to acquire the GPUs. Anthropic wants stricter controls on AI exports, saying access to the chips would put Chinese model developers in a better position to compete with the West.
Get the most interesting AI stories and breakthroughs delivered in a free daily email.
Join 920,000 readers for
one daily email