TLDR AI 2026-02-05
Inside Codex โ๏ธ, Geminiโs growth ๐, AI eats SaaS ๐
๐ง
Deep Dives & Analysis
Inside the Codex App Server (22 minute read)
OpenAI detailed the architecture of the Codex App Server, a bidirectional JSON-RPC API layer powering Codex across platforms, offering tips for integrating coding agents into real workflows.
AI Is Finally Eating Software's Total Market: Here's What's Next (10 minute read)
AI's rise is shrinking the software market, causing unpredictability for SaaS companies. Key players like Salesforce and SAP are integrating AI functionality into popular platforms to adapt, while weaker firms face potential obsolescence. Success hinges on owning intent gateways and effectively leveraging AI for outcomes, positioning few companies to thrive amid industry-wide disruption.
Kimi K2.5 (17 minute read)
Kimi K2.5 is a solid model and exceeds expectations. It is likely now the leading open weights model. The model is excellent given its price. It is an excellent choice for developers who can't afford or don't want to pay for Opus 4.5 and have to go with something cheaper to run their OpenClaw.
๐จโ๐ป
Engineering & Research
Give any AI agent access to Google search with SerpApi (Sponsor)
Agents are smarter when they can search the web - but with SerpApi, you don't need to reinvent the wheel. Build with the same
web search API used by ahrefs, NVIDIA, and Perplexity. Integrate with a dead-simple GET request that any agent can make.
Start with 250 free credits/monthWindsurf Tab v2: 25-75% more accepted code with Variable Aggression (7 minute read)
Windsurf's Tab feature was very well received, but it wasn't well maintained after launch. The team has now significantly improved the underlying model and context/data engineering pipeline. This has led to direct Pareto improvements across all of Windsurf's metrics, with an average 54% increase in characters per predict. Tab now uses variable aggression to tailor the experience to each user's preferences.
Mistral Introduces Voxtral Transcribe 2 (3 minute read)
Mistral's Voxtral Transcribe 2 is a pair of next-gen speech-to-text models, including a real-time version with open weights that delivers sub-200ms latency and low-cost, high-accuracy transcription across 13 languages.
Claude and Codex are now available in public preview on GitHub (4 minute read)
Claude by Anthropic and OpenAI Codex are now available for Copilot Pro+ and Enterprise customers on GitHub in public preview. Users can start sessions and assign tasks to these coding agents from the web, mobile app, or VS Code without additional subscriptions. All agent actions, like drafting pull requests and task prioritization, can be managed through GitHub's existing infrastructure.
Owning a $5M data center (7 minute read)
comma has been running its own data center for years. All of the company's model training, metrics, and data live inside its own data center in its own office. This post describes how comma's data center works. It is aimed at inspiring other companies to build their own data centers too.
Why Nvidia builds open models with Bryan Catanzaro (64 minute read)
Nvidia's focus on building open models, spearheaded by efforts like the Nemotron 3 Nano release, positions it uniquely to enhance both its product offerings and the AI ecosystem. Open model development aids Nvidia in understanding compute-heavy AI workloads, which informs their hardware and software design, while fostering greater community collaboration. Nvidia distinguishes itself by emphasizing open data and models to drive AI infrastructure, benefiting businesses without AI acting as an overarching monopolistic event.
Get the most interesting AI stories and breakthroughs delivered in a free daily email.
Join 920,000 readers for
one daily email