TLDR AI 2026-02-18
Claude Sonnet 4.6 🧠, NoteBookLM export 📊, Cursor plugins 🧑💻
Airia: Enterprise AI orchestration that unifies experimentation, prod, and governance (Sponsor)
You want AI to be more than a never-ending work in progress, and that means enabling no-code, low-code, and pro-code development without IT gatekeepers standing in the way. Yet that doesn't mean governance goes out the window.
Airia is the enterprise AI platform built to drive AI adoption by unifying innovation and security. Teams at Stryker, BuzzFeed, and ArcelorMittal use it to:
- Test prompts, LLMs, and agent variants in safe, prod-like environments
- End AI anxiety by controlling agent sprawl and implementing automatic guardrails
- Discover risks, orchestrate agents, and monitor threats in one place
Get a demo and make AI adoption a reality
Claude Sonnet 4.6 (11 minute read)
Anthropic has released Claude Sonnet 4.6, upgrading coding, computer-use, planning, long-context reasoning, and knowledge-work performance while keeping Sonnet pricing unchanged. Sonnet 4.6 also introduced a 1M-token context window in beta and is now the default model for Free and Pro users in Claude's apps.
Prompt-Based Revisions (1 minute read)
NotebookLM has rolled out Prompt-Based Revisions, a feature that lets users tweak, tailor, and tune slides just by prompting the revisions they want. It currently supports PPTX, with Google Slides support coming soon. A video demo of the feature is available in the thread.
Mistral to acquire Koyeb to build out its AI cloud stack (4 minute read)
Mistral AI agreed to buy serverless deployment startup Koyeb in its first acquisition, positioning Koyeb's platform and team as a core component of Mistral Compute.
On Dwarkesh Patel's 2026 Podcast With Elon Musk and Other Recent Elon Musk Things (42 minute read)
Elon Musk recently went on a podcast to discuss alignment, AI data centers in space, robots, China, and more. Musk is very gung-ho about data centers in space, robots, and making his own fabs. He plans to make virtual humans and robots to turn on an 'infinity money glitch'. Otherwise, China will win - they're already more productive.
OpenAI's acquisition of OpenClaw signals the beginning of the end of the ChatGPT era (7 minute read)
OpenAI's acquisition of OpenClaw marks a strategic shift from conversational AI to autonomous agents capable of executing tasks. OpenClaw's popularity stemmed from its unrestrained, robust functionality, combining tool access, sandboxed code execution, and integration with messaging platforms. This move signals a new phase for enterprise AI as companies race to develop secure, deployable versions of dynamic AI agents.
👨💻
Engineering & Research
The weakest link in enterprise AI is rarely the models (Sponsor)
Human judgment is the hardest part to scale. You're relying on people to validate training data, evaluate models, review edge cases, and enforce policies — are they judging consistently and defensibly?
Welo Data builds systems that support human + AI judgment at scale.
Talk to an expertExperiential Reinforcement Learning (18 minute read)
ERL trains policies with an explicit attempt → feedback → reflection → revised attempt loop, then reinforced the successful revision back into the base model. The approach improves sparse-reward learning and tool-using reasoning performance while keeping deployment-time inference cost unchanged.
Cohere's Family of Open Models (9 minute read)
Cohere Labs released TinyAya-Base (3.35B) and instruction-tuned TinyAya-Global plus regional variants, aiming for balanced quality across ~67 languages on consumer hardware. The drop also included a multilingual fine-tuning dataset, new benchmarks, and a technical report for reproducible multilingual experimentation.
Open-Web Simulator for Agent Training (22 minute read)
WebWorld uses a pipeline of 1M+ open-web interactions to simulate long-horizon (30+ step) browsing tasks, paired with a multi-metric WebWorld-Bench for intrinsic evaluation. Trajectories synthesized from the simulator boosted downstream web-agent performance and transferred to other domains like code, GUI, and games.
The future of design is code and canvas (2 minute read)
Figma users can now bring work from Claude Code into the platform. They just need to install the Figma MCP, type 'Send this to Figma', and the browser's rendered state will be automatically translated to fully editable Figma layers. Workflows are changing, and it's easy to get lost in the momentum of creation. The new Claude Code to Figma Design integration aims to help designers escape that tunnel vision, zoom out, and explore the big picture.
Why I'm Worried About Job Loss + Thoughts on Comparative Advantage (21 minute read)
Benign outcomes from technological transitions have always been the product of deliberate institutional design: labor law, antitrust enforcement, public education, and social insurance.
The Impossible Backhand (10 minute read)
Domain expertise is appreciating in value because AI can't easily replace it.
A Guide to Which AI to Use in the Agentic Era (18 minute read)
Three things to consider when deciding what AI to use: Models, Apps, and Harnesses.
Here are the 17 US-based AI companies that have raised $100M or more in 2026 (5 minute read)
Simile, Anthropic, Runway, Goodfire, Fundamental, ElevenLabs, PaleBlueDot AI, Decagon, Flapping Airplanes, Baseten, Inferact, OpenEvidence, humans&, SkildAI, Deepgram, Arena, and xAI.
Get the most interesting AI stories and breakthroughs delivered in a free daily email.
Join 920,000 readers for
one daily email