TLDR AI 2026-04-10
OpenAI $100 plan π³, Claude Cowork GA π’, Perplexity x Plaid πΈ
Aman Sanger, Cursor: βAfter switching to turbopuffer, we're saving an order of magnitude in costsβ (Sponsor)
turbopuffer is the
serverless search engine for AI. It's built on object storage with smart caching - so it's fast, cheap, and extremely scalable.
β
Used by Cursor, Notion, Linear, Cognition, Atlassian, Ramp, Granola, Legora...
β
Fast β ~20ms p90 latency on 10M documents
β
Truly serverless β Run your first query in ~4 minutes
β
Up to 95% cheaper than traditional vector/text search databases
TLDR readers get their first month free. Sign up here.
ChatGPT $100 Pro Plan Introduced (2 minute read)
OpenAI introduced a new $100/month ChatGPT Pro tier aimed at power users, expanding its pricing lineup between the $20 Plus and $200 Pro plans. The company confirmed the $200 plan still exists despite not being listed on the pricing page.
Making Claude Cowork ready for enterprise (3 minute read)
Claude Cowork is now enterprise-ready, enhancing work with organization controls like role-based access, group spend limits, and expanded observability features. Admins can manage adoption with detailed usage analytics and integrate tools like Zoom for seamless workflows. Companies like Zapier and Airtree have already leveraged these features for improved project management and operational efficiency.
Perplexity launches Personal Finance powered by Plaid (2 minute read)
Perplexity has expanded its financial services with Plaid integration, transitioning from portfolio tracking to a comprehensive finance dashboard. Users can link checking, savings, credit card, and loan accounts to analyze spending, track liabilities, and calculate net worth. Targeted at AI-savvy consumers, it combines data insights with Plaid's infrastructure, transforming from a portfolio add-on to a full personal finance hub without executing trades.
Alibaba Claims Viral Happy Horse AI Model in Latest Breakthrough (3 minute read)
The Happy Horse video AI model that sent ripples across China's AI industry this week was created by Alibaba. The model hit the top spot on the text-to-video leaderboard of Artificial Analysis this week. Alibaba plans to provide API access to the model in the near future.
π§
Deep Dives & Analysis
Agentic Infrastructure on Vercel (8 minute read)
Vercel argued that coding agents were reshaping how software gets built and deployed, with agent-initiated deployments rising sharply and accounting for more than 30% of weekly deployments. The post framed this shift as a need for new infrastructure designed for agents to deploy software, run AI systems, and increasingly operate infrastructure autonomously.
CoreWeave Takes As Much Financial Engineering As It Does Datacenter Design (4 minute read)
CoreWeave's revenue backlog is up to $87.8 billion, with Meta represents 40.1% of that backlog. OpenAI's deals represent 25.5% of extended backlog. CoreWeave only has two dozen named customers, but it likely has more. The company achieved $5.13 billion in sales in 2025, up 2.7x from 2024, and posted a net loss of $1.17 billion. It recently held a private offering of $1.75 billion in notes to cover its data center expansion costs.
Research-Driven Agents: What Happens When Your Agent Reads Before It Codes (16 minute read)
Coding agents generate better optimizations when they read papers and study competing projects before touching code. Coding agents working from code alone generate shallow hypotheses. Adding a research phase improves results.
π¨βπ»
Engineering & Research
IronClaw: the secure agent harness for always-on AI (Sponsor)
You're one API key away from an agent that codes, commits, and pulls from GitHub and Slack autonomously. What's stopped you is trusting it with credentials.
IronClaw isolates every tool in a Wasm sandbox. Credentials never reach the model.
Open-source, built in rust, deployable on NEAR AI Cloud. Verifiably yours.
Try IronClaw Free β
Process Driven Image Generation (22 minute read)
Meta has introduced a multi-step framework for image synthesis that alternates between textual reasoning and visual updates. This enables models to iteratively plan, draft, reflect, and refine outputs.
Multimodal Embedding & Reranker Models with Sentence Transformers (8 minute read)
The v5.4 update of Sentence Transformers introduces multimodal embedding and reranker models that enable encoding and comparing texts, images, audio, and videos within a shared embedding space for tasks like cross-modal search and retrieval-augmented generation. These advanced models map different modalities into a consistent embedding space and assess relevance across mixed-modality pairs, though lower cross-modal similarity scores may occur due to modality clustering. Notably, multimodal rerankers provide high-quality scoring by evaluating individual pairs for more accurate retrieval results.
Anthropic launches advisor tool for Claude Platform API users (1 minute read)
The new advisor tool on Anthropic's Claude Platform API provides developers with the ability to use Opus as an advisor alongside Sonnet or Haiku as executors. This allows agents to access advanced reasoning capabilities while maintaining operational costs at the level of the more efficient executor models. The advisor tool is now publicly available through a configuration in the Messages API request.
Efficient Rollout Scaling for Diffusion RL (4 minute read)
NVIDIA's Sol-RL introduces a two-stage framework that separates exploration and training, using FP4 rollouts to generate large candidate sets and BF16 for selective policy updates. This approach reduced compute costs while improving alignment and accelerating convergence in diffusion model post-training.
Introducing KellyBench (2 minute read)
KellyBench evaluates sequential decision-making in sports betting by simulating the 2023-24 English Premier League season. Models like Claude Opus 4.6 and GPT-5.4 struggled, with none achieving positive returns, highlighting their limitations in adapting to long-term strategies. This underscores a need for complex environments enabling agents to learn from experience under uncertainty.
Dario Says Continual Learning Is Solved. Is It? (5 minute read)
Continual learning research is aimed at breaking past the feasible horizon of current techniques. If models can't learn new things while performing tasks, they will struggle when the task horizon grows very long. Context management and other engineering improvements can push task horizons to weeks, or even months, but very long-horizon tasks remain out of reach. Human-level continual learning may be 'solved', but the revolution is not yet complete.
Get the most interesting AI stories and breakthroughs delivered in a free daily email.
Join 920,000 readers for
one daily email