TLDR AI 2025-04-15
OpenAI GPT-4.1, Hugging Face acquires Pollen Robotics, DolphinGemma
Delve goes viral on X for pulling all-nighters to ship autopilot for SOC 2 (Sponsor)
Delve officially launched Computer Use Agents that let founders and GRC teams auto-capture all screenshots for SOC 2.
And customers are using it to achieve incredible results:
- Lovable got fully SOC 2 compliant in less than 20 hours
- 11x ditched their old compliance platform, saved 143 hours on SOC 2, and unlocked $1.2M ARR
- Bland AI got SOC 2 and unlocked $500k ARR within 7 days
If you want to ditch your old platform, they'll even migrate you off for FREE.
Book a demo here for $2000 off compliance in April!
(PS: TLDR readers get free custom Arc'teryx jackets)
OpenAI GPT-4.1 (12 minute read)
OpenAI has launched three new models in its API: GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano. These models outperform GPT-4o and GPT-4o mini across the board, with major gains in coding and instruction following. They also have larger context windows, supporting up to 1 million tokens of context, and are able to better use that context with improved long-context comprehension. They feature a refreshed knowledge cutoff of June 2024.
DolphinGemma (6 minute read)
DeepMind has announced DolphinGemma, a large language model developed by Google that helps scientists study how dolphins communicate and, hopefully, find out what they're saying too.
Hugging Face Acquires Pollen Robotics (4 minute read)
Hugging Face, the center of the open source AI community, has long stated its goal is to be a decentralized DeepMind. While this isn't exactly the case, adding in an open source robotics platform via Pollen moves it closer to that goal.
Engineering & Research
Visual Reasoning with Less Data (16 minute read)
Using MCTS to quantify sample difficulty, ThinkLite-VL improves reasoning in VLMs with just 11k training samples and no distillation.
Improved MoE with C3PO (GitHub Repo)
C3PO introduces a new test-time optimization technique that improves accuracy in Mixture-of-Experts LLMs by re-mixing expert weights based on similar reference samples.
3B parameter tokenizer (GitHub Repo)
Scaling up image tokenizers is challenging because they tend to collapse. This work introduces GigaTok, which is a massive tokenizer with superior reconstruction performance. Decoder scaling and regularization helped with stability and overall quality.