TLDR AI 2025-04-09
State of AI Index 2025 🤖, Waymo’s Privacy Concerns 🚗, Together AI DeepCoder 💻
DeepCoder (12 minute read)
Together AI has trained a coding model that is competitive with closed source reasoning models. It has released data, code, training recipes, and showcased the model's strong long context ability.
A New Batch Normalization (16 minute read)
This paper proposes a new batch normalization method for SPD manifolds that uses a learnable Generalized Bures-Wasserstein metric.
Benchmarking Open Source models for OCR (17 minute read)
OCR is the task of recognizing text in images. It is challenging in the long tail and immensely valuable when done right. Many closed models such as the Gemini series are exceptional at this task. The new suite of Llama 4 models pushes the state of the art for open models forward substantially.
Arabic AI Benchmarks (9 minute read)
Inception and MBZUAI have launched a unified Arabic AI evaluation platform featuring updated AraGen benchmarks and a new instruction-following leaderboard built on the Arabic IFEval benchmark.
17K reasoning traces from R1 (Hugging Face Hub)
A great set of reasoning traces from R1 that can be used as training data to distill a smaller reasoner or kick start the RL process.
How Students Use Claude in Education (12 minute read)
Anthropic analyzed one million student conversations to understand AI usage in education. STEM students dominate usage, primarily leveraging Claude for content creation, technical problem-solving, and higher-order learning tasks.
Get the most interesting AI stories and breakthroughs delivered in a free daily email.
Join 920,000 readers for
one daily email