TLDR AI 2026-06-12
OpenAI buys Ona π€, Anthropic backtracks π, Xiaomiβs MiMo code π¨βπ»
OpenAI Acquired Ona for Long-Running Agents (1 minute read)
OpenAI announced it would acquire Ona to bring secure cloud execution and orchestration capabilities into the Codex platform. The technology is intended to support persistent, customer-controlled environments where agents can continue working across extended periods and sessions.
Anthropic backtracks on policy that 'sabotaged' researchers' work (2 minute read)
Anthropic has decided to make its safeguards for frontier LLM development visible after backlash from researchers. The company had previously discreetly rerouted requests to a lesser model when asked to perform certain actions. Researchers found that Claude Fable 5 was either refusing or degrading responses for tasks like training competing models, debugging AI code, and optimizing neural architecture. This raised concerns about Anthropic's lack of transparency and also that tokens and money had been spent on a model that didn't do what was expected.
π§
Deep Dives & Analysis
Finding Optimal Tokenizers (15 minute read)
Frontier AI models are typically trained on sequences of integers known as tokens. Each token refers to some sequence of bytes, and these byte sequences often correspond to common words. This post presents an algorithm that can compute an optimal tokenizer in some settings.
Making a vintage LLM from scratch (50 minute read)
This post shares how a developer created their own LLM from scratch. It covers how they create their own base-training and fine-tuning scripts, data processing pipelines, and custom dataset. The total cost of the project was around $80, but they had a decent PC to process the data. The model and code are available in the post.
Can Compute Commoditize if it's Not Fungible? (5 minute read)
CoreWeave's co-founder, Brannin McBee, recently claimed that compute isn't fungible the way a commodity has to be. He has a real argument, but the non-commodity framing is the keystone of his company's value. While he appears to be saying that there is no market, he's actually pricing the market and revealing where the spread still hides.
π¨βπ»
Engineering & Research
You wouldn't bet your business on a fortune cookie...but that's how you're using AI (Sponsor)
AI models are brilliant, but they're probabilistic. Without the ground truth of your operational reality, trusting AI is like trusting a fortune cookie. The
Celonis Context Model is a dynamic digital twin of your business that gives AI agents the operational clarity they need to drive real ROI.
Meet the Context ModelXiaomi's new open source, agentic AI coding harness MiMo Code beats Claude Code at ultra-long, 200+ step tasks (14 minute read)
MiMo Code V0.1.0 is an open source terminal-native AI coding assistant from Xiaomi. It outperforms Claude Code on key agentic coding benchmarks, particularly on long-horizon, multi-step tasks. MiMo Code features a cross-session memory system that uses an independent subagent to take notes of decisions, issues, and the scope of the project as it progresses. The model is available on GitHub under an MIT license.
Predictive Data Debugging: Reveal and Shape What Your Model Learns, Before You Train (11 minute read)
Predictive data debugging identifies potential model behaviors before training by analyzing preference datasets. This technique, integrated into the Silico platform, allows engineers to reshape datasets or training processes to prevent undesired effects, improving both performance and safety. Case studies reveal common issues like compromised safety guardrails, hallucinated links, and context-specific sycophancy, allowing targeted interventions to fix these problems before deployment.
Optimizing PyTorch with Fused MLPs (29 minute read)
This deep dive explores how PyTorch executes and optimizes neural network layers, progressing from individual linear operations to a fused multilayer perceptron.
Get the most interesting AI stories and breakthroughs delivered in a free daily email.
Join 920,000 readers for
one daily email