TLDR Dev 2026-03-27
Anthropic’s harness design 🧗, saving 500k with AI 💵, model collapse 📉
Harness design for long-running application development (28 minute read)
Anthropic engineers overcame limitations of AI like context anxiety and poor self-evaluation by implementing multi-agent architectures, including a GAN-inspired generator-evaluator loop. This system used structured feedback, explicit grading criteria, and contextual handoffs (or later, improved model capabilities) to guide AI during long-running development.
We Rewrote JSONata with AI in a Day, Saved $500K/Year (5 minute read)
Reco had large compute costs and latency due to its Go pipeline calling JavaScript JSONata processes over RPC for billions of events. So, they used AI to rewrite JSONata into 'gnata,' a pure-Go implementation, in just seven hours for $400 in tokens. This new solution delivered a 1,000x speedup on common expressions and eliminated $500K/year in expenses.
Reducing our monorepo size to improve developer velocity (12 minute read)
Dropbox reduced their 87GB monorepo to 20GB after discovering that Git's delta compression quirk was causing translation files to bloat pack files. By performing a tuned server-side repack, they slashed clone times from over an hour to under 15 minutes.
Building shared coding guidelines for AI (and people too) (11 minute read)
As AI agents increasingly write code, organizations must develop explicit coding guidelines for consistency and maintainability within enterprise codebases. These guidelines need to be prescriptive, covering fundamental ground rules like tech stack and methodologies, along with specific decisions on naming conventions, code layout, and error handling.
What Construction at a Train Station Taught Me About Software Engineering (4 minute read)
An engineer observed construction at a live train station and realized that software engineering shares the same core challenge: simultaneously keeping systems running while improving them. True software engineering is not measured by lines of code but by the ability to navigate complexity, manage trade-offs, and apply skills like system thinking and communication under constant constraints
Is your AWS costing you more time and money than it should? (Sponsor)
Most cloud modernization efforts stall because of inconsistent processes, undocumented "tribal" knowledge, and gaps between security policies and actual configurations.
DuploCloud helps teams find and fix these leaks, recently Spring Health cut incremental cloud costs by 30-40%.
Book your FREE AWS architecture review (normally $2,500) before March 31.
Cog (GitHub Repo)
Cog is a plain-text cognitive architecture for Claude Code that allows it to build and maintain its own persistent memory using familiar Unix tools like `grep` and `git diff`. It operates through a set of plain-text conventions and skills that guide Claude in structuring a three-tier memory system and performing actions like reflection and condensation.
Editor (GitHub Repo)
The Pascal Editor is a 3D building tool using React Three Fiber and WebGPU for creating architectural projects through a modular Turborepo architecture. It has a flat Node hierarchy managed by Zustand, enabling real-time geometry updates and interactive 3D design.
SuperSplat Editor (GitHub Repo)
The SuperSplat Editor is a free, open-source, web-based tool designed for inspecting, editing, optimizing, and publishing 3D Gaussian Splats.
From skeptic to convert: how Fieldy adopted Expo for their AI wearable (10 minute read)
Fieldy (an AI wearable app relying heavily on BLE and background audio) was skeptical of Expo adding overhead, but migrated and cut app size by 25%. The gains came from Expo Atlas-guided bundle cleanup (removing bloated date/calendar libs, fixing lodash tree shaking, and deleting an 18MB bundled SF Pro font), replacing XHR/blob hacks, and fully automating deployments with EAS Workflows.
My minute-by-minute response to the LiteLLM malware attack (18 minute read)
An engineer discovered and responded to a supply chain attack on the LiteLLM PyPI package, which began as a routine investigation into a frozen laptop. Using Claude Code, they quickly identified the malicious `litellm_init.pth` file designed to steal credentials, exfiltrate data, and establish persistence. From the first symptom to public disclosure and reporting to PyPI security and LiteLLM maintainers, the entire incident was resolved in just 72 minutes.
Model Collapse Is Already Happening, We Just Pretend It Isn't (6 minute read)
Model collapse, where AI models degrade over time, is already happening as new systems are increasingly trained on data generated by previous AI models. This recursive process smooths out rare and diverse patterns, leading to a loss of variance and homogenization in outputs.
The most important software engineering news in one daily email
Join 450,000 readers for
one daily email