
TLDR Data 2026-05-18

Query Planning Slowdown 🐢, Airbnb’s Data Mesh 🧩, Ontology-Driven Policies 🧬

📱

Deep Dives

Our billing pipeline was suddenly slow. The culprit was a hidden bottleneck in ClickHouse (9 minute read)

AWS Outage May 2026: Lessons for Database Disaster Recovery (10 minute read)

Viaduct 1.0 and the Future of Airbnb's Data Mesh (5 minute read)

🚀

Opinions & Advice

The Modern Data Stack is Overcomplicated: Data Ingestion (17 minute read)

Welcome to ORDER BY Jungle (11 minute read)

Exploring schema evolution with ontology-driven propagation (4 minute read)

💻

Launches & Tools

ducklake-sdk (GitHub Repo)

Apache Arrow as Data Interchange (5 minute read)

What Matters in Production RAG (8 minute read)

🎁

Miscellaneous

Context pruning: cut LLM tokens without losing quality (9 minute read)

Your AI agent deletes critical data: Who is responsible? (5 minute read)

⚡️

Quick Links

Curated deep dives, tools and trends in big data, data science and data engineering 📊
