TLDR Data 2026-04-09

Netflix’s Time-Series Caching 🗄️, Airflow 3.2 Released 🚀, Meta’s Pipeline Context 🗺️

📱

Deep Dives

Stop Answering the Same Question Twice: Interval-Aware Caching for Druid at Netflix Scale (10 minute read)

How Meta Used AI to Map Tribal Knowledge in Large-Scale Data Pipelines (8 minute read)

Proxy-Pointer RAG: Achieving Vectorless Accuracy at Vector RAG Scale and Cost (23 minute read)

Semantic Layer vs. Text-to-SQL: 2026 Benchmark Update (11 minute read)

🚀

Opinions & Advice

Is Data Visualization dead? (4 minute read)

SQL Superpowers: Your Streaming Delta Lake Pipeline Has Been Quietly Falling Apart (5 minute read)

💻

Launches & Tools

Apache Airflow 3.2.0: Data-Aware Workflows at Scale (6 minute read)

When Every Bit Counts: How Valkey Rebuilt Its Hashtable for Modern Hardware (37 minute video)

🎁

Miscellaneous

Dagster vs Airflow 3 (Reddit Thread)

Simplest hash functions (11 minute read)

⚡️

Quick Links

Curated deep dives, tools and trends in big data, data science and data engineering 📊
