TLDR

TLDR Data 2025-08-21

Semantic Layers Matter πŸ—‚οΈ, Grab Decentralizes Data Ownership βœ…, Databricks Goes Real-Time ⚑

πŸ“±

Deep Dives

Data mesh at Grab part I: Building trust through certification (7 minute read)

Inside ClickHouse Full-text Search: Fast, Native, and Columnar (25 minute read)

Kafka to Iceberg - Exploring the Options (13 minute read)

πŸš€

Opinions & Advice

LLM Evaluation: Practical Tips at Booking.com (11 minute read)

No More Excuses for Stream/Table Duality (2 minute read)

5 Things in Data Engineering That Still Hold True After 10 Years (9 minute read)

πŸ’»

Launches & Tools

Lance (GitHub Repo)

Presidio (GitHub Repo)

Introducing Real-Time Mode in Apache Sparkβ„’ Structured Streaming (5 minute read)

🎁

Miscellaneous

The Pragmatic Engineer 2025 Survey: What's in your tech stack? (15 minute read)

Why Semantic Layers Matter β€” And How to Build One with Duckdb (22 minute read)

⚑️

Quick Links

Spotify Data Tech Stack (4 minute read)

Curated deep dives, tools and trends in big data, data science and data engineering πŸ“Š

Join 400,000 readers for one daily email