Scaling Trino Simply 🌐, Snowflake Join Trap 🐌, Death of BI Layers 📉

“The database is still provisioning? Sure, I'd love to wait longer,” said no one ever. Skip the wait and get to the real work with Lakebase. (Sponsor)

Lakebase is a serverless Postgres DB that lets you branch your whole DB almost instantly. Run tests on prod data, spin up AI apps, and more. With separate storage and compute, you wait less and build faster. With Lakebase you can:

Branch databases for testing
Scale up fast — and down to zero just as easily
Run apps, agents and AI on one database
Use one database for operational and analytical data

Get the Databricks founders' rundown on Lakebase, jump straight to building or watch a walk-through. The choice is yours.

TLDR Data 2026-03-26

Scaling Trino Simply 🌐, Snowflake Join Trap 🐌, Death of BI Layers 📉

“The database is still provisioning? Sure, I'd love to wait longer,” said no one ever. Skip the wait and get to the real work with Lakebase. (Sponsor)

Deep Dives

Why Your Snowflake Joins Are Slow: Fix OR Joins Fast (12 minute read)

Volga - Data Processing for Real-Time AI/ML (20 minute read)

Beyond the Vector Store: Building the Full Data Layer for AI Applications (7 minute read)

Opinions & Advice

Future Casting the Modern Data Stack (20 minute read)

Where Is the Right Place to Catch Data Volume Anomalies? (6 minute read)

Databricks Metric Views and the Reality of the Semantic Layer (5 minute read)

Launches & Tools

Apache Iceberg Rust 0.9.0 Release (2 minute read)

When upserts don't update but still write: Debugging Postgres performance at scale (11 minute read)

Operating Trino at Scale With Trino Gateway (9 minute read)

Miscellaneous

State of Context Engineering in 2026 (10 minute read)

What COVID did to our forecasting models (12 minute read)

The Death of model.fit(): What Data Scientists Actually Do in the Age of AI Agents (12 minute read)

Quick Links

Why Data Engineers Should Care About Pydantic (5 minute read)

QL, Typescript, and Agents (6 minute read)

Curated deep dives, tools and trends in big data, data science and data engineering 📊