Iceberg for AI 🤖, HashMap Freeze Lesson 🧊, Choosing Graph Models 🕸️

Play to win with NVIDIA at Microsoft Build, online & on-site (Sponsor)

Unlock developer-first, hands-on experiences with NVIDIA agentic solutions on Azure at Microsoft Build, happening in San Francisco and online June 2–3.

From hands-on labs and demos to speaking sessions and interactive events, Microsoft Build offers developers a unique opportunity to go deep on real code, real systems, and real workflows.

New this year: Online only, June 1–5, visit the NVIDIA Builder's Arcade daily for developer challenges and the chance to score exclusive NVIDIA discounts.

Learn more

TLDR Data 2026-05-25

Iceberg for AI 🤖, HashMap Freeze Lesson 🧊, Choosing Graph Models 🕸️

Play to win with NVIDIA at Microsoft Build, online & on-site (Sponsor)

Deep Dives

The 58-Million-Key Freeze: What a HashMap Resize Taught Us About Memory Allocation at Scale (10 minute read)

Choosing the Right Graph (28 minute read)

The Hugo evolution: Engineering Grab's unified, one-click data ingestion platform with Apache Flink (4 minute read)

Opinions & Advice

From Batch to Streaming and AI, Iceberg for Everyone by Everyone (34 minute video)

Plan Mode All the Time, Substrait over SQL, and the End of the DE Role ft (15 minute read)

Of Hammers and Nails: What AI Can and Cannot Do for a Data Analyst (6 minute read)

Launches & Tools

DuckDB 1.5.3: Not an Ordinary Patch Release (4 minute read)

Introducing Dimster, a performance benchmarking tool for Apache Kafka (13 minute read)

pg_infer 1.0.0 released -- transformer model knowledge as SQL relations (4 minute read)

Miscellaneous

Same buffers, same instructions, same hardware. Where Is the JVM Tax? (17 minute read)

SAM 3: Segment Anything with Concepts (GitHub Repo)

Quick Links

Bintrail: MySQL Time-Travel Queries Using Indexed Binlogs (3 minute read)

Cloud Native Computing Foundation Announces OpenTelemetry's Graduation, Solidifying Status as the De Facto Observability Standard (10 minute read)

7 Temporal Blind Spots Breaking Enterprise RAG (7 minute read)

Curated deep dives, tools and trends in big data, data science and data engineering 📊