Self-correcting AI SQL 📝, Blazing-fast JOINs 🏎️, SQLRooms Local-first Analytics 🐤

Your RAG prototype works. Now what? (Sponsor)

AI-savvy developers can spin up a basic RAG app in an afternoon. But when you start looking at real-time data pipelines, LLM observability, and enterprise security - suddenly your weekend project needs serious infrastructure.

AWS Marketplace offers RAG-specific tools that handle the hard parts: vector and graph databases, hybrid search capabilities, context augmentations, and monitoring solutions built for LLM applications — all integrated with your existing AWS services like Bedrock and Lambda.

Explore the latest tools and technical guides

TLDR Data 2025-06-30

Self-correcting AI SQL 📝, Blazing-fast JOINs 🏎️, SQLRooms Local-first Analytics 🐤

Your RAG prototype works. Now what? (Sponsor)

Deep Dives

Join Me if You Can: ClickHouse vs. Databricks & Snowflake - Part 2 (11 minute read)

MUVERA: Making multi-vector retrieval as fast as single-vector search (5 minute read)

Discovering DuckDB Use Cases via GitHub (8 minute read)

I Made Cursor + AI Write Perfect SQL. Here's the Exact Setup (17 minute read)

Opinions & Advice

Lakebase: Databricks' Bold Play to Fuse OLTP and the Lakehouse (3 minute read)

You've Got 99 Problems but Data Shouldn't Be One (29 minute podcast)

Stop Building AI Agents (9 minute read)

Launches & Tools

What's Driving Fluent Bit Adoption? (7 minute read)

videoparquet (GitHub Repo)

Foursquare Introduces SQLRooms (8 minute read)

Miscellaneous

Pipelining AI/ML Training Workloads with CUDA Streams (7 minute read)

Grafana's CTO on the State of the Observability Market (8 minute read)

Quick Links

MS-MARCO-Web-Search (GitHub Repo)

REGEN: Empowering Personalized Recommendations with Natural Language (7 minute read)

How to Fix Data Skew in Apache Spark with the Salting Technique (2 minute read)

Curated deep dives, tools and trends in big data, data science and data engineering 📊