Inside Reddit’s Architecture 🏛️, Lightweight Semantic Layer 🪶, Iceberg Spec Issues ⚠️

Rethinking Data Science Interviews in the Age of AI (11 minute read)

AI-powered tools are rapidly changing data science interviews by automating technical screenings, generating code solutions, and assessing candidates' analytical thinking with greater objectivity. Hiring managers must now focus on evaluating business problem-solving, communication, and real-world data skills beyond automated code tests, while candidates should demonstrate adaptability and contextual understanding.

TLDR Data 2025-07-07

Inside Reddit’s Architecture 🏛️, Lightweight Semantic Layer 🪶, Iceberg Spec Issues ⚠️

Deep Dives

Boring Semantic Layer + MCP = 🔥 (5 minute read)

Driving Content Delivery Efficiency Through Classifying Cache Misses (11 minute read)

Atlassian's 4 Million PostgreSQL Database Migration: When Standard Cloud Strategies Fail (12 minute read)

Opinions & Advice

9 Trends Shaping the Future of Data Management in 2025 (6 minute read)

Iceberg, The Right Idea - The Wrong Spec - Part 1 of 2: History (15 minute read)

How AI is Changing Software Engineering at Shopify with Farhan Thawar (47 minute podcast)

Launches & Tools

Event-Driven AI Agents: Why Flink Agents Are the Future of Enterprise AI (6 minute read)

A Guide to Converting ADK Agents with MCP to the A2A framework (5 minute read)

RAPIDS Adds GPU Polars Streaming, a Unified GNN API, and Zero-Code ML Speedups (2 minute read)

DuckLake 0.2 (5 minute read)

Miscellaneous

Rethinking Data Science Interviews in the Age of AI (11 minute read)

How Reddit Works 🔥 (15 minute read)

Quick Links

The One Trillion Row challenge with Apache Impala (7 minute read)

Analyzing PostgreSQL Performance Using Flame Graphs (7 minute read)

Curated deep dives, tools and trends in big data, data science and data engineering 📊