TLDR DevOps 2026-03-23
Kubernetes 1.36 π, Datadog Terraform Provider v4 π, Agents On Kubernetes β¨
βοΈ Multi-cloud is tough, ngrok's API Gateway makes it easy βοΈ (Sponsor)
Learn how to solve common issues with multi-cloud networking - either active/active or failover - with ngrok's k8s Ingress/Gateway controller.
ngrok's k8s Ingress/Gateway controller routes, shifts and balances traffic between different clouds, giving you fine-grained bi-directional private communication between any k8s clusters regardless of environment (even locally).
No more learning cloud-specific tools and implementations, and no more worrying about if your securities policies are consistent.
π Get the guide and wrangle those cloudsπ€ βοΈ
What to Expect From Kubernetes 1.36 (4 minute read)
Kubernetes 1.36, expected April 22, introduces security and platform improvements, including stronger Linux user namespace support, Gateway API adoption as Ingress-Nginx retires, WatchCache performance updates, enhanced Dynamic Resource Allocation, OCI artifact volumes, and manifest-based admission control configuration.
Announcing the Datadog Terraform provider v4.0.0 (4 minute read)
Datadog released version 4.0.0 of its Terraform provider, introducing predictable monitor access controls, a unified AWS integration resource that replaces four legacy resources, and enhanced security standards, including one-time read application keys. The upgrade also moves the provider to Terraform protocol v6 to support future schema improvements, though teams can continue using v3 configurations until they're ready to migrate.
Announcing Ingress2Gateway 1.0: Your Path to Gateway API (5 minute read)
Kubernetes SIG Network released Ingress2Gateway 1.0, a migration tool that translates Ingress resources to Gateway API configurations and now supports over 30 common Ingress-NGINX annotations (up from just three before 1.0), ahead of the March 2026 Ingress-NGINX retirement deadline. The tool includes controller-level integration tests that verify behavioral equivalence in live clusters and provides warnings about untranslatable configurations to help teams safely modernize their networking stack.
Running Agents on Kubernetes with Agent Sandbox (3 minute read)
Kubernetes is getting a new Agent Sandbox project designed to handle the shift from short-lived AI tasks to long-running autonomous agents that need persistent identity, isolated environments, and the ability to suspend and rapidly resume. The project introduces a Sandbox CRD and SandboxWarmPool feature that eliminates cold-start delays by maintaining pre-provisioned pods, solving the problem of managing stateful, singleton AI workloads at scale.
A Case Against Currying (8 minute read)
Curried functions are elegant and common in functional programming, but their main advantageβpartial applicationβis not unique and can be replicated with other styles using simple syntactic techniques. Tuple-style functions are often more intuitive, composable, and better aligned with how functions conceptually map inputs to outputs, despite currying's aesthetic and niche advantages.
Scaling Autonomous Site Reliability Engineering: Architecture, Orchestration, and Validation for a 90,000+ Server Fleet (6 minute read)
Cloudways built CW Copilot, an AI-powered SRE agent that monitors over 90,000 servers and analyzes incidents to generate automated diagnostics, remediation steps, and fixes, reducing support workload and speeding resolution using LLM reasoning, orchestration via Ansible, and DigitalOcean serverless inference.
AWS Lambda Managed Instances now supports Rust (2 minute read)
AWS Lambda Managed Instances now supports Rust, allowing developers to run high-performance Rust functions on Lambda-managed EC2 instances with built-in scaling, routing, and load balancing while benefiting from improved utilization and EC2 pricing models.
Aurora Superintelligence: AI That Doesn't Guess (Sponsor)
From Terraform to Autopilot: AI-Assisted Automation for Azure Container Apps (28 minute read)
Prevent infrastructure mistakes by combining GitHub Copilot custom instructions, GitHub Actions pipelines, and Managed Identities to enforce conventions, automate deployments, and eliminate credentials.
What's new in Azure SRE Agent in the GA release (2 minute read)
Azure SRE Agent is now generally available with guided onboarding, Deep Context learning, and integrations across logs, code, incidents, and Azure resources to enable automated investigations, faster root cause analysis, and workflow automation across operational systems.
Monitor Model Context Protocol (MCP) servers with OpenLIT and Grafana Cloud (4 minute read)
Grafana Cloud now offers built-in observability for Model Context Protocol (MCP) servers through integration with OpenLIT's auto-instrumentation tool, allowing developers to monitor AI agent interactions with external data sources using pre-built dashboards that track tool performance, latency, and errors.
Get our free daily newsletter with curated tools π», trends π, and insights π‘, for DevOps Engineers π¨βπ»
Join 340,000 readers for
one daily email