TLDR DevOps 2026-02-25
AI Operating System ✨, Agentic DevOps 🧱, Lines of Code 🫥
Kubernetes as AI's operating system: 1.35 release signals (4 minute read)
Kubernetes v1.35 ("Timbernetes") shipped with stable in-place Pod resource resizing, gang scheduling for distributed AI workloads, and a new default to KYAML output format designed to reduce configuration errors. The release addresses operational pain points for teams running mixed production workloads—particularly AI/ML jobs—by enabling coordinated Pod placement, CPU/memory adjustments without container restarts, and stricter YAML formatting guardrails.
New in Pulumi IaC: `onError` Resource Hook (2 minute read)
Pulumi released a new onError hook feature that lets developers automatically retry failed cloud infrastructure operations or handle errors with custom code, useful for transient failures like DNS propagation delays or resource readiness issues. The feature is available in Node, Python, Go, and .NET SDKs as of version 3.219.0.
Choosing between Amazon ECS Blue/Green Native or AWS CodeDeploy in AWS CDK (7 minute read)
Amazon Elastic Container Service now offers native blue-green deployments, enabling zero-downtime releases with built-in lifecycle hooks, bake time, and rollback without requiring AWS CodeDeploy. While CodeDeploy still supports canary and linear traffic shifting and CodePipeline integration, ECS native is the recommended default for simpler, service-centric deployments in CDK.
Lines of Code Are Back (10 minute read)
AI has revived lines of code as a success metric. When code is nearly free to generate, volume measures nothing and correlates with more duplication, churn, and false productivity gains. Instead, engineering leaders should track outcomes like time-to-value, code durability, defect rates, and team comprehension, which reflect real impact rather than output.
The End of CI/CD Pipelines: The Dawn of Agentic DevOps (9 minute read)
Agentic DevOps tools like GitHub Copilot and Microsoft's Azure SRE Agent shift CI CD from deterministic pipelines to AI-driven judgment that can autonomously debug tests, remediate incidents, and reduce toil. While increasing velocity, they introduce opaque decision-making, novel failure modes, and new operational risks that require guardrails, shadow deployments, and strong oversight.
300 Days of RuboCop (20 minute read)
A Ruby developer spent 300 days introducing RuboCop into a 1M+ line legacy Rails monolith, enabling cops incrementally and fixing offenses through hundreds of small, reviewable PRs instead of a big-bang migration. The effort reduced PR review noise, uncovered real bugs and risky patterns, led to upstream contributions to RuboCop, and concluded once the remaining cops offered diminishing returns relative to ROI.
Refactoring Azure Quick Review with GitHub Copilot (12 minute read)
Azure Quick Review azqr was completely refactored using GitHub Copilot plan and agent modes under architectural supervision, eliminating major technical debt and 72 scanner packages. The overhaul removed 16,506 net lines across 560 files and improved scan performance by up to 90 percent.
Get our free daily newsletter with curated tools 💻, trends 📈, and insights 💡, for DevOps Engineers 👨💻
Join 340,000 readers for
one daily email