Why v1.35 reads like an AI-infrastructure release
Kubernetes has become the place where teams coordinate mixed production workloads: services, batch jobs, data pipelines, and ML training. The Kubernetes v1.35 ("Timbernetes") release reinforces that trajectory with changes that reduce operational friction in scheduling, resource management, and configuration workflows.
What stands out in v1.35 is practical: fewer restarts for resizing, new primitives for coordinated placement, and safer defaults for how teams generate and review manifests at scale.
Taken together, these updates point to a Kubernetes control plane that is adapting to bursty jobs, tightly coupled training runs, and continuously tuned inference services. Teams running mixed clusters tend to feel the pressure first in placement efficiency, resize churn, and configuration review hygiene. The rest of this piece focuses on the v1.35 changes that ease those pressures and make AI/ML operations more predictable at scale.
The changes that matter for AI/ML operations
Workload-aware scheduling arrives (alpha)
Kubernetes v1.35 introduces the Workload API and workload-aware scheduling, including an initial implementation of gang scheduling for "all-or-nothing" placement across a group of Pods. This helps distributed training and tightly coupled jobs avoid partial-placement patterns that waste capacity and stall progress.
If you want the deeper design context, the upstream proposal lives in the gang scheduling KEP.
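To make the "all-or-nothing" idea concrete, a gang-scheduled group might be declared along the following lines. This is a hypothetical sketch: the API group, kind, and field names are assumptions about the alpha proposal, not a verified schema, so consult the gang scheduling KEP before relying on any of them.

```yaml
# Hypothetical sketch only: names and structure below are illustrative
# assumptions about the alpha Workload API, not a confirmed schema.
apiVersion: scheduling.k8s.io/v1alpha1
kind: Workload
metadata:
  name: distributed-trainer
spec:
  podGroups:
  - name: workers
    policy:
      gang:
        minCount: 8   # place all 8 worker Pods together, or place none
```

The key property is the minimum count: if the scheduler cannot place the whole group, it places nothing, avoiding the stranded partial allocations described above.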
In-place Pod resize is stable
v1.35 graduates in-place Pod resource resize to Stable. CPU and memory adjustments can happen without restarting containers, which reduces churn for inference services that need fast tuning under load and improves recovery options for long-running workloads.
The upstream spec is captured in the in-place update of Pod resources KEP.
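A minimal sketch of a Pod that opts into restart-free resizing via a per-resource `resizePolicy` (the image name is a placeholder):

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: inference
spec:
  containers:
  - name: server
    image: registry.example.com/inference:latest  # placeholder image
    resources:
      requests:
        cpu: "500m"
        memory: "1Gi"
    resizePolicy:                   # how each resource change is applied
    - resourceName: cpu
      restartPolicy: NotRequired    # adjust CPU without a container restart
    - resourceName: memory
      restartPolicy: NotRequired
```

A resize is then applied through the dedicated subresource, for example `kubectl patch pod inference --subresource resize --patch '{"spec":{"containers":[{"name":"server","resources":{"requests":{"cpu":"800m"}}}]}}'`.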
Device allocation keeps moving toward a baseline capability
Dynamic Resource Allocation (DRA) is treated as a core building block for device-aware orchestration, and v1.35 keeps DRA enabled as part of the platform's ongoing work in this area (see the DRA section of the v1.35 release notes). For AI/ML teams, the release continues the push toward more predictable device claims and a cleaner path to richer scheduling semantics over accelerators.
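For orientation, a DRA claim template roughly takes the following shape. The device class name is a placeholder, and the request syntax has shifted across DRA API versions, so treat this as a sketch and verify the fields against your cluster's `resource.k8s.io` API version.

```yaml
# Sketch under assumptions: gpu.example.com is a placeholder device class,
# and the request syntax should be checked against your cluster's DRA API version.
apiVersion: resource.k8s.io/v1
kind: ResourceClaimTemplate
metadata:
  name: single-gpu
spec:
  spec:
    devices:
      requests:
      - name: gpu
        exactly:
          deviceClassName: gpu.example.com
```

A Pod then lists the claim under `spec.resourceClaims` and references it from a container's `resources.claims`, letting the scheduler pick a node whose driver can satisfy the device request.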
KYAML becomes the default kubectl output format
Kubernetes is also tightening the "last mile" of configuration workflows. With v1.35, kubectl output defaults to KYAML, a stricter subset designed to reduce ambiguous YAML behaviors and common formatting hazards. The canonical description is in the KYAML reference.
If you need compatibility testing or controlled rollouts, the upstream design and toggles are documented in the KYAML KEP (including KUBECTL_KYAML=false).
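To see the difference, here is an approximation of how a small object renders in KYAML, with double-quoted strings and explicit braces; the exact formatting rules live in the KYAML reference, so treat this rendering as illustrative rather than canonical.

```yaml
# Classic YAML (kubectl get configmap example -o yaml), abbreviated:
#   apiVersion: v1
#   kind: ConfigMap
#   data:
#     port: "8080"
#
# Approximate KYAML rendering of the same object:
{
  apiVersion: "v1",
  kind: "ConfigMap",
  metadata: {
    name: "example",
  },
  data: {
    port: "8080",
  },
}
```

The stricter quoting and explicit delimiters are what remove the classic YAML ambiguities (for example, unquoted values like `no` or `1.20` being reinterpreted) that KYAML is designed to avoid.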
Why AI keeps pushing teams toward a shared operating layer
Production AI systems combine workloads with fundamentally different operating profiles: bursty training jobs, steady inference services, and pipelines that feed both. The common requirement is a consistent operational surface for scheduling, scaling, governance, and policy enforcement across teams and environments.
That pressure is growing. Gartner projects that over 40% of agentic AI projects will be canceled by the end of 2027, driven by escalating costs, unclear business value, and inadequate risk controls. Teams that want durable outcomes need repeatable paths to production that make cost, reliability, and governance measurable.
Platform engineering implications
The teams that scale AI programs usually standardize "how AI ships" across the organization. Internal platforms and golden paths help encode guardrails without blocking iteration:
- curated patterns for training and inference
- policy-as-code for quotas, access, and controls
- self-service workflows with auditability
v1.35's direction (workload-aware scheduling primitives, stable in-place resize, and safer configuration output) supports platform teams that want to reduce bespoke infrastructure work while keeping Kubernetes as the consistent substrate.
Ecosystem note: Ingress NGINX retirement timeline
Ingress NGINX is in best-effort maintenance until March 2026, after which it will be retired with no further releases, bug fixes, or security updates. The retirement announcement and operational implications are captured in Ingress NGINX Retirement: What You Need to Know and reinforced by the Kubernetes Steering and Security statement.
For operators, this is a planning item: inventory current usage, define a migration path, validate in staging, and document rollback expectations.
Practical evaluation steps for v1.35
If you run AI/ML workloads on Kubernetes:
- Trial workload-aware scheduling and gang placement in a non-production cluster (the feature is alpha).
- Exercise in-place Pod resize on a representative inference service and confirm containers are not restarted.
- Review DRA device claims if you schedule accelerators.
- Run your manifest tooling against KYAML output, falling back to KUBECTL_KYAML=false where compatibility issues surface.
- Start planning your migration off Ingress NGINX ahead of the March 2026 retirement.
Kubernetes v1.35 improves the parts of the platform that tend to break first under AI load: coordinated placement, resource management with less disruption, and safer configuration output. Teams that treat Kubernetes as the shared operating layer for AI gain a simpler path to scale, because the platform absorbs more of the operational complexity over time.
About the author
Angel Ramirez
Angel Ramirez is a CNCF Ambassador and Kubestronaut with over 17 years of experience in cloud-native architecture and platform engineering. He focuses on operationalizing Kubernetes for AI and enterprise workloads and contributes to community discussions around platform standardization and governance.



