KAI-Scheduler (https://github.com/kai-scheduler/KAI-Scheduler) is an open-source CNCF project focused on delivering the best scheduling experience for AI workloads on Kubernetes. Adopted by AI frontier labs, leading enterprises, and some of the largest AI infrastructure deployments in the world, KAI helps organizations efficiently run AI at scale.
KAI is designed to support any AI infrastructure, from the latest GPU and networking technologies to future hardware generations, while maximizing performance, utilization, and scalability. As a Principal Engineer for KAI, you will drive the technical direction of the project and shape the future of AI scheduling in the Kubernetes ecosystem.
What you’ll be doing:
+ Define the technical roadmap of KAI-Scheduler, driving its architecture, APIs, extensibility, performance, and long-term direction while keeping it aligned with the evolving Kubernetes ecosystem.
+ Drive the scalability strategy of KAI for massive-scal...