Great on paper but serious limitations in autonomy and security.
In the default Kubeflow model, tenants are assigned namespaces within a single shared Kubernetes cluster. This leads to impressive GPU utilization (often >90%)—but creates issues for security, autonomy, and flexibility:
Weak tenant isolation: Shared control plane means no clear boundary between tenants.
Limited autonomy: Tenants can’t self-manage Kubeflow projects or other workloads.
CRD restrictions: Difficult or impossible for tenants to install or manage custom resources.
Bottom Line: You’re maximizing hardware use, but compromising on platform flexibility and security.