Deep Dive into the Gateway API Inference Extension
Running AI inference workloads on Kubernetes has some unique characteristics and challenges, and the Gateway API Inference Extension project aims to solve some of those challenges. I recently wrote about these new capabilities in the kgateway… ⌘ Read more