| | LiteLLM | Portkey | Kong AI | Envoy AI |
|---|---|---|---|---|
| AuthN / AuthZ | • | • | • | • |
| Guardrails (in + out) | • | • | • | • |
| Model routing | • | • | • | • |
| Semantic cache | • | • | • | ◐ |
| Token rate limiting | • | • | • | • |
| Protocol translation | • | • | • | • |
| Credential management | • | • | • | • |
| KV-aware endpoint picking | ○ | ○ | ○ | • |
| Observability & metering | • | • | • | • |
| Retry / fallback | • | • | • | • |

| Repo | github.com/kubernetes-sigs/wg-ai-gateway |
|---|---|
| Slack | #wg-ai-gateway (Kubernetes) |
| Meetings | Thursdays · 2 pm EST · weekly |
| Open for input | Payload processing · Retry semantics · Backend × multi-cluster |