Skip to content

Commit 4f16a96

Browse files
authored
Update envoy-ai-gateway docs and configs (#403)
1 parent fe74a6d commit 4f16a96

File tree

5 files changed

+15
-8
lines changed

5 files changed

+15
-8
lines changed

chart/templates/prometheus/serviceaccount.yaml

+1-1
Original file line numberDiff line numberDiff line change
@@ -38,5 +38,5 @@ roleRef:
3838
subjects:
3939
- kind: ServiceAccount
4040
name: {{ include "chart.fullname" . }}-prometheus
41-
namespace: llmaz-system
41+
namespace: {{ .Release.Namespace }}
4242
{{- end }}

chart/values.global.yaml

+3-1
Original file line numberDiff line numberDiff line change
@@ -42,7 +42,9 @@ open-webui:
4242
persistence:
4343
enabled: false
4444
enableOpenaiApi: true
45-
openaiBaseApiUrl: "https://api.openai.com/v1"
45+
# Assumes the OpenAI-compatible API is exposed via the Envoy AI Gateway. Please replace this value if you
46+
# deployed the envoy-ai-gateway example to a namespace other than "default".
47+
openaiBaseApiUrl: "http://envoy-default-default-envoy-ai-gateway-dbec795a.llmaz-system.svc.cluster.local/v1"
4648
extraEnvVars:
4749
- name: OPENAI_API_KEY
4850
value: "ChangeMe"

docs/examples/envoy-ai-gateway/basic.yaml

-4
Original file line numberDiff line numberDiff line change
@@ -59,7 +59,6 @@ apiVersion: gateway.networking.k8s.io/v1
5959
kind: Gateway
6060
metadata:
6161
name: default-envoy-ai-gateway
62-
namespace: default
6362
spec:
6463
gatewayClassName: default-envoy-ai-gateway
6564
listeners:
@@ -71,7 +70,6 @@ apiVersion: aigateway.envoyproxy.io/v1alpha1
7170
kind: AIGatewayRoute
7271
metadata:
7372
name: default-envoy-ai-gateway
74-
namespace: default
7573
spec:
7674
schema:
7775
name: OpenAI
@@ -99,7 +97,6 @@ apiVersion: aigateway.envoyproxy.io/v1alpha1
9997
kind: AIServiceBackend
10098
metadata:
10199
name: qwen2-0--5b
102-
namespace: default
103100
spec:
104101
timeouts:
105102
request: 3m
@@ -114,7 +111,6 @@ apiVersion: aigateway.envoyproxy.io/v1alpha1
114111
kind: AIServiceBackend
115112
metadata:
116113
name: qwen2--5-coder
117-
namespace: default
118114
spec:
119115
timeouts:
120116
request: 3m

site/content/en/docs/integrations/envoy-ai-gateway.md

+9-1
Original file line numberDiff line numberDiff line change
@@ -29,14 +29,22 @@ We'll deploy two models `Qwen/Qwen2-0.5B-Instruct-GGUF` and `Qwen/Qwen2.5-Coder-
2929

3030
The full example is [here](https://github.com/InftyAI/llmaz/blob/main/docs/examples/envoy-ai-gateway/basic.yaml), apply it.
3131

32+
```bash
33+
kubectl apply -f https://raw.githubusercontent.com/InftyAI/llmaz/refs/heads/main/docs/examples/envoy-ai-gateway/basic.yaml
34+
```
35+
3236
### 3. Check Envoy AI Gateway APIs
3337

3438
If Open-WebUI is enabled, you can chat via the webui (recommended), see [documentation](./open-webui.md). Otherwise, following the steps below to test the Envoy AI Gateway APIs.
3539

3640
I. Port-forwarding the `LoadBalancer` service in llmaz-system, like:
3741

3842
```bash
39-
kubectl port-forward svc/envoy-default-default-envoy-ai-gateway-dbec795a 8080:80
43+
kubectl -n llmaz-system port-forward \
44+
$(kubectl -n llmaz-system get svc \
45+
-l gateway.envoyproxy.io/owning-gateway-name=default-envoy-ai-gateway \
46+
-o name) \
47+
8080:80
4048
```
4149

4250
II. Query `http://localhost:8008/v1/models | jq .`, available models will be listed. Expected response will look like this:

site/content/en/docs/integrations/open-webui.md

+2-1
Original file line numberDiff line numberDiff line change
@@ -12,7 +12,7 @@ weight: 2
1212

1313
## How to use
1414

15-
If open-webui already installed, what you need to do is just update the OpenAI API endpoint in the admin settings. You can get the value from step2 & 3 below. Otherwise, following the steps here to install open-webui.
15+
If open-webui already installed, what you need to do is just update the OpenAI API endpoint in the admin settings if you deployed the [Basic AI Gateway Example](docs/envoy-ai-gateway.md) to a namespace other than "default". You can get the value from step2 & 3 below. Otherwise, following the steps here to install open-webui.
1616

1717
1. Enable Open WebUI in the `values.global.yaml` file, open-webui is enabled by default.
1818

@@ -35,6 +35,7 @@ If open-webui already installed, what you need to do is just update the OpenAI A
3535
```yaml
3636
open-webui:
3737
enabled: true
38+
# Please replace this value if you deployed the envoy-ai-gateway example to a namespace other than "default".
3839
openaiBaseApiUrl: http://envoy-default-default-envoy-ai-gateway-dbec795a.llmaz-system.svc.cluster.local/v1
3940
```
4041

0 commit comments

Comments
 (0)