You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: tutorials-and-examples/skypilot/dws-and-kueue/README.md
+12-12
Original file line number
Diff line number
Diff line change
@@ -82,7 +82,7 @@ Server Version: v1.30.6-gke.1596000
82
82
```
83
83
If not you can change the version in terraform with the `kubectl_version` variable
84
84
## Install and configure Kueue
85
-
1. Install Kueue from the official manifest. Note that --server-side switch . Without it the client cannot render the CRDs because of annotation size limitations.
85
+
1. Install Kueue from the official manifest. Note that `--server-side` switch . Without it the client cannot render the CRDs because of annotation size limitations.
@@ -182,7 +182,7 @@ Note: The following clouds were disabled because they were not included in allow
182
182
✔ Kubernetes
183
183
```
184
184
## Configure and Run SkyPilot Job
185
-
For SkyPilot to create pods with the necessary pod config we need to add the following config to train_dws.yaml.
185
+
For SkyPilot to create pods with the necessary pod config we need to add the following config to `train_dws.yaml`.
186
186
```yaml
187
187
experimental:
188
188
config_overrides:
@@ -268,10 +268,10 @@ This section details how to fine-tune Gemma 2B for SQL generation on GKE Autopil
268
268
- A GKE cluster configured with SkyPilot
269
269
- HuggingFace account with access to Gemma model
270
270
271
-
###Fine-tuning Implementation
271
+
###Fine-tuning Implementation
272
272
The [finetune.py](https://github.com/GoogleCloudPlatform/ai-on-gke/blob/skypilot_dws_kueue/tutorials-and-examples/skypilot/dws-and-kueue/finetune.yaml) script uses QLoRA with 4-bit quantization to fine-tune Gemma 2B on SQL generation tasks.
273
273
274
-
###Configure GCS Storage Access
274
+
###Configure GCS Storage Access
275
275
The infrastructure Terraform configuration in [main.tf](https://github.com/GoogleCloudPlatform/ai-on-gke/blob/skypilot_dws_kueue/tutorials-and-examples/skypilot/dws-and-kueue/main.tf) includes Workload Identity and GCS bucket setup:
5. Update gcsfuse configuration in finetune.yaml and sever.yaml
314
+
5. Update gcsfuse configuration in `finetune.yaml` and `sever.yaml`
315
315
Replace the [BUCKET_NAME](https://github.com/GoogleCloudPlatform/ai-on-gke/blob/skypilot_dws_kueue/tutorials-and-examples/skypilot/dws-and-kueue/finetune.yaml#L27)
316
316
317
-
###Fine-tune the Model
317
+
###Fine-tune the Model
318
318
1. Set up HuggingFace access:
319
319
Finetune script needs a HuggingFace token and to sign the licence consent agreement. Follow instructions on the following link: Get access to the [model](https://cloud.google.com/kubernetes-engine/docs/tutorials/serve-gemma-gpu-vllm#model-access)
0 commit comments