You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@@ -22,10 +36,16 @@ RUN curl https://packages.cloud.google.com/apt/doc/apt-key.gpg | apt-key --keyri
22
36
RUN echo "deb [signed-by=/usr/share/keyrings/cloud.google.gpg] https://packages.cloud.google.com/apt cloud-sdk main" | tee -a /etc/apt/sources.list.d/google-cloud-sdk.list
23
37
RUN apt -y update && apt install -y google-cloud-cli
24
38
39
+
RUN git clone https://github.com/AI-Hypercomputer/maxtext.git && \
m) MODEL_PATH: (str) Model path, varies depending on inference server and location of base checkpoint
40
-
n) MODEL_NAME: (str) Model name, ex. llama-2, llama-3, gemma
41
-
h) HUGGINGFACE: (bool) Checkpoint is from HuggingFace.
42
-
q) QUANTIZE_WEIGHTS: (str) Whether to quantize weights
43
-
t) QUANTIZE_TYPE: (str) Quantization type, QUANTIZE_WEIGHTS must be set to true. Availabe quantize type: {"int8", "int4"} x {"per_channel", "blockwise"},
44
-
v) VERSION: (str) Version of inference server to override, ex. jetstream-v0.2.2, jetstream-v0.2.3
45
-
i) INPUT_DIRECTORY: (str) Input checkpoint directory, likely a GSBucket path
46
-
o) OUTPUT_DIRECTORY: (str) Output checkpoint directory, likely a GSBucket path
31
+
- -s, --inference_server: [string] The name of the inference server that serves your model.
32
+
- -m, --model_path: [string] The model path.
33
+
- -n, --model_name: [string] The model name, Model name, ex. llama-2, llama-3, gemma.
34
+
- -q, --quantize_weights: [bool] The checkpoint is to be quantized. (Optional) (default=False)
35
+
- -t, --quantize_type: [string] The type of quantization. Availabe quantize type: {"int8", "int4"} x {"per_channel", "blockwise"}. (Optional) (default=int8_per_channel)
36
+
- -v, --version: [string] The version of repository to override, ex. jetstream-v0.2.2, jetstream-v0.2.3. (Optional) (default=main)
37
+
- -i, --input_directory: [string] The input directory, likely a GSBucket path. (Optional)
38
+
- -o, --output_directory: [string] The output directory, likely a GSBucket path.
39
+
- -h, --huggingface: [bool] The model is from Hugging Face. (Optional) (default=False)
0 commit comments