Update README.md

tybalex · web-flow · commit 7df7b7fa74fe · 2024-06-27T13:31:22.000-07:00
diff --git a/README.md b/README.md
@@ -2,22 +2,23 @@
 
 ### tools.cpp quickstart
 1. build from source:
-Mac user
+   
+- Mac user
 ```
 make
 ```
 
-Nvidia-Cuda user:
+- Nvidia-Cuda user:
 ```
 make LLAMA_CUDA=1
 ```
 
-2. Install helper package:
+2. Install a helper package that fixes some rare edgecases:
 ```
 npm install jsonrepair
 ```
 
-3. Download a compatible gguf model:
+3. Download a compatible Rubra's gguf model:
 For example:
 ```
 wget https://huggingface.co/sanjay920/Llama-3-8b-function-calling-alpha-v1.gguf/resolve/main/Llama-3-8b-function-calling-alpha-v1.gguf
@@ -28,7 +29,7 @@ wget https://huggingface.co/sanjay920/Llama-3-8b-function-calling-alpha-v1.gguf/
 ./llama-server -ngl 35 -m Llama-3-8b-function-calling-alpha-v1.gguf   --port 1234 --host 0.0.0.0  -c 16000 --chat-template llama3
 ```
 
-5. That's it! Make sure you turn `stream` off when making api calls to the server, as streaming feature is not supported yet.
+5. That's it! MAKE SURE you turn `stream` OFF when making api calls to the server, as the streaming feature is not supported yet. And we will support streaming too soon.
 
 ### Recent API changes