Skip to content

Commit 7df7b7f

Browse files
authored
Update README.md
1 parent 010f4d2 commit 7df7b7f

File tree

1 file changed

+6
-5
lines changed

1 file changed

+6
-5
lines changed

README.md

Lines changed: 6 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -2,22 +2,23 @@
22

33
### tools.cpp quickstart
44
1. build from source:
5-
Mac user
5+
6+
- Mac user
67
```
78
make
89
```
910

10-
Nvidia-Cuda user:
11+
- Nvidia-Cuda user:
1112
```
1213
make LLAMA_CUDA=1
1314
```
1415

15-
2. Install helper package:
16+
2. Install a helper package that fixes some rare edgecases:
1617
```
1718
npm install jsonrepair
1819
```
1920

20-
3. Download a compatible gguf model:
21+
3. Download a compatible Rubra's gguf model:
2122
For example:
2223
```
2324
wget https://huggingface.co/sanjay920/Llama-3-8b-function-calling-alpha-v1.gguf/resolve/main/Llama-3-8b-function-calling-alpha-v1.gguf
@@ -28,7 +29,7 @@ wget https://huggingface.co/sanjay920/Llama-3-8b-function-calling-alpha-v1.gguf/
2829
./llama-server -ngl 35 -m Llama-3-8b-function-calling-alpha-v1.gguf --port 1234 --host 0.0.0.0 -c 16000 --chat-template llama3
2930
```
3031

31-
5. That's it! Make sure you turn `stream` off when making api calls to the server, as streaming feature is not supported yet.
32+
5. That's it! MAKE SURE you turn `stream` OFF when making api calls to the server, as the streaming feature is not supported yet. And we will support streaming too soon.
3233

3334
### Recent API changes
3435

0 commit comments

Comments
 (0)