Skip to content

Commit 28eae64

Browse files
authored
Update ollama.md
1 parent 424ef2b commit 28eae64

File tree

1 file changed

+28
-1
lines changed

1 file changed

+28
-1
lines changed

Cheat Sheets/DevOps Linux/ollama.md

Lines changed: 28 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,34 @@
44
ollama pull <model>
55
ollama run <model>
66

7-
## Simple usage with curl
7+
## Checking HW
8+
9+
Find out the amount of VRAM
10+
11+
lspci -v | grep -A 20 VGA
12+
13+
In the output watch for `Memory at`:
14+
15+
0000:00:02.0 VGA compatible controller: Intel Corporation TigerLake-LP GT2 [Iris Xe Graphics] (rev 01) (prog-if 00 [VGA controller])
16+
[...]
17+
Memory at 6078000000 (64-bit, non-prefetchable) [size=16M]
18+
Memory at 4000000000 (64-bit, prefetchable) [size=256M] <<<--- 256 MB VRAM not really good :-)
19+
[...]
20+
21+
Check for AVX 512 support
22+
23+
cat /proc/cpuinfo | grep avx512
24+
25+
General performance to decide which models to run locally
26+
27+
| GPU | RAM | VRAM | Models / Quantization |
28+
|-------------|-------|-------------|-------------------------------------|
29+
| no | 16GB | <8 GB | smallest 3B models only, 4-bit GGUF |
30+
| yes | 32GB | 8 GB | up to 7B models, q2-3 |
31+
| yes | 128GB | 16 GB | up to 13B models, GGUF/EXL2 |
32+
| yes | 128GB | 24 GB | up to 30B models |
33+
34+
## Simple API usage with curl
835

936
Get available model names with
1037

0 commit comments

Comments
 (0)