...
| Model | started in (seconds) | param | SIZE | CPU Model Buffer size | prompt eval rate | eval ratetokens/s |
|---|---|---|---|---|---|---|
deepseek-r1:70b | 42 GB | |||||
llama3.3:70b | 42 GB | |||||
Qwen3 32B | 10.04 | 32B | 20 GB | 19259.71 MiB | 5.63 tokens/s | 2.54 tokens/s |
phi3:14b | 3.52 | 14B | 7.9 GB | 7530.58 MiB | 15.12 tokens/s | 6.05 tokens/s |
openchat7b | 4.1 GB | |||||
llama4:scout | ||||||
gemma3:27b | 17 GB | |||||
mistral-small3.1:24b | 15 GB |
...