...

Model                 Started in (s)  Params  Size    Prompt eval rate  Eval rate
deepseek-r1:70b       21.34           70B     42 GB   2.20 tokens/s     1.24 tokens/s
llama3.3:70b          21.34           70B     42 GB   2.39 tokens/s     1.23 tokens/s
Qwen3 32B             10.04           32B     20 GB   5.63 tokens/s     2.54 tokens/s
phi3:14b              3.52            14B     7.9 GB  15.12 tokens/s    6.05 tokens/s
openchat7b            10.04           7B      4.1 GB  -                 -
llama4:scout          13.55           -       67 GB   11.47 tokens/s    4.76 tokens/s
gemma3:27b            1.76            27B     17 GB   6.66 tokens/s     3.03 tokens/s
mistral-small3.1:24b  3/26            24B     15 GB   -                 -
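
To make the rates in the table above easier to compare, here is a minimal sketch that turns them into a rough end-to-end time for one request. It assumes that "started in" is the model load time in seconds, that the two rates stay constant for the whole request, and that a request has 100 prompt tokens and 300 generated tokens. The function name `estimate_seconds` and the token counts are illustrative assumptions, not part of the measurements.

```python
# Rough latency estimate from the measured rates (a sketch, not a benchmark).

def estimate_seconds(prompt_tokens, output_tokens, prompt_rate, eval_rate, load_seconds=0.0):
    """Return load time + prompt processing time + generation time, in seconds.

    prompt_rate and eval_rate are in tokens/s, as listed in the table.
    """
    return load_seconds + prompt_tokens / prompt_rate + output_tokens / eval_rate


# (started in, prompt eval rate, eval rate) copied from a few table rows.
rows = {
    "deepseek-r1:70b": (21.34, 2.20, 1.24),
    "phi3:14b": (3.52, 15.12, 6.05),
    "gemma3:27b": (1.76, 6.66, 3.03),
}

# Hypothetical request size: 100 prompt tokens, 300 generated tokens.
for name, (load, prompt_rate, eval_rate) in rows.items():
    total = estimate_seconds(100, 300, prompt_rate, eval_rate, load)
    print(f"{name}: about {total:.1f} s")
```

Under these assumptions, generation time dominates for the 70B models, since their eval rate is the smallest number in each row.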

llama.cpp

https://github.com/ggml-org/llama.cpp

...