Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Modelstarted in (seconds)paramSIZECPU Model Buffer sizeprompt eval rateeval ratetokens/s

deepseek-r1:70b



42 GB


llama3.3:70b



42 GB


Qwen3 32B

10.0432B20 GB19259.71 MiB5.63 tokens/s2.54 tokens/s

phi3:14b

3.5214B7.9 GB7530.58 MiB15.12 tokens/s6.05 tokens/s

openchat7b



4.1 GB


llama4:scout







gemma3:27b



17 GB


mistral-small3.1:24b



15 GB


...