Llama 3
Flagship general‑purpose model. Great balance of quality and speed.
Pick a model, copy the pull command, and start your local AI in seconds. Works with the official Ollama app on Windows.
Flagship general‑purpose model. Great balance of quality and speed.
Fast, lightweight models for everyday tasks and coding.
Strong reasoning and multilingual capabilities.
Compact, capable model from Google — efficient on smaller GPUs.
Small, instruction‑tuned model — great for edge and quick replies.
Explore additional community models and variants via the CLI.
Tip: Model size and speed vary by parameter count and quantization. For best performance, see GPU Acceleration.
Community‑driven guide. Not affiliated with the official Ollama project.