Download guide

Download Ollama for Windows — official installer guide

Run Llama 3, Mistral, Qwen 2.5 and other open-source LLMs locally on your Windows PC. Private, offline, no subscription. Official Windows installer with step-by-step setup.

Install guide →
Windows 10 / 11 100% offline after setup NVIDIA & AMD GPU Free & open source
Not affiliated with Ollama. Official site: ollama.com
This is an unofficial Windows guide site for Ollama. The official Ollama project is at ollama.com. Download links are partner links.

Run AI models locally on Windows — free and private

Ollama is a free, open-source tool that lets you run large language models (LLMs) like Llama 3, Mistral and Qwen 2.5 directly on your Windows PC. No cloud subscription, no API key, no data leaving your machine. After the initial model download, everything works completely offline.

On Windows, Ollama installs as a background service that exposes a local API at localhost:11434. You interact with it via the command line or through any compatible chat UI.

Get Ollama running on Windows in 4 steps

  • 1

    Download OllamaSetup.exe

    Use the download button above. The installer is ~5 MB and has no dependencies. Works on Windows 10 and 11 (64-bit). Verify the file is genuine before running.

  • 2

    Run the installer

    Double-click the .exe and follow the one-page wizard. Ollama installs in under a minute and starts a background service automatically. See the full install guide for tips on GPU detection and firewall prompts.

  • 3

    Pull your first model

    cmd.exe or PowerShell
    C:\> ollama pull llama3
    pulling manifest...
    pulling 8934d96d3f08... 100% ████████████ 4.7 GB
    success

    The first pull downloads the model weights (~4–8 GB depending on the model). After that it runs fully offline.

  • 4

    Start chatting

    cmd.exe
    C:\> ollama run llama3
    >>> Send a message (/? for help)
    >>> Hello! What can you do?
    I can help with writing, coding, analysis, Q&A...

Minimum requirements for Ollama on Windows

ComponentMinimumRecommended
OSWindows 10 64-bitWindows 11 64-bit
RAM8 GB16 GB or more
Disk10 GB free (for one model)50+ GB SSD
CPUAny x64 with AVX2Modern multi-core (Ryzen 5/i5+)
GPUOptionalNVIDIA (8+ GB VRAM) or AMD (DirectML)

Without a GPU, Ollama runs models on the CPU — functional but slower. A GPU with 8+ GB VRAM dramatically improves speed. See GPU Acceleration.

Frequently asked questions

Is Ollama free?
Yes. Ollama is free and open source (MIT license). The models it runs are also free to download and use. There is no subscription, API key or usage limit.
Does Ollama work offline?
Yes, completely. After downloading the installer and pulling a model, Ollama works with no internet connection. Prompts and responses never leave your machine. See Offline & Privacy guide.
Which models can I run on Ollama for Windows?
Ollama supports Llama 3, Mistral, Qwen 2.5, Gemma 2, Phi-4, DeepSeek, CodeLlama and many more. Browse the full list at Models Hub or at ollama.com/library.
Do I need a GPU to run Ollama?
No. Ollama runs on CPU only, but it is significantly slower without a GPU. For usable response speeds, a mid-range NVIDIA GPU with 8+ GB VRAM is recommended. AMD GPUs are supported via DirectML. See GPU Acceleration.
Is 'ollama' not recognized after installing?
This usually means the terminal was open before Ollama was installed, so PATH was not refreshed. Close all terminal windows and open a new one. If it still fails, see the not recognized fix guide.

Ready to install?

Official step-by-step install guide for Windows 10 and 11.

Install guide