FAQ

Ollama Windows FAQ — all common questions answered

Quick answers to the most common Ollama questions on Windows. Download, install, models, GPU acceleration, offline use, updates and troubleshooting — all in one place.

Install & download

How do I download and install Ollama on Windows?
Download OllamaSetup.exe from the download page, run the installer, and open a new terminal. Verify with ollama --version. Full guide: Install on Windows.
Is Ollama free?
Yes. Ollama is free and open source (MIT license). The models it runs are also free to download. There is no subscription, API key or usage limit.
Which Windows versions does Ollama support?
Ollama supports Windows 10 and Windows 11 (64-bit). It also runs on macOS and Linux.
What are the minimum system requirements?
8 GB RAM minimum, 64-bit Windows 10 or 11, and at least 10 GB free disk space for one model. A GPU is optional but strongly recommended for usable speed. See system requirements.
Does Ollama install a system service?
Yes. The installer registers Ollama as a user-level Windows service that starts automatically on login. You can see it in the system tray. To disable auto-start, right-click the tray icon and adjust startup settings, or remove it from Task Manager > Startup apps.

Models

Which models can I run with Ollama?
Ollama supports Llama 3, Mistral, Qwen 2.5, Gemma 2, Phi-4, DeepSeek, CodeLlama and hundreds more. Browse the full list at ollama.com/library or on the Models Hub.
How much disk space do models take?
Models range from ~1 GB (Gemma 2 2B) to 40+ GB (Llama 3 70B). A typical 7B model takes 4–5 GB. Models are stored in C:\Users\YourName\.ollama\models.
How do I remove a model to free disk space?
Run ollama rm modelname. For example: ollama rm llama3. To see all installed models: ollama list. The model files are deleted immediately.
Can I run multiple models at the same time?
Yes. Ollama can run multiple models simultaneously if you have enough VRAM or RAM. Use ollama ps to see running models and ollama stop modelname to unload one.
Where are Ollama models stored on Windows?
Models are stored in C:\Users\YourName\.ollama\models. You can move this folder by setting the OLLAMA_MODELS environment variable to a different path. See Models location guide.

GPU & performance

Do I need a GPU to run Ollama?
No, but response speed without a GPU is slow (5–15 tokens/s on CPU). A mid-range NVIDIA GPU with 8+ GB VRAM gives 40–80 tokens/s. AMD GPUs are supported via DirectML.
How do I check if Ollama is using my GPU?
Run ollama run modelname --verbose and type a prompt. Check the gpu layers value in the stats. If it is 0, GPU is not active. See GPU Acceleration guide.
NVIDIA CUDA vs AMD DirectML — which is faster?
NVIDIA CUDA is generally faster than AMD DirectML for the same price bracket. CUDA has more mature optimisations in Ollama. AMD DirectML works well and is improving with each Ollama release.
How can I make Ollama faster?
Top options: (1) Enable GPU acceleration. (2) Use a smaller quantization: ollama pull llama3:8b-instruct-q4_K_M. (3) Keep models on an NVMe SSD. (4) Close GPU-heavy apps to free VRAM. See Benchmarks.

Offline & privacy

Is Ollama completely offline after setup?
Yes. After downloading a model with ollama pull, Ollama runs with no internet connection. Prompts and responses never leave your machine. See Offline & Privacy guide.
Does Ollama collect my prompts or data?
Ollama does not send prompts, responses or personal data to any server. Inference runs entirely on your local hardware. For site analytics see our Privacy Policy.
Where is Ollama conversation history stored?
Ollama does not persist conversation history between sessions by default. Each session starts fresh. Within a session, context is held in RAM only. No conversation logs are written to disk.

Troubleshooting

'ollama' is not recognized as an internal or external command
Close all terminal windows and open a new one after installing Ollama. PATH changes require a fresh session. If it still fails, see Not recognized fix guide.
Ollama won't start or the service crashes
Check the Windows Event Viewer for errors, or look at Ollama logs. Common causes: port 11434 already in use, antivirus blocking the process, or corrupted install. See Troubleshooting guide.
How do I update Ollama to the latest version?
Quit Ollama from the system tray, download the new installer, and run it over the existing installation. Models are preserved. See Update guide.
How do I completely uninstall Ollama?
Uninstall via Settings > Apps, then manually delete C:\Users\YourName\.ollama to remove models and config. See Uninstall guide.

Ready to get started?

Download Ollama and run your first local LLM in minutes.

Download Ollama