
Ollama not working on Windows? Fix common issues

Most Ollama problems on Windows fall into four categories: CLI not found, service not running, port conflict, or GPU not detected. This guide covers all of them with exact commands to diagnose and fix each issue.

Start with the quick diagnostics below before diving into specific issues.

Run these first

cmd.exe — new window
# 1. Is Ollama installed and on PATH?
C:\> ollama --version
ollama version 0.6.5
# 2. Is the service running?
C:\> curl http://localhost:11434
Ollama is running
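# 3. Is GPU being used?
C:\> ollama run llama3 --verbose
# --verbose prints timing stats only. While the model is loaded,
# run "ollama ps" in a second window: PROCESSOR should show GPU, not CPU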
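
The first two checks can also be scripted. The following is a minimal sketch, assuming Python 3 is installed; the function names are illustrative, and it relies only on the ollama executable name and the http://localhost:11434 address used in this guide.

```python
import http.client
import shutil
import urllib.request

OLLAMA_URL = "http://localhost:11434"  # default address used in this guide


def cli_on_path(exe="ollama"):
    """Return the full path to the Ollama CLI, or None if it is not on PATH."""
    return shutil.which(exe)


def service_running(url=OLLAMA_URL, timeout=2.0):
    """Return True if the server answers with the 'Ollama is running' banner."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            body = resp.read().decode("utf-8", errors="replace")
    except (OSError, http.client.HTTPException):
        # Connection refused, timeout, or a non-HTTP service on the port
        return False
    return "Ollama is running" in body


if __name__ == "__main__":
    print("CLI on PATH:", cli_on_path() or "not found")
    print("Service running:", service_running())
```

A "not found" result points to the PATH issue below; a False result points to the service or port issues.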

Common issues and fixes

'ollama' is not recognized as an internal or external command

This means ollama.exe is not on your PATH, or the terminal was opened before the install finished and has not picked up the updated PATH. Fix:

cmd.exe — NEW window
# Step 1: open a brand new terminal window
C:\> ollama --version
ollama version 0.6.5

If it still fails, open Settings → Apps and confirm that Ollama is installed. If it is not, download and install it. For full PATH fix steps, see the Not recognized guide.

Port 11434 is already in use

Another process, possibly an earlier Ollama instance, is already listening on the port Ollama needs. Find it and stop it:

cmd.exe
# Find PID using port 11434:
C:\> netstat -ano | findstr 11434
TCP 0.0.0.0:11434 0.0.0.0:0 LISTENING 4821
# Kill the process (replace 4821 with actual PID):
C:\> taskkill /PID 4821 /F
# Then restart Ollama from Start menu

See the Port 11434 guide for more scenarios.
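
A plain findstr 11434 also matches rows where 11434 appears in the remote address or inside a PID. The sketch below, assuming Python 3 and the usual five-column TCP layout of netstat -ano, keeps only rows whose local address ends in the port and whose state is LISTENING. The function name is illustrative.

```python
def listeners_on_port(netstat_output, port=11434):
    """Return PIDs of processes LISTENING on `port`, parsed from `netstat -ano` text."""
    pids = []
    suffix = ":" + str(port)
    for line in netstat_output.splitlines():
        fields = line.split()
        # Expected TCP row: Proto, Local Address, Foreign Address, State, PID
        if len(fields) != 5 or fields[0] != "TCP":
            continue
        local, state, pid = fields[1], fields[3], fields[4]
        if local.endswith(suffix) and state == "LISTENING" and pid.isdigit():
            if int(pid) not in pids:
                pids.append(int(pid))
    return pids


sample = "TCP 0.0.0.0:11434 0.0.0.0:0 LISTENING 4821"
print(listeners_on_port(sample))  # [4821]
```

On a real machine, pass in the output of subprocess.run(["netstat", "-ano"], capture_output=True, text=True).stdout, then check what each PID is before stopping it.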

Model download is stuck or very slow

Ollama resumes interrupted downloads. If a download appears stuck, check free disk space and your internet connection. To cancel and retry:

cmd.exe
# Press Ctrl+C to cancel, then retry:
C:\> ollama pull llama3
pulling 8934d96d3f08... 47% ████░░░░ 2.2 GB
# It will resume from where it left off

Also check your antivirus: it may be scanning large model files as they are written, which slows the download. Add the .ollama\models folder to its exclusion list.
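
To rule out a full disk, the free space on the drive that holds the models folder can be checked with a short script. This is a sketch, assuming Python 3 and that models live in .ollama\models under your user profile unless the OLLAMA_MODELS environment variable points elsewhere; the helper names are illustrative.

```python
import os
import shutil
from pathlib import Path


def models_dir():
    """Return the model folder: OLLAMA_MODELS if set, else ~/.ollama/models."""
    override = os.environ.get("OLLAMA_MODELS")
    return Path(override) if override else Path.home() / ".ollama" / "models"


def free_gb(path):
    """Free space in GB on the drive holding `path` (nearest existing parent)."""
    path = Path(path).resolve()
    # Walk up until a folder that exists is found, so a missing folder still works
    while not path.exists() and path != path.parent:
        path = path.parent
    return shutil.disk_usage(path).free / 1e9


if __name__ == "__main__":
    folder = models_dir()
    print(f"{folder}: {free_gb(folder):.1f} GB free")
```

Compare the result with the download size shown by ollama pull before retrying.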

GPU not being used (gpu layers: 0)

Ollama is falling back to CPU. The most common causes on Windows are an outdated NVIDIA driver (needs 527+), too little GPU VRAM for the model, or an AMD GPU that Ollama's ROCm backend does not detect. See the full GPU Acceleration guide.
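
To see how much of a loaded model actually sits in VRAM, the local API can be queried while the model is running. The sketch below assumes Python 3 and that the /api/ps endpoint reports size and size_vram for each loaded model, as in recent Ollama releases; check the API documentation for your version. The function names are illustrative.

```python
import json
import urllib.request


def gpu_share(model):
    """Fraction of a loaded model held in VRAM, from one /api/ps entry."""
    size = model.get("size", 0)
    return model.get("size_vram", 0) / size if size else 0.0


def loaded_models(base_url="http://localhost:11434", timeout=5.0):
    """Fetch the list of currently loaded models from the local server."""
    with urllib.request.urlopen(base_url + "/api/ps", timeout=timeout) as resp:
        return json.load(resp).get("models", [])


if __name__ == "__main__":
    try:
        for m in loaded_models():
            print(f"{m.get('name')}: {gpu_share(m):.0%} in VRAM")
    except OSError as exc:
        print("Could not reach Ollama:", exc)
```

A share of 0% means the model runs entirely on CPU; a value between 0% and 100% suggests the model is only partly offloaded because VRAM is too small.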

Out of memory error

The model does not fit in available RAM or VRAM. Solutions: (1) Use a smaller model or a lower-bit quantization, for example ollama pull llama3:8b-instruct-q4_K_M. (2) Close other GPU-heavy applications. (3) On CPU, make sure at least 8 GB of RAM is free.
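
A back-of-the-envelope estimate helps decide whether a model can fit at all. The sketch below assumes Python 3 and the rough rule that the weights take parameters × bits per weight / 8 bytes. Real quantized files are somewhat larger, and the 1.5 GB overhead for context and runtime buffers is a placeholder assumption, not a measured value.

```python
def approx_weights_gb(params_billion, bits_per_weight):
    """Rough size of the weights alone: parameters x bits / 8, in GB."""
    return params_billion * bits_per_weight / 8


def fits(params_billion, bits_per_weight, free_gb, overhead_gb=1.5):
    """True if the estimate plus an assumed overhead fits in `free_gb`."""
    needed = approx_weights_gb(params_billion, bits_per_weight) + overhead_gb
    return needed <= free_gb


print(approx_weights_gb(8, 4))   # 4.0
print(fits(8, 4, free_gb=8))     # True
print(fits(8, 16, free_gb=8))    # False
```

Treat the result as a lower bound: if the estimate already exceeds free memory, pick a smaller model.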

Firewall or antivirus blocks Ollama

Ollama needs to bind to localhost:11434. When prompted by Windows Firewall, click Allow access for private networks. If a corporate policy blocks it, add an exception for ollama.exe in your antivirus/firewall settings.

ollama serve: listen tcp 0.0.0.0:11434: bind: address already in use

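Ollama is already running in the background (check the system tray), and two instances cannot share the port. On Windows the same failure may be reported as "bind: Only one usage of each socket address (protocol/network address/port) is normally permitted." Right-click the tray icon → Quit, then reopen Ollama, or simply keep using the running instance.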

Error: model not found

The model name is wrong or the model has not been downloaded yet. Check which models are installed with ollama list, pull the one you need with ollama pull modelname, and browse valid model names at the Models Hub.
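
A frequent cause is a tag mismatch: a name without a tag is treated as name:latest. The sketch below, assuming Python 3, checks a name against a parsed response from GET http://localhost:11434/api/tags, which lists installed models under a models key. The function names and the sample data are illustrative.

```python
def normalize(name):
    """Add the implicit ':latest' tag when a model name has no tag."""
    last_segment = name.rsplit("/", 1)[-1]
    return name if ":" in last_segment else name + ":latest"


def is_installed(name, tags_response):
    """Check a model name against a parsed /api/tags response."""
    installed = {m.get("name") for m in tags_response.get("models", [])}
    return normalize(name) in installed


# Made-up sample in the shape of an /api/tags response
sample = {"models": [{"name": "llama3:latest"}, {"name": "mistral:7b"}]}
print(is_installed("llama3", sample))   # True
print(is_installed("mistral", sample))  # False: only the 7b tag is present
```

In the second case, either run the model with its full tag or pull the untagged name.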

Clean up and manage models

cmd.exe
# List all installed models:
C:\> ollama list
# Remove a model:
C:\> ollama rm llama3
# Show running models:
C:\> ollama ps
# Stop a running model:
C:\> ollama stop llama3

Still stuck?

Check the not-recognized fix or port 11434 guide for specific error messages.
