FAQ (Extended)
Clear, concise answers to the most common questions about Ollama on Windows.
How do I install Ollama on Windows?
Download the official installer, run it, then open a new terminal and verify the CLI:
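For example, in a new PowerShell or Command Prompt window after the installer finishes:

```shell
# Confirm the CLI is on PATH and print the installed version
ollama --version
```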
See the Install Guide for the full step‑by‑step.
How do I download and run a model?
Use the CLI:
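A typical first session (using `llama3` as an example model name):

```shell
# Download the model weights (one-time, requires internet)
ollama pull llama3

# Start an interactive chat session with it
ollama run llama3
```

Type your prompt at the `>>>` prompt; use `/bye` to exit the session.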
Browse more in the Models Hub.
Can I run Ollama completely offline?
Yes. After models are pulled, generation works offline. Only downloads require the internet. See Privacy & Offline.
How do I enable GPU acceleration (CUDA/DirectML)?
Install the latest GPU drivers. On NVIDIA, CUDA is used automatically. On Windows 11, DirectML is available across vendors. See GPU Acceleration.
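To confirm whether a loaded model is actually using the GPU, `ollama ps` reports the processor in use (a sketch, assuming `llama3` is already pulled):

```shell
# Run a single prompt so the model loads, then inspect the running model
ollama run llama3 "Say hello"
ollama ps
# The PROCESSOR column shows e.g. "100% GPU" when acceleration is active
```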
The 'ollama' command is not recognized
Close and reopen your terminal to refresh PATH. If needed, sign out and back in or reinstall. Verify with:
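In a fresh terminal:

```shell
# Show where the executable resolves from, then run it
where.exe ollama
ollama --version
```

If `where.exe` finds nothing, PATH was not updated; signing out and back in (or reinstalling) usually fixes it.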
http://localhost:11434 does not respond
Check port usage and firewall settings:
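For example, from PowerShell:

```shell
# Is anything listening on Ollama's default port?
netstat -ano | findstr :11434

# Does the server answer locally? A healthy server replies "Ollama is running"
curl.exe http://localhost:11434
```

`curl.exe` is used explicitly because plain `curl` is a PowerShell alias for `Invoke-WebRequest`.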
Allow local connections when prompted by Windows Firewall. See Troubleshooting.
Model downloads are slow or stuck
Check your connection and free disk space, pause antivirus scans, and retry later. To refresh a model, remove and re‑pull it:
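Using `llama3` as an example:

```shell
# Remove the local copy, then download it again
ollama rm llama3
ollama pull llama3
```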
Which model should I start with?
Try Llama 3 for a balanced start, Mistral for speed, Qwen 2.5 for multilingual, or Phi‑4 / Gemma 2 for lightweight use. See Models Hub.
How much RAM/VRAM do I need?
8 GB of RAM runs smaller models; 16 GB or more is recommended for 8B models. For GPU acceleration, 8 GB+ of VRAM helps. Use lower quantizations (e.g., Q4) on limited hardware.
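Quantized variants are published as tags on each model's page; the exact tag names below are illustrative and vary by model, so check the Models Hub listing first:

```shell
# Pull a 4-bit quantized variant instead of the default tag
# (tag name is an example; see the model's page for available tags)
ollama pull llama3:8b-instruct-q4_K_M
```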
How do I use Ollama from Python?
Install the client and call the API:
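A minimal sketch using the official `ollama` Python client (assumes the Ollama server is running locally and `llama3` has already been pulled):

```python
# pip install ollama
import ollama

# Send a single chat message to the local server (http://localhost:11434)
response = ollama.chat(
    model="llama3",
    messages=[{"role": "user", "content": "Why is the sky blue?"}],
)
print(response["message"]["content"])
```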
See the Python page for REST and best practices.
How do I update Ollama and models?
Re‑run the latest installer to update the app. Re‑pull models to get the latest defaults:
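For example:

```shell
# Re-pull to fetch the latest version of a model's default tag
ollama pull llama3
```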
More on Update.
How do I uninstall and remove models?
Uninstall via Windows Settings, stop the service if needed, then delete the models folder in your user profile to free disk space. See Uninstall.
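To remove individual models before (or instead of) uninstalling:

```shell
# List installed models, then remove the ones you no longer need
ollama list
ollama rm llama3

# Remaining model data is stored under your profile; delete it to reclaim disk:
# %USERPROFILE%\.ollama\models
```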
Is my data sent to the cloud?
No. Prompts and outputs are processed locally. Only model downloads require internet access. See Privacy & Offline.
How do I benchmark performance fairly?
Warm up once, use a fixed prompt, set temperature=0, and average several runs. Track tokens/s and latency. See Benchmarks.
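One way to capture tokens/s without extra tooling, assuming `llama3` is pulled:

```shell
# --verbose prints timing stats after the response, including eval rate (tokens/s)
ollama run llama3 --verbose "Summarize the plot of Hamlet in one paragraph."
```

Run it once to warm the model, then average the eval rate over several subsequent runs; in an interactive session, `/set parameter temperature 0` makes runs deterministic.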
How do I fix out‑of‑memory or crashes?
Use smaller models or lower quantizations, close GPU‑heavy apps, ensure drivers are current, and keep models on SSD. See Troubleshooting.
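Lowering the context window also reduces memory use; inside an interactive session:

```shell
ollama run llama3
# At the >>> prompt, shrink the context window before sending long prompts:
# >>> /set parameter num_ctx 2048
```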
Community‑driven guide. Not affiliated with the official Ollama project.