Ollama

Ollama

Run LLMs locally with one command

General Infrastructure

What it's used for

Ollama is the simplest way to run large language models locally on your Mac, Linux, or Windows machine. It packages models with their runtime into a single binary, so you can run Llama, Gemma, Mistral, Phi, and dozens of other open models with a single terminal command.

  • One-command setupollama run llama3.2 downloads and starts the model
  • Local API server — OpenAI-compatible API at localhost:11434
  • Model library — 100+ models pre-configured and ready to run
  • GPU acceleration — automatically uses Apple Silicon, NVIDIA, or AMD GPUs
  • Modelfile — customize system prompts, parameters, and model behavior

Developers use Ollama for local development, privacy-sensitive applications, offline use, and as a cost-free alternative to API calls during prototyping.

Getting started

  1. Install Ollama:
    curl -fsSL https://ollama.com/install.sh | sh
    Or download from ollama.com/download for Mac/Windows.
  2. Run a model:
    ollama run llama3.2
  3. Use the API from your code:
    curl http://localhost:11434/api/generate -d '{"model": "llama3.2", "prompt": "Hello"}'

Ollama is completely free and open-source. No API keys, no accounts, no usage limits.

No case studies yet

Be the first to share a Ollama case study and get discovered by clients.

Submit a case study

Related tools in General

Need a Ollama expert?

Submit a brief and we'll match you with vetted specialists who have proven Ollama experience.

Submit a brief — it's free