LocalAI is a free, open-source alternative to OpenAI, Anthropic, and similar services, functioning as a drop-in replacement REST API for local inferencing. It allows you to run LLMs, generate images, and produce audio, all locally or on-premises on consumer-grade hardware, and supports multiple model families and architectures.

Quickstart

Using the Bash Installer

  curl https://localai.io/install.sh | sh
  

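Once installed, you can start the API server directly. A minimal sketch, assuming the installer placed the local-ai binary on your PATH (the server listens on port 8080 by default):

  # Start the LocalAI API server
  local-ai run
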
Run with Docker:

  # CPU only image:
  docker run -ti --name local-ai -p 8080:8080 localai/localai:latest-cpu

  # Nvidia GPU:
  docker run -ti --name local-ai -p 8080:8080 --gpus all localai/localai:latest-gpu-nvidia-cuda-12

  # CPU and GPU image (bigger size):
  docker run -ti --name local-ai -p 8080:8080 localai/localai:latest

  # AIO images (these pre-download a set of models ready for use, see https://localai.io/basics/container/)
  docker run -ti --name local-ai -p 8080:8080 localai/localai:latest-aio-cpu
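Once the container is up, you can check that the OpenAI-compatible API is reachable. A quick sanity check, assuming the default port mapping used above:

  # List the models the server currently knows about
  curl http://localhost:8080/v1/models
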

Load models:

  # From the model gallery (see available models with `local-ai models list`, in the WebUI from the model tab, or visiting https://models.localai.io)
  local-ai run llama-3.2-1b-instruct:q4_k_m
  # Start LocalAI with the phi-2 model directly from huggingface
  local-ai run huggingface://TheBloke/phi-2-GGUF/phi-2.Q8_0.gguf
  # Install and run a model from the Ollama OCI registry
  local-ai run ollama://gemma:2b
  # Run a model from a configuration file
  local-ai run https://gist.githubusercontent.com/.../phi-2.yaml
  # Install and run a model from a standard OCI registry (e.g., Docker Hub)
  local-ai run oci://localai/phi-2:latest
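Once a model is loaded, it can be queried through the OpenAI-compatible REST API. A minimal sketch, assuming the server is running on port 8080; the model name below is an example, so adjust it to one reported by the /v1/models endpoint:

  # Example chat request (set "model" to a name listed by /v1/models)
  curl http://localhost:8080/v1/chat/completions \
    -H "Content-Type: application/json" \
    -d '{
      "model": "llama-3.2-1b-instruct",
      "messages": [{"role": "user", "content": "How are you doing?"}]
    }'
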

For a full list of options, refer to the Installer Options documentation.

Binaries can also be manually downloaded.
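As a rough sketch, a manual install on Linux might look like the following; the release asset name here is illustrative, so check the GitHub releases page (https://github.com/mudler/LocalAI/releases) for the file matching your platform:

  # Download a release binary (asset name is an example, verify it on the releases page)
  curl -Lo local-ai https://github.com/mudler/LocalAI/releases/latest/download/local-ai-Linux-x86_64
  chmod +x local-ai
  ./local-ai run
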

Using Homebrew on macOS

You can install LocalAI with Homebrew using the following command:

  brew install localai
  

Using Container Images or Kubernetes

LocalAI is available as a container image compatible with various container engines such as Docker, Podman, and Kubernetes. Container images are published on quay.io and Docker Hub.

For detailed instructions, see Using container images. For Kubernetes deployment, see Run with Kubernetes.
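As a sketch, one common route is the community Helm chart; the chart repository and release name below are assumptions, so defer to the Run with Kubernetes page for the authoritative steps:

  # Add the Helm chart repository and deploy LocalAI
  helm repo add go-skynet https://go-skynet.github.io/helm-charts/
  helm repo update
  helm install local-ai go-skynet/local-ai
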

Running LocalAI with All-in-One (AIO) Images

Already have a model file? Skip to Run models manually.

LocalAI’s All-in-One (AIO) images are pre-configured with a set of models and backends to fully leverage almost all the features of LocalAI. If pre-configured models are not required, you can use the standard images.

These images are available for both CPU and GPU environments. AIO images are designed for ease of use and require no additional configuration.

AIO images are recommended if you prefer not to configure models manually or via the web interface. For running specific models, refer to the manual method.

The AIO images come pre-configured with the following features:

  • Text to Speech (TTS)
  • Speech to Text
  • Function calling
  • Large Language Models (LLM) for text generation
  • Image generation
  • Embedding server

For instructions on using AIO images, see Using container images.
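Because AIO images ship with pre-configured models, the endpoints can be exercised right away. A minimal sketch, assuming the AIO container from the quickstart is running and maps OpenAI-style model names (query /v1/models for the exact list):

  # Request embeddings from the AIO image's pre-configured embedding model
  curl http://localhost:8080/v1/embeddings \
    -H "Content-Type: application/json" \
    -d '{
      "model": "text-embedding-ada-002",
      "input": "LocalAI runs models on your own hardware."
    }'
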

What’s Next?

There is much more to explore with LocalAI! You can run any model from Hugging Face, generate video, and clone voices. For a comprehensive overview, check out the features section.

Explore additional resources and community contributions.
