Local Agent: Fast AI Backend with Docker Model Runner

A flexible, extensible AI agent backend built with NestJS—designed for running local, open-source LLMs (Llama, Gemma, Qwen, DeepSeek, etc.) via Docker Model Runner. Real-time streaming, Redis messaging, web search, and Postgres memory out of the box. No cloud APIs required!


🚀 Quick Start

  1. Clone the repository
    git clone <your-repo-url>
    cd <your-repo-folder>
  2. Copy and edit environment variables
    cp .env.example .env
    # Edit .env and fill in your model and service config
  3. Start required services (Redis, PostgreSQL, Local LLM) with Docker Compose
    docker compose up -d
    • PostgreSQL: localhost:5433
    • Redis: localhost:6379
    • Local LLM runner: localhost:12434 (Model Runner guide; a model pull example follows these steps)
  4. Install dependencies
    pnpm install
  5. Start the development server
    pnpm run start:dev
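
If the model referenced by MODEL_NAME is not available locally yet, Docker Model Runner can pull it ahead of step 3. A minimal sketch, assuming Docker Desktop's Model Runner CLI (docker model) is installed:

    # Pull the model named in MODEL_NAME (ai/gemma3 matches the example in this README)
    docker model pull ai/gemma3
    # Confirm the model is available locally
    docker model list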

🛠️ Environment Variables

See .env.example for all options; a sample .env sketch follows the list. Key variables:

  • MODEL_BASE_URL — e.g. http://localhost:12434/engines/llama.cpp/v1
  • MODEL_NAME — e.g. ai/gemma3:latest, llama-3, qwen, deepseek
  • TAVILY_API_KEY — for web search (Get your key)
  • REDIS_HOST, REDIS_PORT, etc.
  • POSTGRES_* — for memory
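
A sample .env sketch (values are illustrative; .env.example is the authoritative reference):

    MODEL_BASE_URL=http://localhost:12434/engines/llama.cpp/v1
    MODEL_NAME=ai/gemma3:latest
    TAVILY_API_KEY=<your-tavily-key>
    REDIS_HOST=localhost
    REDIS_PORT=6379
    # POSTGRES_* connection settings (see .env.example for the exact variable names)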

✨ Features

  • 🤖 Local, open-source LLMs (Llama, Gemma, Qwen, DeepSeek, etc.)
  • 🌊 Real-time streaming responses
  • 💾 Conversation history with Postgres memory
  • 🌐 Web search integration (Tavily)
  • 🧵 Custom ThreadService for conversations
  • 📡 Redis pub/sub for real-time messaging
  • 🎯 Clean, maintainable architecture

🧩 Model Setup (Docker Model Runner)

  • This project is designed for local LLMs only, using Docker Model Runner.
  • Supported models: Llama, Gemma, Qwen, DeepSeek, and other open-source models.
  • Set MODEL_BASE_URL and MODEL_NAME in your .env (a connection sketch follows this list).
  • Start the ai_runner service with Docker Compose.
  • For other providers, see Agent Initializr.
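
Docker Model Runner exposes an OpenAI-compatible HTTP API, which is why MODEL_BASE_URL points at an /engines/llama.cpp/v1 path. The sketch below shows how such an endpoint can be reached with LangChain's ChatOpenAI client; it is illustrative only, and the project's own wiring (AgentFactory with ModelProvider.LOCAL) may differ:

import { ChatOpenAI } from '@langchain/openai';

// MODEL_BASE_URL / MODEL_NAME come from .env; the fallbacks mirror this README.
// The local runner does not check API keys, but the client still expects one.
const model = new ChatOpenAI({
  model: process.env.MODEL_NAME ?? 'ai/gemma3:latest',
  apiKey: 'not-needed',
  configuration: {
    baseURL: process.env.MODEL_BASE_URL ?? 'http://localhost:12434/engines/llama.cpp/v1',
  },
});

const reply = await model.invoke('Say hello from a local model.');
console.log(reply.content);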

🔌 Web Search (Tavily)

  • Set TAVILY_API_KEY in .env
  • Example usage in code (TavilySearch is typically imported from @langchain/tavily; AgentFactory, ModelProvider, and postgresCheckpointer come from this project's agent module):
    import { TavilySearch } from '@langchain/tavily';

    // Register Tavily web search as a tool when creating the local agent
    AgentFactory.createAgent(
      ModelProvider.LOCAL,
      [new TavilySearch({ maxResults: 5, topic: 'general' })],
      postgresCheckpointer,
    );

🗄️ Project Structure

src/
├── agent/       # AI agent implementation
├── api/         # HTTP endpoints and DTOs
└── messaging/   # Redis messaging service

🛣️ API Endpoints

  • POST /api/agent/chat — Send a message to the agent (example calls follow this list)
  • GET /api/agent/stream — Stream agent responses (SSE)
  • GET /api/agent/history/:threadId — Get conversation history
  • GET /api/agent/threads — List all threads
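
The exact request and response shapes are defined by the DTOs under src/api. The hypothetical example below assumes field names such as message and threadId, a threadId query parameter, and the default NestJS port 3000; none of these are confirmed by this README:

// Hypothetical chat request; check the DTOs in src/api for the real shape.
const res = await fetch('http://localhost:3000/api/agent/chat', {
  method: 'POST',
  headers: { 'Content-Type': 'application/json' },
  body: JSON.stringify({ threadId: 'demo-thread', message: 'Hello!' }),
});
console.log(await res.json());

// Streamed replies arrive over SSE; in a browser an EventSource can consume them.
const events = new EventSource('http://localhost:3000/api/agent/stream?threadId=demo-thread');
events.onmessage = (event) => console.log(event.data);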

💬 Chat UI

For a ready-to-use frontend, use agentailor-chat-ui, which is fully compatible with this backend.


📝 Required: Postgres Saver Checkpointer

This project uses Postgres for memory. You must initialize the checkpointer before chatting:

// In agentService
async stream(message: SseMessageDto): Promise<Observable<SseMessage>> {
  const channel = `agent-stream:${message.threadId}`;
  // Initialize the Postgres checkpointer (only needs to run once)
  this.agent.initCheckpointer();
  // ...rest of code
}
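
If you need to construct the Postgres checkpointer yourself, here is a minimal sketch using LangGraph's @langchain/langgraph-checkpoint-postgres package (assuming that is the saver this project wraps; the connection string is a placeholder matching the Compose Postgres on localhost:5433):

import { PostgresSaver } from '@langchain/langgraph-checkpoint-postgres';

// Placeholder credentials and database name: substitute your POSTGRES_* values.
const postgresCheckpointer = PostgresSaver.fromConnString(
  'postgresql://postgres:postgres@localhost:5433/agent',
);

// setup() creates the checkpoint tables; it only needs to run once per database,
// which matches the "initialize before chatting" requirement above.
await postgresCheckpointer.setup();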

ℹ️ Note

  • This project is intentionally opinionated: it targets local, open-source LLMs only.

📚 Further Information

For more details and project resources, visit Agent Initializr.


📄 License

MIT License
