LLM-Sentinel

Privacy-first proxy that automatically detects and masks sensitive data before it reaches AI models without compromising latency.

Installation

NPM (Recommended)

npm install -g llm-sentinel

llmsentinel help

Docker

docker pull raaihank/llm-sentinel:latest
docker run -p 5050:5050 raaihank/llm-sentinel:latest

Quick Start

Once running, replace your AI API base URLs:

  • OpenAI: http://localhost:5050/openai/v1 (instead of https://api.openai.com/v1)
  • Ollama: http://localhost:5050/ollama (instead of http://localhost:11434)

Usage Examples

OpenAI SDK (Python)

import openai

client = openai.OpenAI(
    api_key="sk-your-key-here",  
    base_url="http://localhost:5050/openai/v1"  # ← Add this line
)

# Your sensitive data is automatically protected
response = client.chat.completions.create(
    model="gpt-3.5-turbo",
    messages=[{
        "role": "user", 
        "content": "My API key is sk-abc123 and config at /Users/john/secrets"
    }]
)
# LLM receives: "My API key is [OPENAI_API_KEY_MASKED] and config at /Users/[USERNAME]/secrets"

OpenAI SDK (JavaScript)

import OpenAI from 'openai';

const openai = new OpenAI({
  apiKey: 'sk-your-key-here',
  baseURL: 'http://localhost:5050/openai/v1'  // ← Add this line
});

const response = await openai.chat.completions.create({
  model: 'gpt-3.5-turbo',
  messages: [{
    role: 'user',
    content: 'My AWS key is AKIAIOSFODNN7EXAMPLE and email user@company.com'
  }]
});
// LLM receives: "My AWS key is [AWS_ACCESS_KEY_MASKED] and email [EMAIL_MASKED]"

Streaming Support: Streaming requests are supported automatically - just add stream: true and LLM-Sentinel will mask sensitive data in real time.
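For example, with the Python client defined above, a minimal streaming call might look like this (model name and prompt are placeholders):

stream = client.chat.completions.create(
    model="gpt-3.5-turbo",
    messages=[{"role": "user", "content": "Contact me at user@company.com"}],
    stream=True  # streaming enabled; sensitive data in the request is still masked
)
for chunk in stream:
    delta = chunk.choices[0].delta.content
    if delta:
        print(delta, end="")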

Ollama SDK

import ollama

client = ollama.Client(host='http://localhost:5050/ollama')  # ← Change this line

response = client.chat(
    model='llama2',
    messages=[{
        'role': 'user', 
        'content': 'My credit card is 4532-1234-5678-9012'
    }]
)
# LLM receives: "My credit card is [CREDIT_CARD_MASKED]"

Streaming Support: Ollama streaming requests work seamlessly - pass stream=True in your client calls and sensitive data will be masked in real time.
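For example, with the Python client above (prompt is a placeholder), a minimal streaming sketch:

for part in client.chat(
    model='llama2',
    messages=[{'role': 'user', 'content': 'My email is user@company.com'}],
    stream=True  # stream response chunks; the request is masked as usual
):
    print(part['message']['content'], end='')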

cURL Examples

OpenAI:

curl -X POST http://localhost:5050/openai/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-your-key" \
  -d '{
    "model": "gpt-3.5-turbo",
    "messages": [{"role": "user", "content": "My SSH key is ssh-rsa AAAAB3..."}]
  }'

Ollama:

curl -X POST http://localhost:5050/ollama/api/generate \
  -H "Content-Type: application/json" \
  -d '{
    "model": "llama2",
    "prompt": "My database URL is postgresql://user:pass@localhost/db"
  }'

What Gets Protected (52 Detectors)

  • πŸ€– AI/ML Services: OpenAI, Claude, Google AI, Azure OpenAI, Cohere, HuggingFace
  • ☁️ Cloud: AWS keys, Azure subscriptions, GCP credentials, Heroku, Cloudflare
  • πŸ”§ Development: GitHub tokens, NPM tokens, PyPI, Docker Hub
  • πŸ’¬ Services: Slack, Discord, Twilio, SendGrid, Mailgun, Stripe
  • πŸ—„οΈ Databases: PostgreSQL, MongoDB, MySQL, Redis, Elasticsearch URLs
  • πŸ” Security: SSH keys, JWT tokens, Kubernetes tokens, PGP keys
  • πŸ‘€ Personal: Emails, phone numbers, credit cards, SSNs, IP addresses

Commands

Server Management

llmsentinel start [-p 5050] [-d]   # Start server (daemon with -d)
llmsentinel status                 # Check if running
llmsentinel stop                   # Stop daemon
llmsentinel restart                # Restart server
llmsentinel logs [-n 50]           # View logs

Configuration

llmsentinel info                   # Show protection status
llmsentinel port 8080              # Change server port
llmsentinel rules                  # List all detectors
llmsentinel protect                # Enable all protection (default)
llmsentinel no-protect             # ⚠️ Disable all protection

Fine-tuned Control

llmsentinel rules:disable email    # Disable email detection
llmsentinel rules:enable openaiApiKey # Enable OpenAI key detection
llmsentinel debug                  # Show detected entity types in logs
llmsentinel no-debug               # Hide sensitive details (default)
llmsentinel notifications          # Toggle desktop alerts

Docker Usage

Basic Run

docker run -p 5050:5050 raaihank/llm-sentinel:latest

With Custom Configuration (Optional)

# Copy sample config (optional)
cp config.sample.json config.json
nano config.json

# Run with custom config
docker run -d \
  --name llm-sentinel \
  -p 5050:5050 \
  -v $(pwd)/config.json:/app/.llm-sentinel/config.json \
  -v $(pwd)/logs:/app/logs \
  raaihank/llm-sentinel:latest

Docker Commands

docker logs llm-sentinel                    # View logs
docker exec -it llm-sentinel llmsentinel   # Run CLI commands inside the container
docker stop llm-sentinel                   # Stop
docker restart llm-sentinel                # Restart

Real-Time Dashboard πŸ”₯

Access the beautiful monitoring dashboard at:

http://localhost:5050

✨ Features:

  • Real-time WebSocket monitoring - Live updates of intercepted requests
  • Detailed event inspection - Click on any event to see:
    • Complete request/response data
    • Headers (with API keys redacted)
    • Original vs masked content comparison
    • Processing logs and timing
    • Provider identification (OpenAI, Ollama, Claude, etc.)
  • OLED dark mode - Battery-friendly pure black interface
  • Interactive controls - View all 52 detectors, toggle settings
  • Complete configuration - See all settings with CLI examples
  • Horizontal scrolling - long JSON payloads and logs stay fully readable

🎯 What You'll See:

  • Live detection events as they happen
  • Color-coded provider tags (OpenAI, Ollama, etc.)
  • Processing time metrics
  • Detection counts and statistics
  • Masked sensitive data previews

πŸ“± Docker Dashboard Access:

# Run with dashboard
docker run -p 5050:5050 raaihank/llm-sentinel:latest

# Then open browser to:
http://localhost:5050

The dashboard updates in real time as your applications make AI API calls through LLM-Sentinel!

Health Check

curl http://localhost:5050/health
# {"status":"healthy","uptime":3600}

How It Works

graph LR
    A[Your App] -->|HTTP Request| B[LLM-Sentinel]
    B -->|Clean Request| C[AI Model]
    C -->|Response| B
    B -->|Response| A
    
    B --> D[52 Detectors]
    D --> E[API Keys]
    D --> F[Credentials] 
    D --> G[Personal Data]
    D --> H[Private Keys]
    
    style B fill:#e1f5fe
    style D fill:#f3e5f5
    style E fill:#fff3e0
    style F fill:#fff3e0
    style G fill:#fff3e0
    style H fill:#fff3e0
  1. Intercepts requests to AI APIs
  2. Scans content with 52 specialized detectors
  3. Masks sensitive data with safe placeholders
  4. Forwards clean requests to AI models
  5. Logs detections (secure by default)
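Conceptually, each detector pairs a pattern with a safe placeholder. The sketch below is illustrative only; the patterns are simplified assumptions, not LLM-Sentinel's actual rule set:

import re

# Simplified, illustrative patterns (not the actual 52-detector rule set)
DETECTORS = {
    r"sk-[A-Za-z0-9]{6,}": "[OPENAI_API_KEY_MASKED]",
    r"AKIA[0-9A-Z]{16}": "[AWS_ACCESS_KEY_MASKED]",
    r"[\w.+-]+@[\w-]+\.[\w.-]+": "[EMAIL_MASKED]",
}

def mask(text: str) -> str:
    # Replace every match with its placeholder before forwarding the request
    for pattern, placeholder in DETECTORS.items():
        text = re.sub(pattern, placeholder, text)
    return text

print(mask("My AWS key is AKIAIOSFODNN7EXAMPLE"))
# My AWS key is [AWS_ACCESS_KEY_MASKED]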

Configuration

LLM-Sentinel works out of the box with secure defaults. Configuration is optional.

Config file location: ~/.llm-sentinel/config.json

Sample configuration:

{
  "server": { 
    "port": 5050,
    "openaiTarget": "https://api.openai.com",
    "ollamaTarget": "http://localhost:11434"
  },
  "detection": { 
    "enabled": true,
    "enabledRules": ["email", "openaiApiKey", "awsAccessKey"],
    "customRules": []
  },
  "logging": {
    "showDetectedEntity": false,
    "logLevel": "INFO",
    "logToConsole": true,
    "logToFile": true
  },
  "notifications": { 
    "enabled": true,
    "sound": false
  },
  "security": {
    "redactApiKeys": true,
    "redactCustomHeaders": ["x-api-key"]
  }
}
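Before mounting a custom config into the Docker container, a quick parse check catches malformed JSON early (a sketch; the file name matches the Docker example above):

import json

# Fail fast on malformed JSON before the proxy tries to load it
with open("config.json") as f:
    config = json.load(f)
print("Proxy port:", config["server"]["port"])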

Security Features

  • βœ… 52 specialized detectors for comprehensive coverage
  • βœ… Streaming support - works with real-time streaming requests
  • βœ… Privacy-first logging - sensitive data never stored
  • βœ… Zero data retention - proxy only, no storage
  • βœ… Configurable detection - enable/disable specific types
  • βœ… Desktop notifications - real-time detection alerts
  • βœ… API key redaction - request headers cleaned in logs

License

Custom License - Free for personal and non-commercial use. Commercial use requires explicit consent.

See LICENSE for full terms.


πŸ›‘οΈ Protect your sensitive data. Enable all 52 detectors by default.
