DeepMed Search

Note: This project is still a work in progress. Some features may be incomplete or subject to change.

English | 中文

DeepMed Search is a versatile search application built with the Next.js App Router, featuring a unified interface for Web Search, LLM Introspection, and Knowledge Base (KB) Search.

✨ Features

Unified Search Interface

Single search bar with tabs to seamlessly switch between three search modes
Modern responsive design that adapts to all devices
Smooth interaction experience

Web Search

Support for multiple search engines:
- Tavily: AI-optimized search engine
- Jina: Intelligent web content extraction
- DuckDuckGo: Privacy-focused search
Real-time web information retrieval
Clear display of search results

LLM Introspection

Support for major language models:
- GPT (OpenAI)
- DeepSeek
- Gemini (Google)
Direct answers based on model's internal knowledge
Quick access to structured responses

Knowledge Base Search

Intelligent Retrieval: Precise search based on semantic similarity
Hybrid Search: Combines vector search and BM25 full-text search, balancing semantic understanding and keyword matching
Vector Optimization: Uses Milvus professional vector database for high-performance retrieval
Detailed Results: Displays source document, relevance score, page number, and more
Interactive Experience: Click results to view full text chunks and details

Knowledge Base Management

Create and manage multiple knowledge bases
Upload and process documents (PDF, DOCX, TXT, etc.)
Automatic vector embedding generation
View and delete knowledge base content

🛠 Technology Stack

Frontend

Framework: Next.js 14+ (App Router)
Language: TypeScript
UI Library: React 19
Styling: Tailwind CSS
Component Library: shadcn/ui, Radix UI
Icons: Lucide Icons
Internationalization: react-i18next, i18next
Forms: React Hook Form, Zod
File Upload: react-dropzone

Backend

Runtime: Next.js Server Actions
Database: PostgreSQL (structured data)
ORM: Prisma
Authentication: NextAuth.js
Vector Database: Milvus (vector storage and retrieval)
AI SDK: Vercel AI SDK (@ai-sdk/openai)

External Services

AI Services: Vercel AI SDK with OpenAI provider (embeddings and chat)
Search Services: Tavily, Jina, DuckDuckGo
LLM Providers: OpenAI, DeepSeek, Google Vertex AI
File Storage: MinIO (optional)
Cache: Redis (optional)

Development Tools

Code Standards: ESLint, Prettier
Git Hooks: Husky

📐 System Architecture

graph TD
    subgraph Frontend["Frontend (Next.js)"]
        UI["User Interface<br/>(React Components)"]
        Hooks["Hooks & Context<br/>(State Management)"]
    end

    subgraph Backend["Backend (Next.js Server)"]
        Actions["Server Actions<br/>(Request Handling)"]
        Lib["Core Libraries<br/>(Business Logic)"]
        Auth["Authentication<br/>(NextAuth)"]
    end

    subgraph Storage["Data Storage"]
        PG["PostgreSQL<br/>(Structured Data)"]
        Milvus["Milvus<br/>(Vector Database)"]
        Files["MinIO<br/>(Files + Vector Storage)"]
    end

    subgraph External["External Services"]
        AISDK["Vercel AI SDK<br/>(Embeddings & Chat)"]
        SearchAPI["Search APIs<br/>(Tavily/Jina/DuckDuckGo)"]
        LLMProviders["LLM Providers<br/>(OpenAI/DeepSeek/Vertex AI)"]
    end

    %% Connections
    UI <--> Hooks
    Hooks --> Actions
    Actions --> Lib
    Lib --> PG
    Lib --> Milvus
    Lib --> Files
    Lib --> AISDK
    Lib --> SearchAPI
    Lib --> LLMProviders
    Auth <--> PG

    %% Styles
    classDef frontend fill:#D1E8FF,stroke:#333
    classDef backend fill:#E0E0E0,stroke:#333
    classDef database fill:#FFF2CC,stroke:#333
    classDef external fill:#FFD1DC,stroke:#333

    class Frontend frontend
    class Backend backend
    class Storage database
    class External external

🚀 Quick Start

Prerequisites

Node.js 18+
Docker and Docker Compose
PostgreSQL 14+ (or use Docker)

1. Clone the Project

git clone <repository-url>
cd deepmed-search

2. Start Dependencies

This project uses Docker Compose to manage development environment dependencies, including PostgreSQL, Redis, and MinIO.

Start All Services

# Start all services (PostgreSQL, Redis, MinIO)
docker-compose up -d

# Or start only specific services
docker-compose up -d postgres redis

Check Service Status

# View all service status
docker-compose ps

# View service logs
docker-compose logs -f postgres
docker-compose logs -f redis
docker-compose logs -f minio

Stop and Restart Services

# Stop all services
docker-compose stop

# Restart services
docker-compose restart

# Stop and remove containers (keep data)
docker-compose down

# Complete cleanup (including data volumes, use with caution!)
docker-compose down -v

Service Description

PostgreSQL: Stores structured data (users, documents, knowledge bases, etc.)
Milvus: Professional vector database for high-performance vector retrieval
Redis: Used for caching and queue system
MinIO: S3-compatible object storage for file storage and Milvus vector persistence

2. Install Dependencies

npm install
# or
yarn install

3. Configure Environment Variables

# Copy environment variable template
cp .env.example .env.local

Edit .env.local and configure the following required items:

# Database connection
DATABASE_URL="postgresql://postgres:postgres@localhost:5432/deepmed"

# NextAuth authentication
NEXTAUTH_URL="http://localhost:3000"
NEXTAUTH_SECRET="your-secret-key-here"

# OpenAI API (used by Vercel AI SDK for embeddings and chat, required for KB search)
OPENAI_API_KEY="your-openai-api-key"
OPENAI_BASE_URL="https://api.openai.com/v1"
OPENAI_API_MODEL="gpt-4o-mini"  # Chat model
OPENAI_API_REASON_MODEL="o4-mini"  # Reasoning model (optional)

# Web search APIs
TAVILY_API_KEY="your-tavily-api-key"
JINA_API_KEY="your-jina-api-key"

# Optional: Other LLM APIs
# DEEPSEEK_API_KEY="your-deepseek-api-key"
# GEMINI_API_KEY="your-gemini-api-key"

# Optional: MinIO file storage
# MINIO_ENDPOINT="localhost:9000"
# MINIO_ACCESS_KEY="minioadmin"
# MINIO_SECRET_KEY="minioadmin"

4. Initialize Database

# Run database migrations
npx prisma migrate dev
# or
yarn db:migrate

# Initialize PostgreSQL extensions
yarn db:init

# Verify extension installation
yarn db:test

5. Create Test User

# Create default test user
npm run create:user
# or
yarn create:user

This will create the following test account:

Field	Value
Email	`test@example.com`
Password	`password123`
Name	Test User
Language	Chinese (zh)

Note: First run will automatically create a test tenant and test user. If the user already exists, the creation will be skipped.

6. Start Development Server

npm run dev
# or
yarn dev

Visit http://localhost:3000 to start using the application!

7. Login to the System

Open your browser and visit http://localhost:3000
Click the login button
Use the test account to login:
- Email: test@example.com
- Password: password123

Service Access Points

Service	Address	Credentials
Application	http://localhost:3000	See test account above
PostgreSQL	`localhost:5432`	User: `postgres` Password: `postgres` Database: `deepmed`
Milvus	`localhost:19530`	gRPC endpoint for vector operations
Attu (Milvus UI)	http://localhost:8000	Milvus administration interface
Redis	`localhost:6379`	No password
MinIO API	http://localhost:9000	User: `minioadmin` Password: `minioadmin`
MinIO Console	http://localhost:9001	User: `minioadmin` Password: `minioadmin`
Prisma Studio	http://localhost:5555	Access after running `yarn db:studio`

📖 Development Guide

Database Management

Create Migration

# Generate migration after modifying schema.prisma
npx prisma migrate dev --name <migration-name>

View Database

# Start Prisma Studio
yarn db:studio

Knowledge Base Vector Search

How It Works

Knowledge base search is based on vector embedding technology:

Document Upload: Users upload documents (PDF, DOCX, TXT, etc.)
Text Extraction: System extracts text content from documents
Chunking: Long texts are split into appropriately sized chunks
Generate Embeddings: Vercel AI SDK (with OpenAI provider) generates vector representations for each text chunk
Store Vectors: Vectors are stored in Milvus vector database
Retrieve Matches: During search, query text is also converted to vectors, and most relevant text chunks are found through similarity search

Search Modes

The application supports three search modes:

Vector Search
- Based on semantic similarity
- Understands synonyms and context
- Suitable for conceptual questions
Full-Text Search
- Based on BM25 algorithm
- Precise keyword matching
- Suitable for finding specific terms
Hybrid Search (Recommended)
- Combines vector search and full-text search
- Balances semantic understanding and keyword matching
- Suitable for most use cases

Adjust Search Parameters

You can adjust search parameters in src/lib/milvus/operations.ts:

// Weight configuration
bm25Weight: 0.3,      // Full-text search weight
vectorWeight: 0.7,    // Vector search weight

// Threshold configuration
bm25Threshold: 0.1,   // Minimum score for full-text search
vectorThreshold: 0.3, // Minimum similarity for vector search
minSimilarity: 0.3,   // Final result minimum similarity

// Result count
limit: 10             // Number of results to return

Milvus Vector Database

The project uses Milvus as the professional vector database:

High Performance: Optimized for large-scale vector similarity search
Scalability: Supports billions of vectors with distributed architecture
Multiple Index Types: HNSW, IVF_FLAT, IVF_SQ8, etc.
Flexible Deployment: Standalone or cluster mode

Adding UI Components

This project uses shadcn/ui component library:

# Add new component
npx shadcn@latest add <component-name>

# Example: Add button component
npx shadcn@latest add button

Components will be automatically added to the src/components/ui directory.

Code Standards

# Run code linting
yarn lint

# Auto-fix issues
yarn lint --fix

📝 Available Scripts

# Development
yarn dev              # Start development server
yarn build            # Build for production
yarn start            # Start production server

# Code Quality
yarn lint             # Run code linting
yarn test             # Run tests (if configured)

# Database
yarn db:generate      # Generate Prisma Client
yarn db:migrate       # Run database migrations
yarn db:push          # Push schema to database (dev)
yarn db:studio        # Start Prisma Studio
yarn db:init          # Initialize PostgreSQL extensions
yarn db:test          # Test database extensions

# Utilities
yarn create:user      # Create test user (if exists)

🔧 Troubleshooting

Knowledge Base Search Returns Empty Results

1. Check Database Extensions

# Test if extensions are correctly installed
yarn db:test

If the test fails, reinitialize:

yarn db:init

2. Adjust Search Parameters

In src/lib/milvus/operations.ts:

Lower similarity threshold: Get more results (but may be less relevant)
Adjust search parameters:
- nprobe: Number of units to query (higher = more accurate but slower)
- ef: Size of the dynamic candidate list (for HNSW index)
- topK: Number of results to return

3. Check Milvus Connection

# Check Milvus service status
docker-compose ps milvus-standalone

# View Milvus logs
docker-compose logs -f milvus-standalone

4. Rebuild Milvus Collection

# If vector data is corrupted, you may need to recreate the collection
# This will delete all vectors, so use with caution

5. Check Embedding Model Configuration

Ensure correct configuration in .env.local:

OPENAI_API_KEY="your-key"
OPENAI_BASE_URL="https://api.openai.com/v1"

Test API connection:

curl $OPENAI_BASE_URL/models \
  -H "Authorization: Bearer $OPENAI_API_KEY"

6. Search Optimization Tips

Use short and clear keywords
Try different phrasings
Appropriately lower similarity thresholds
Ensure vectors are properly indexed in Milvus

Docker Service Issues

# View container logs
docker-compose logs postgres

# Restart service
docker-compose restart postgres

# Complete reset (Warning: will delete all data)
docker-compose down -v
docker-compose up -d postgres

Database Connection Issues

Check connection string format:

# Correct format
DATABASE_URL="postgresql://username:password@host:port/database"

# Example
DATABASE_URL="postgresql://postgres:postgres@localhost:5432/deepmed"

💡 Usage Examples

Create Knowledge Base and Upload Documents

Start Services

# Start database
docker-compose up -d postgres

# Start application
yarn dev

Create Knowledge Base

Visit /knowledge page
Click "Create Knowledge Base" button
Fill in knowledge base information:
- Name: e.g., "Medical Literature"
- Description: Explain the purpose
- Other configurations (optional)
Confirm creation

Upload Documents

Enter knowledge base details page
Switch to "Documents" tab
Drag and drop files or click to upload
Supported formats: PDF, DOCX, TXT, Markdown
Wait for document processing (generating vector embeddings)

Search Knowledge Base

Visit home page /search
Select "Knowledge Base" tab
Select knowledge base from dropdown
Enter query
View search results

Using Web Search

# 1. Visit home page
http://localhost:3000

# 2. Select "Web Search" tab

# 3. Choose search engine:
#    - Tavily: Fastest, AI-optimized
#    - Jina: Most complete content extraction
#    - DuckDuckGo: Privacy-focused

# 4. Enter query and search

Using LLM Q&A

# 1. Select "LLM" tab

# 2. Choose model:
#    - GPT: Most versatile
#    - DeepSeek: Good Chinese performance
#    - Gemini: Large context window

# 3. Enter question
# 4. Get structured answer

🤝 Contributing

Contributions are welcome! Please follow these steps:

Fork this repository
Create a feature branch (git checkout -b feature/AmazingFeature)
Commit your changes (git commit -m 'Add some AmazingFeature')
Push to the branch (git push origin feature/AmazingFeature)
Open a Pull Request

📄 License

This project is licensed under the MIT License - see the LICENSE file for details

📧 Contact

For questions or suggestions, feel free to open an Issue or Pull Request.

Built with ❤️ by H!NT Lab

Name		Name	Last commit message	Last commit date
Latest commit History 72 Commits
.husky		.husky
.vscode		.vscode
logs		logs
prisma		prisma
public		public
scripts		scripts
src		src
test-results		test-results
test/data		test/data
.cursorignore		.cursorignore
.env.example		.env.example
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
README.zh-CN.md		README.zh-CN.md
components.json		components.json
docker-compose.yml		docker-compose.yml
eng.traineddata		eng.traineddata
eslint.config.mjs		eslint.config.mjs
next-env.d.ts		next-env.d.ts
next.config.ts		next.config.ts
package.json		package.json
postcss.config.mjs		postcss.config.mjs
tsconfig.json		tsconfig.json
yarn.lock		yarn.lock

License

hint-lab/deepmed-search

Folders and files

Latest commit

History

Repository files navigation