enhance benchmark with dataset discovery, validation, performance monitoring, and improved Docker support #32
- Enhanced Dockerfile with multi-stage build and security best practices
- Added Docker build, run, and test scripts with Redis-specific configurations
- Created GitHub Actions workflows for PR validation, master publishing, and release publishing
- Added docker-compose.yml for local development with Redis
- Updated documentation with Docker usage examples
- Configured for redis-performance/vector-db-benchmark Docker Hub repository
- Default configuration: engines=redis, dataset=random-100, experiment=redis-m-16-ef-64
- Multi-platform support (linux/amd64, linux/arm64)
- Security scanning with Trivy for releases
- Updated PR validation to trigger on update-redisearch branch
- Updated publishing workflow to use update-redisearch branch instead of master
- Updated Docker tags to use update-redisearch-{sha} format
- Updated documentation to reflect correct default branch
🐳 Docker Build Validation
✅ Docker build successful!
Platforms tested:
Git SHA:
Docker Hub Status: ✅ Docker Hub credentials configured
Image details:
Tests performed:
The Docker image is ready for deployment! 🚀
…ion, and performance monitoring
- Add --describe command for datasets and engines with columnar display
- Implement real-time performance summaries (QPS, P50/P95 latency)
- Add comprehensive dataset validation system with GitHub Actions
- Complete dataset metadata with vector_count and description fields
- Improve download reliability with proper HTTP headers
- Standardize precision formatting (0.01 increments up to 0.97, then 0.0025)
- Enhanced Docker configurations for better Redis testing defaults
- Add validation documentation and automated CI/CD checks

This maintains backward compatibility while significantly improving usability, data quality, and performance insights for vector database benchmarking.
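
For reviewers who want a concrete picture of the QPS and P50/P95 summary mentioned in this commit, here is a minimal sketch. It is not the code from this PR: the function names `percentile` and `summarize`, the nearest-rank percentile method, and the millisecond units are all assumptions made for illustration.

```python
import math
from typing import Dict, List


def percentile(sorted_values: List[float], pct: int) -> float:
    """Nearest-rank percentile of an already sorted, non-empty list."""
    # Rank is 1-based; clamp to 1 so very small pct values still return the minimum.
    rank = max(1, math.ceil(pct * len(sorted_values) / 100))
    return sorted_values[rank - 1]


def summarize(latencies_s: List[float], wall_time_s: float) -> Dict[str, float]:
    """Return QPS plus P50/P95 latency in milliseconds (hypothetical helper)."""
    if not latencies_s or wall_time_s <= 0:
        raise ValueError("need at least one latency and a positive wall time")
    ordered = sorted(latencies_s)
    return {
        "qps": len(ordered) / wall_time_s,
        "p50_ms": percentile(ordered, 50) * 1000,
        "p95_ms": percentile(ordered, 95) * 1000,
    }
```

Calling `summarize(latencies, elapsed)` after a search run, with per-query latencies in seconds and the total elapsed time, yields a dictionary that can be printed as a one-line summary. Nearest-rank was chosen here only because it needs no interpolation; the benchmark itself may compute percentiles differently.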
- Add Poetry installation to validate-datasets workflow
- Use --no-root to install dependencies without packaging the project
- Run validation script with 'poetry run' to access all dependencies
- Fixes ModuleNotFoundError for stopit and other dependencies when testing --describe functionality
Key Features

Dataset & Engine Discovery
- `--describe datasets` and `--describe engines` commands for easy exploration

Real-time Performance Monitoring

Enhanced Docker Experience
- `.dockerignore` for optimized build contexts
- `docker-compose.yml` in favor of simplified Docker workflows

Comprehensive Data Validation
- `--describe` functionality to ensure reliability

Enhanced Dataset Metadata
- `vector_count` and `description` fields for all 42 datasets

Improved Reliability

Technical Details
- (`run.py`, engine clients, dataset handling), Docker infrastructure, CI/CD workflows
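
The metadata check described under Enhanced Dataset Metadata could look roughly like the sketch below. It is illustrative only: the PR's actual validation script is not shown here, so the function name `validate_datasets`, the list-of-dicts layout, and the `name` key are assumptions. Only the `vector_count` and `description` field names come from the description above.

```python
from typing import Any, Dict, List


def validate_datasets(entries: List[Dict[str, Any]]) -> List[str]:
    """Return human-readable problems; an empty list means every entry passed."""
    problems: List[str] = []
    for index, entry in enumerate(entries):
        # "name" is an assumed key, used only to label error messages.
        name = entry.get("name", f"<entry {index}>")

        count = entry.get("vector_count")
        # bool is a subclass of int in Python, so reject it explicitly.
        if isinstance(count, bool) or not isinstance(count, int) or count <= 0:
            problems.append(f"{name}: vector_count must be a positive integer")

        description = entry.get("description")
        if not isinstance(description, str) or not description.strip():
            problems.append(f"{name}: description must be a non-empty string")
    return problems
```

A CI job could load the dataset definitions, pass them to `validate_datasets`, and fail the build when the returned list is non-empty. Collecting all problems before failing, rather than stopping at the first one, keeps a single CI run useful when several entries are incomplete.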