@dittops dittops commented Nov 7, 2025

Summary

Add comprehensive hardware requirements documentation targeting Cloud Service Providers (CSPs) and infrastructure architects planning bud-stack deployments.

This documentation provides aggregate infrastructure requirements without service-specific microservice details, focusing on what CSPs need for capacity planning.

Key Features

Infrastructure Requirements Summary

  • Minimum (Dev/Test): 26-30 cores, 46-50 GiB RAM, 350-400 GiB storage
  • Recommended (Staging): 60-80 cores, 80-120 GiB RAM, 500-1,000 GiB storage
  • Production (Large): 120-200 cores, 250-500 GiB RAM, 2-5 TiB storage

Cloud-Specific Configurations

  • Azure AKS with VM sizes and cost estimates
  • AWS EKS with instance types and cost estimates
  • On-premises OpenShift requirements

Production Architecture

  • Specialized node pools (Control Plane, Application, Data Plane, Gateway)
  • Resource breakdown by workload type
  • Node specifications and scaling considerations

Storage & Network Requirements

  • Storage performance specifications (IOPS, throughput, latency)
  • Network bandwidth requirements (1-40 Gbps)
  • Port requirements for external and internal communication

Operational Guidance

  • High availability configuration (RPO/RTO)
  • Multi-region disaster recovery
  • Security requirements
  • Cost optimization strategies
  • Quick reference sizing cheat sheet

Changes

  • Added docs/HARDWARE_REQUIREMENTS.md with CSP-focused infrastructure requirements
  • Includes three deployment profiles with detailed resource specifications
  • Provides cost estimates and optimization strategies
  • Contains quick reference tables for at-a-glance planning

Target Audience

  • Cloud Service Providers (CSPs)
  • Infrastructure Architects
  • DevOps Teams
  • Capacity Planners

Test Plan

  • Document reviewed for accuracy against Helm charts and infrastructure code
  • Resource requirements validated from actual deployment specifications
  • Cost estimates aligned with current cloud provider pricing
  • Pre-commit hooks passed

🤖 Generated with Claude Code

Add detailed hardware requirements documentation targeting Cloud Service
Providers and infrastructure architects planning bud-stack deployments.

Key features:
- Aggregate infrastructure requirements (CPU, memory, storage, network)
- Three deployment profiles: Dev/Test, Staging, Production
- Cloud-specific configurations for Azure AKS, AWS EKS, and on-premises
- Node pool breakdown for production with specialized workload separation
- Storage performance requirements with IOPS and latency specifications
- Network bandwidth and latency requirements
- High availability and disaster recovery guidance
- Cost estimates and optimization strategies
- Quick reference sizing cheat sheet

Focus on CSP-level infrastructure planning without service-specific details.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
@gemini-code-assist

Summary of Changes

Hello @dittops, I'm Gemini Code Assist! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request introduces new, comprehensive documentation detailing the hardware requirements for deploying the Bud-Stack platform. The document is tailored for Cloud Service Providers and infrastructure architects, offering clear guidelines for capacity planning across various deployment scales, from development to large-scale production, and includes cloud-specific considerations and operational best practices.

Highlights

  • New Documentation: A comprehensive HARDWARE_REQUIREMENTS.md document has been added, specifically targeting Cloud Service Providers (CSPs) and infrastructure architects for planning Bud-Stack deployments.
  • Detailed Resource Specifications: The document outlines hardware requirements across three deployment profiles: Minimum (Dev/Test), Recommended (Staging), and Production (Large), covering CPU, RAM, and storage.
  • Cloud-Specific Guidance: It includes configurations and cost estimates for Azure AKS and AWS EKS, as well as requirements for on-premises OpenShift.
  • Production Architecture Breakdown: Detailed information on specialized node pools (Control Plane, Application, Data Plane, Gateway) for production environments is provided, along with resource allocation per workload type.
  • Storage and Network Requirements: The documentation specifies storage performance (IOPS, throughput, latency), network bandwidth (1-40 Gbps), and port requirements.
  • Operational Considerations: Guidance on high availability, multi-region disaster recovery, security, and cost optimization strategies is included, along with a quick reference sizing cheat sheet.

@gemini-code-assist gemini-code-assist bot left a comment

Code Review

This pull request introduces a comprehensive hardware requirements document, which is a valuable addition for users planning deployments. The document is well-structured and detailed. However, my review identified several critical inconsistencies and calculation errors in the resource specifications. The production requirements summary does not align with the detailed breakdown, and some of the 'typical configuration' examples are misleading as they don't match the stated requirements. There are also minor inconsistencies in software version numbers. I've provided specific comments and suggestions to address these issues to ensure the document is accurate and clear for capacity planning.

Comment on lines 53 to 60
| Resource | Requirement |
|----------|-------------|
| **CPU Cores** | 120-200 cores |
| **Memory (RAM)** | 250-500 GiB |
| **Storage (SSD)** | 2-5 TiB |
| **Network Bandwidth** | 10-40 Gbps |
| **Operating System** | Linux (Ubuntu 22.04+, RHEL 8+, or OpenShift 4.12+) |
| **Kubernetes** | Version 1.29+ |

critical

The resource requirements in this summary table are inconsistent with the totals derived from the 'Detailed Production Architecture' section below. For example, you specify 120-200 CPU cores here, but the detailed breakdown sums to 168-304 vCPU. Similar discrepancies exist for RAM and Storage. To avoid confusion for capacity planning, this summary table should be updated to accurately reflect the totals from the detailed breakdown (after correcting the calculation errors in that section).

| **Gateway** | API gateway, ingress | 8 vCPU, 16GB RAM, 100GB SSD | 2-3 | 16-24 vCPU, 32-48GB RAM |


**Total Production Resources**: 168-304 vCPU, 480-848GB RAM, 3-6TB storage

high

The total storage calculation for production resources appears incorrect. Based on the 'Node Pool Breakdown' table, the storage range should be 5.7-9.8 TB, not 3-6 TB.

Calculation:

  • Min: (3 * 500GB) + (5 * 200GB) + (3 * 1TB) + (2 * 100GB) = 5.7 TB
  • Max: (5 * 500GB) + (10 * 200GB) + (5 * 1TB) + (3 * 100GB) = 9.8 TB

Please update this line to reflect the correct total.

Suggested change
- **Total Production Resources**: 168-304 vCPU, 480-848GB RAM, 3-6TB storage
+ **Total Production Resources**: 168-304 vCPU, 480-848GB RAM, 5.7-9.8TB storage
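
The storage arithmetic above can be reproduced with a short script. This is only a sketch: the node counts and per-node disk sizes are the ones quoted in the review comments, the pool names are assumed from the node pool list in the PR summary, and 1 TB is taken as 1000 GB to match the reviewer's calculation.

```python
# Node pool figures as quoted in the review: (pool, min nodes, max nodes, GB per node).
# Pool names are illustrative labels; 1 TB is treated as 1000 GB.
NODE_POOLS = [
    ("control-plane", 3, 5, 500),
    ("application", 5, 10, 200),
    ("data-plane", 3, 5, 1000),
    ("gateway", 2, 3, 100),
]


def storage_range_tb(pools):
    """Return (min_tb, max_tb) of total node-local storage across all pools."""
    min_gb = sum(lo * gb for _, lo, _, gb in pools)
    max_gb = sum(hi * gb for _, _, hi, gb in pools)
    return min_gb / 1000, max_gb / 1000


low, high = storage_range_tb(NODE_POOLS)
print(f"{low:.1f}-{high:.1f} TB")  # prints "5.7-9.8 TB"
```

Keeping the per-pool figures in one table and deriving the total from it would prevent the summary line from drifting away from the breakdown.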

| **Operating System** | Linux (Ubuntu 22.04+, RHEL 8+, or OpenShift 4.12+) |
| **Kubernetes** | Version 1.29+ |

**Typical Configuration**: 3 nodes × (8 vCPU, 16GB RAM, 100GB SSD)

high

The 'Typical Configuration' does not meet the minimum requirements listed in the table above.

  • CPU: 3 nodes × 8 vCPU = 24 vCPU, which is less than the required 32 cores.
  • RAM: 3 nodes × 16GB RAM = 48GB RAM, which is less than the required 64 GiB.

This is misleading for users setting up a development environment. Please adjust the typical configuration to meet or exceed the minimums. For example, you could use 4 nodes.

Suggested change
- **Typical Configuration**: 3 nodes × (8 vCPU, 16GB RAM, 100GB SSD)
+ **Typical Configuration**: 4 nodes × (8 vCPU, 16GB RAM, 100GB SSD)
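
The 4-node suggestion follows from dividing each requirement by the per-node capacity and rounding up. A minimal sketch, assuming the 32-core and 64 GiB minimums cited in the comment and treating GB and GiB as interchangeable for this rough check (the helper name `min_nodes` is made up for illustration):

```python
import math


def min_nodes(required, per_node):
    """Smallest node count whose combined capacity covers every requirement."""
    return max(math.ceil(required[key] / per_node[key]) for key in required)


required = {"vcpu": 32, "ram_gb": 64}  # minimums cited in the review comment
per_node = {"vcpu": 8, "ram_gb": 16}   # node shape from the 'Typical Configuration' line

print(min_nodes(required, per_node))  # prints 4
```

With three nodes, both CPU (24 vCPU) and RAM (48 GB) fall short, so either resource alone is enough to force the fourth node.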

| **Memory (RAM)** | 80-120 GiB |
| **Storage (SSD)** | 500-1,000 GiB |
| **Network Bandwidth** | 5-10 Gbps |
| **Operating System** | Linux (Ubuntu 20.04+, RHEL 8+, or OpenShift 4.12+) |

medium

The Ubuntu version for 'Recommended Requirements' is listed as 20.04+, which is older than the 22.04+ requirement for 'Minimum' and 'Production' environments. For consistency, please update this to 22.04+.

Suggested change
- | **Operating System** | Linux (Ubuntu 20.04+, RHEL 8+, or OpenShift 4.12+) |
+ | **Operating System** | Linux (Ubuntu 22.04+, RHEL 8+, or OpenShift 4.12+) |

| **Storage (SSD)** | 500-1,000 GiB |
| **Network Bandwidth** | 5-10 Gbps |
| **Operating System** | Linux (Ubuntu 20.04+, RHEL 8+, or OpenShift 4.12+) |
| **Kubernetes** | Version 1.25+ |

medium

The Kubernetes version for 'Recommended Requirements' is listed as 1.25+, which is inconsistent with the 1.29+ requirement for 'Minimum' and 'Production' environments. To ensure consistency across the document, please update this to 1.29+.

Suggested change
- | **Kubernetes** | Version 1.25+ |
+ | **Kubernetes** | Version 1.29+ |
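
Version mismatches like the Ubuntu and Kubernetes ones flagged here are easy to catch mechanically. The sketch below is one possible check, not part of the PR: the `PROFILES` dictionary and both function names are hypothetical, and the version strings are the ones quoted in the review comments before the suggested fixes.

```python
def parse_min_version(spec):
    """Turn a '1.29+' style spec into a comparable tuple such as (1, 29)."""
    return tuple(int(part) for part in spec.rstrip("+").split("."))


# Minimum versions per deployment profile, as quoted in the review (pre-fix).
PROFILES = {
    "minimum": {"ubuntu": "22.04+", "kubernetes": "1.29+"},
    "recommended": {"ubuntu": "20.04+", "kubernetes": "1.25+"},
    "production": {"ubuntu": "22.04+", "kubernetes": "1.29+"},
}


def inconsistent_components(profiles):
    """Return {component: {profile: spec}} where profiles disagree on the minimum version."""
    components = sorted({comp for specs in profiles.values() for comp in specs})
    result = {}
    for comp in components:
        specs = {name: p[comp] for name, p in profiles.items() if comp in p}
        if len({parse_min_version(s) for s in specs.values()}) > 1:
            result[comp] = specs
    return result


for comp, specs in inconsistent_components(PROFILES).items():
    print(comp, specs)
```

Run against the pre-fix values, this reports both `kubernetes` and `ubuntu`; after applying the two suggested changes it would report nothing.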

| **Backups** | - | 500 GiB-1 TiB | Standard/Archive |

**Total Storage**:
- **Minimum**: 256 GiB

medium

The 'Minimum' total storage is listed as 256 GiB, but summing the 'Size (Min)' column in the 'Persistent Storage Breakdown' table gives 160 GiB (10+30+50+20+50). This discrepancy is confusing. Please either correct the total or clarify what other storage components are included in this figure.
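
The size of the unexplained gap can be made explicit. A small sketch using only the five 'Size (Min)' values listed in the comment (the component names are not given there, so they are omitted):

```python
# 'Size (Min)' values in GiB, in the order the review comment lists them.
min_sizes_gib = [10, 30, 50, 20, 50]
stated_total_gib = 256  # 'Minimum' total storage stated in the document

computed = sum(min_sizes_gib)
gap = stated_total_gib - computed
print(f"computed={computed} GiB, unexplained={gap} GiB")  # computed=160 GiB, unexplained=96 GiB
```

If the 96 GiB difference covers something outside the table, such as node OS disks or headroom, stating that next to the total would resolve the discrepancy.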

| **Operating System** | Linux (Ubuntu 20.04+, RHEL 8+, or OpenShift 4.12+) |
| **Kubernetes** | Version 1.25+ |

**Typical Configuration**: 5-7 nodes × (16 vCPU, 32GB RAM, 200GB SSD)

medium

The upper bound of the 'Typical Configuration' (7 nodes) significantly exceeds the ranges specified in the 'Recommended Requirements' table. For instance, a 7-node cluster provides 112 vCPU (vs. 60-80 recommended) and 1.4 TB storage (vs. 500-1000 GiB recommended). This is confusing. Please revise the typical configuration to align better with the recommended ranges.
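
A quick way to see which node counts fit the recommended ranges is to multiply out the node shape and compare. This is a sketch under stated assumptions: it covers only the two resources the comment mentions, treats GB and GiB as equal, and the names `RECOMMENDED`, `PER_NODE`, and `out_of_range` are illustrative.

```python
# Recommended ranges and node shape as quoted in the review comment.
RECOMMENDED = {"vcpu": (60, 80), "storage_gb": (500, 1000)}
PER_NODE = {"vcpu": 16, "storage_gb": 200}


def out_of_range(nodes, per_node, ranges):
    """Return {resource: total} for totals that fall outside their (low, high) range."""
    totals = {key: nodes * value for key, value in per_node.items()}
    return {
        key: total
        for key, total in totals.items()
        if not ranges[key][0] <= total <= ranges[key][1]
    }


for n in (5, 7):
    print(n, out_of_range(n, PER_NODE, RECOMMENDED))
# 5 {}
# 7 {'vcpu': 112, 'storage_gb': 1400}
```

For these two resources the 5-node lower bound lands exactly on the top of both ranges, while 7 nodes overshoots both.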

@chatgpt-codex-connector chatgpt-codex-connector bot left a comment

💡 Codex Review

Here are some automated review suggestions for this pull request.

| **Gateway** | API gateway, ingress | 8 vCPU, 16GB RAM, 100GB SSD | 2-3 | 16-24 vCPU, 32-48GB RAM |


**Total Production Resources**: 168-304 vCPU, 480-848GB RAM, 3-6TB storage

P1: Fix production storage totals to match node pool sums

The totals listed after the node pool table claim 3-6TB of storage, but summing the node specs just above gives roughly 5.7-9.8TB (e.g., 3–5 × 500GB control plane + 5–10 × 200GB application + 3–5 × 1TB data + 2–3 × 100GB gateway). The stated figure understates disk requirements by roughly 39-47%, which can lead infrastructure planners to severely under‑provision storage for production deployments.
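
The scale of the understatement can be checked at both ends of the range. A minimal sketch, where `understatement_pct` is a hypothetical helper and the inputs are the stated and recomputed totals in TB:

```python
def understatement_pct(stated, actual):
    """Percentage by which `stated` falls short of `actual`."""
    return (actual - stated) / actual * 100


print(f"{understatement_pct(3, 5.7):.1f}%")  # lower bound: 47.4%
print(f"{understatement_pct(6, 9.8):.1f}%")  # upper bound: 38.8%
```

The shortfall is largest at the low end of the range, which is where a minimally provisioned production cluster would be hit hardest.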
