MaaS Code Assistant AI Quickstart pattern hub/datacenter cluster size
The MaaS Code Assistant AI Quickstart pattern has been tested with a defined set of specifically tested configurations that represent the most common combinations that Red Hat OpenShift Container Platform customers are using or deploying for the x86_64 architecture.
The datacenter hub OpenShift cluster uses the following the deployment configuration:
| Cloud Provider | Node Type | Number of nodes | Instance Type |
|---|---|---|---|
Amazon Web Services | Control Plane | 3 | m5.xlarge |
Amazon Web Services | Worker | 3 | m5.2xlarge |
GPU node requirements
In addition to the worker nodes listed above, this pattern requires at least 2 GPU-equipped nodes for model inference. On AWS, the pattern automatically provisions g6e.2xlarge instances with NVIDIA L40S GPUs. On other providers and bare metal, GPU nodes must already be part of the cluster before deploying the pattern.
| Cloud provider | Node type | Number of nodes | Instance type |
|---|---|---|---|
Amazon Web Services | GPU Worker | 2 | g6e.2xlarge |
