AI Generation with LLM and RAG pattern hub/datacenter cluster size
The AI Generation with LLM and RAG pattern has been tested on a defined set of configurations that represent the most common combinations that Red Hat OpenShift Container Platform customers use or deploy on the x86_64 architecture.
The datacenter hub OpenShift cluster uses the following deployment configuration:
Cloud Provider | Node Type | Number of nodes | Instance Type |
---|---|---|---|
Amazon Web Services | Control Plane | 1 | m5.2xlarge |
Amazon Web Services | Worker | 3 | m5.2xlarge |
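
For capacity planning, it can help to add up what this configuration provides in total. The following is a minimal sketch, not part of the pattern itself: the node counts come from the table above, while the per-instance figures (8 vCPUs and 32 GiB of memory for `m5.2xlarge`) are taken from the published AWS instance specifications and should be verified against current AWS documentation. The names `M5_2XLARGE`, `NODES`, and `cluster_totals` are hypothetical and used for illustration only.

```python
# Assumption: per-instance figures for m5.2xlarge as published by AWS
# (8 vCPUs, 32 GiB memory). Verify against current AWS documentation.
M5_2XLARGE = {"vcpus": 8, "memory_gib": 32}

# Node counts taken from the deployment configuration table.
NODES = {"Control Plane": 1, "Worker": 3}


def cluster_totals(nodes, spec):
    """Sum node count, vCPUs, and memory across all nodes of one instance type."""
    count = sum(nodes.values())
    return {
        "nodes": count,
        "vcpus": count * spec["vcpus"],
        "memory_gib": count * spec["memory_gib"],
    }


totals = cluster_totals(NODES, M5_2XLARGE)
print(totals)
```

Under these assumptions, the tested configuration amounts to 4 nodes with 32 vCPUs and 128 GiB of memory in total, before accounting for resources reserved by the platform itself.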