This deployment is based on the Validated Patterns framework, which uses GitOps to provision all operators and applications. It deploys a chatbot application that combines Large Language Models (LLMs) with the Retrieval-Augmented Generation (RAG) framework, running on Red Hat OpenShift, to generate a project proposal for a given Red Hat product.


Solution elements

  • LLM application augmented with content from Red Hat product documentation.
  • Multiple LLM providers (OpenAI, Hugging Face, NVIDIA).
  • Redis vector database to store embeddings of Red Hat product documentation.
  • Monitoring dashboard to provide key metrics such as ratings.
  • GitOps setup to deploy the end-to-end demo (frontend / vector database / served models).
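
The retrieval step behind these elements can be illustrated with a minimal sketch. This is not the pattern's actual code: the bag-of-words `embed` function stands in for a real embedding model, `InMemoryVectorStore` stands in for the Redis vector database, and `build_prompt` and the sample documentation strings are hypothetical names and data chosen only for illustration. The resulting prompt is what would be sent to whichever LLM provider is configured.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy bag-of-words 'embedding' (assumption: stands in for a real embedding model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse token-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class InMemoryVectorStore:
    """Hypothetical in-memory stand-in for the Redis vector database."""

    def __init__(self):
        self._items = []  # list of (text, embedding) pairs

    def add(self, text):
        self._items.append((text, embed(text)))

    def search(self, query, k=2):
        """Return the k stored texts most similar to the query."""
        q = embed(query)
        ranked = sorted(self._items, key=lambda item: cosine(q, item[1]), reverse=True)
        return [text for text, _ in ranked[:k]]


def build_prompt(product, store, k=2):
    """Augment the request with retrieved documentation before calling an LLM."""
    context = store.search(product, k)
    lines = ["Use the documentation excerpts below to draft a project proposal.", "", "Context:"]
    lines += [f"- {excerpt}" for excerpt in context]
    lines += ["", f"Task: write a project proposal for {product}."]
    return "\n".join(lines)


# Illustrative documentation snippets (made up for this sketch).
store = InMemoryVectorStore()
store.add("OpenShift GitOps uses Argo CD to sync cluster state from Git.")
store.add("OpenShift Pipelines provides Tekton-based CI/CD pipelines.")
store.add("OpenShift AI serves and monitors machine learning models.")

prompt = build_prompt("Red Hat OpenShift GitOps", store, k=1)
print(prompt)
```

In a real deployment the embeddings would come from a trained model and the similarity search would run inside the vector database; the sketch only shows how retrieved documentation ends up in the prompt.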

Red Hat Technologies

  • Red Hat OpenShift Container Platform (Kubernetes)
  • Red Hat OpenShift GitOps (Argo CD)
  • Red Hat OpenShift Pipelines (Tekton)
  • Red Hat OpenShift AI


The following diagram shows the relationship between the AI components:

