Retrieval-Augmented Generation (RAG) is an AI framework that retrieves facts from an external knowledge base to ground large language models (LLMs) in accurate, up-to-date information. This accelerator converts documents from HTML or PDF to plain text, imports the document segments into an Elasticsearch vector index, and deploys a Python function that queries the vector index, retrieves the top N results, runs LLM inference to generate an answer to the question, and checks the answer for hallucinations.
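
The flow described above can be sketched end to end in a few dozen lines. This is a minimal illustration under stated assumptions, not the accelerator's actual code: `InMemoryIndex` is a hypothetical stand-in for the Elasticsearch vector index, `embed` is a toy bag-of-words vector in place of a real embedding model, the `llm` argument is any callable that maps a prompt to text, and the token-overlap score is only a crude placeholder for a real hallucination check. All function and parameter names are invented for illustration.

```python
import math
import re
from collections import Counter
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collects text nodes; stands in for the HTML/PDF-to-text conversion step."""

    def __init__(self):
        super().__init__()
        self.parts = []

    def handle_data(self, data):
        if data.strip():
            self.parts.append(data.strip())


def html_to_text(html):
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.parts)


def split_segments(text, max_words=40):
    # Fixed-size word windows; real segmenters usually respect sentence boundaries.
    words = text.split()
    return [" ".join(words[i:i + max_words]) for i in range(0, len(words), max_words)]


def embed(text):
    # ASSUMPTION: toy bag-of-words vector, not a learned embedding.
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


class InMemoryIndex:
    """ASSUMPTION: hypothetical in-memory replacement for the Elasticsearch vector index."""

    def __init__(self):
        self.docs = []

    def add(self, segment):
        self.docs.append((segment, embed(segment)))

    def search(self, question, top_n=3):
        q = embed(question)
        ranked = sorted(self.docs, key=lambda d: cosine(q, d[1]), reverse=True)
        return [segment for segment, _ in ranked[:top_n]]


def answer_question(index, question, llm, top_n=3, min_support=0.5):
    passages = index.search(question, top_n)
    prompt = "Answer using only this context:\n" + "\n".join(passages) + f"\nQuestion: {question}"
    answer = llm(prompt)
    # Crude grounding check: share of answer tokens that also occur in the passages.
    context_tokens = set(embed(" ".join(passages)))
    answer_tokens = set(embed(answer))
    support = len(answer_tokens & context_tokens) / len(answer_tokens) if answer_tokens else 0.0
    return {
        "answer": answer,
        "passages": passages,
        "support": support,
        "possible_hallucination": support < min_support,
    }


# Usage with a stubbed LLM (no network calls).
index = InMemoryIndex()
page = "<html><body><h1>Billing</h1><p>Invoices are issued on the first day of each month.</p></body></html>"
for segment in split_segments(html_to_text(page)):
    index.add(segment)

result = answer_question(
    index,
    "When are invoices issued?",
    llm=lambda prompt: "Invoices are issued on the first day of each month.",
)
```

Passing the LLM in as a callable keeps the retrieval and checking logic testable without a model endpoint. In a real deployment the index, the embedding function, and the grounding check would each be replaced by the corresponding service.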
