Before you run a retrieval-augmented generation experiment in AutoAI, you must choose a vector database to store and retrieve data.
For AutoAI RAG experiments, you can choose from two vector store options:
In-memory Chroma database
Milvus database
Chroma vector database
Copy link to section
An in-memory Chroma vector store stores data on your computer's RAM. The index of vectorized documents is fast for querying and retrieving data. Due to memory constraints, the in-memory database is built for small to medium-sized datasets.
If you don't specify a connection to a vector store, the vectorized content is saved to the default, in-memory Chroma database. The content does not persist beyond the experiment, so the Chroma option is not a viable production method for deploying
a RAG pattern.
Milvus vector database
Copy link to section
A Milvus vector store database has a scalable and distributed architecture. It offers high performance in retrieving data when it has optimized indexing and supports many indexing algorithms and distance metrics. It's built for large datasets
and is more durable than a Chroma database.
The vectorized content persists for future patterns. For details, see Working with Milvus.
If you are using Milvus in a coded solution, familiarize yourself with the rules for naming conventions and database schema:
Collection names use this format: autoai_rag_a0b1c2d3_ymdHMS> where y-year, m-month, d-day, H-hours, M-minute, S-second.