What is RAG (Retrieval-Augmented Generation)?
spaceto flip
A pattern where you retrieve relevant documents from your own data, include them in the LLM's prompt as context, and the LLM generates an answer grounded in that context. No model training needed. Knowledge is updatable (just re-index). Answers are traceable to source documents. The most practical way to make LLMs answer questions about private data.