05 · Retrieval & Memory
Grounding models in your data. RAG and its building blocks — how to connect a model to external knowledge and give agents persistent memory.
| Step | Topic | One-liner | Status |
|---|---|---|---|
| 1 | RAG | The core pattern: retrieve relevant context, inject into prompt, generate | 🔴 |
| 2 | Embedding models | Models that encode text into dense vectors for semantic comparison | 🔴 |
| 3 | Vector databases | Specialized stores for high-dimensional vectors with ANN search | 🔴 |
| 4 | Chunking strategies | How to split documents so retrieval finds the right piece | 🔴 |
| 5 | Semantic search | Finding documents by meaning, not keywords | 🔴 |
| 6 | Hybrid search | Combining BM25 keyword and dense vector search for better recall | 🔴 |
| 7 | Re-ranking | A second-pass model that reorders retrieval results by relevance | 🔴 |
| 8 | Knowledge graph | Structured entity-relationship graphs as an alternative to vector retrieval | 🔴 |
| 9 | GraphRAG | RAG over a knowledge graph — better for multi-hop and relational queries | 🔴 |
| 10 | Agentic RAG | Agents that decide when and how to retrieve, rather than retrieval being fixed | 🔴 |
| 11 | Long-context retrieval | Using extended context windows as an alternative to chunked retrieval | 🔴 |
| 12 | Working & episodic memory | Short-term (within session) and long-term (across sessions) agent memory | 🔴 |
← Previous section: 04 · Model Optimization & Formats | Next section → 06 · Agents & Orchestration