Skip to main content
Welcome to the RAG Fundamentals course. I’m Jeremy Morgan. In this course you’ll learn Retrieval-Augmented Generation (RAG) — a practical AI design pattern that blends retrieval from your knowledge sources with large language model (LLM) generation to produce accurate, grounded responses. Why RAG matters
  • When teams need fast, accurate answers from fragmented internal knowledge — runbooks, incident reports, Confluence pages, PDFs, email threads, and other silos — RAG retrieves the exact passages that matter and conditions an LLM on those snippets.
  • Unlike traditional search that returns documents or links, RAG returns relevant context (snippets) and uses that context to generate concise, actionable answers. This makes RAG ideal for incident diagnosis, onboarding, routine troubleshooting automation, and compliance or policy queries where source fidelity is critical.
What this course delivers
  • Conceptual foundations of RAG and when it outperforms pure retrieval or pure generation.
  • Hands-on architecture patterns and trade-offs for end-to-end RAG systems.
  • A practical developer setup and project structure for production-ready RAG workflows.
  • Document ingestion best practices for DOCX, PDF, CSV, and other common formats.
  • Chunking strategies (fixed-size, overlap-aware, semantic chunking) to preserve context and improve retrieval.
  • Retrieval techniques: keyword methods (TF-IDF, BM25), re-ranking, and semantic search with embeddings.
  • An introduction to vector databases, evaluation criteria, and a practical implementation example using ChromaDB.
  • A guided build of an end-to-end RAG pipeline with deployment and integration examples.
The image describes "The RAG Pipeline," featuring a sequence of three components: Knowledge Base, Retrieval System, and Generation System, with an explanation that LLM uses query and context to generate responses. There is also a small circular photo of an individual in the lower right corner.
Who should take this course: engineers, machine learning practitioners, and technical product managers who want to build reliable, explainable assistants over internal knowledge. Basic familiarity with LLM concepts, embeddings, and REST APIs is helpful but not required.
Course roadmap (at-a-glance)
ModuleFocusOutcomes
Introduction to RAGWhat RAG is and why it mattersUnderstand RAG use cases and trade-offs
Architecture patternsSystem-level designs and integration pointsChoose the right RAG architecture for your needs
Development setupProject layout and toolingCreate a reproducible dev environment for RAG
Document ingestionParsing and normalization for DOCX, PDF, CSV, etc.Build robust ingestion pipelines
Chunking & contextFixed vs. semantic chunking, overlap strategiesOptimize chunks for retrieval quality
Retrieval methodsTF-IDF, BM25, semantic search, re-rankingImplement and compare retrieval approaches
Embeddings & similarityTransformer-based sentence embeddings, metricsSelect embedding strategies and similarity metrics
Vector databasesWhy vector stores, evaluation criteriaImplement storage/retrieval using a vector DB (e.g., ChromaDB)
End-to-end buildIntegration and deployment examplesDeliver a production-ready RAG pipeline
Key terms and concepts (quick reference)
  • Retrieval-Augmented Generation (RAG): Combining retrieved context with an LLM to generate grounded, source-backed answers.
  • Embeddings: Vector representations that capture semantic similarity of text fragments.
  • Vector database / vector store: Specialized storage for embeddings enabling efficient similarity search.
  • Chunking: Splitting documents into retrievable segments while preserving necessary context.
  • Re-ranking: Secondary scoring or ordering of candidate passages after initial retrieval.
Recommended resources and further reading Are you ready to master RAG and start building retrieval-powered assistants that accelerate decision-making and productivity? Let’s get started.

Watch Video