Course Introduction

Welcome to the RAG Fundamentals course. I’m Jeremy Morgan. In this course you’ll learn Retrieval-Augmented Generation (RAG) — a practical AI design pattern that blends retrieval from your knowledge sources with large language model (LLM) generation to produce accurate, grounded responses. Why RAG matters

When teams need fast, accurate answers from fragmented internal knowledge — runbooks, incident reports, Confluence pages, PDFs, email threads, and other silos — RAG retrieves the exact passages that matter and conditions an LLM on those snippets.
Unlike traditional search that returns documents or links, RAG returns relevant context (snippets) and uses that context to generate concise, actionable answers. This makes RAG ideal for incident diagnosis, onboarding, routine troubleshooting automation, and compliance or policy queries where source fidelity is critical.

What this course delivers

Conceptual foundations of RAG and when it outperforms pure retrieval or pure generation.
Hands-on architecture patterns and trade-offs for end-to-end RAG systems.
A practical developer setup and project structure for production-ready RAG workflows.
Document ingestion best practices for DOCX, PDF, CSV, and other common formats.
Chunking strategies (fixed-size, overlap-aware, semantic chunking) to preserve context and improve retrieval.
Retrieval techniques: keyword methods (TF-IDF, BM25), re-ranking, and semantic search with embeddings.
An introduction to vector databases, evaluation criteria, and a practical implementation example using ChromaDB.
A guided build of an end-to-end RAG pipeline with deployment and integration examples.

The image describes "The RAG Pipeline," featuring a sequence of three components: Knowledge Base, Retrieval System, and Generation System, with an explanation that LLM uses query and context to generate responses. There is also a small circular photo of an individual in the lower right corner.

Who should take this course: engineers, machine learning practitioners, and technical product managers who want to build reliable, explainable assistants over internal knowledge. Basic familiarity with LLM concepts, embeddings, and REST APIs is helpful but not required.

Course roadmap (at-a-glance)

Module	Focus	Outcomes
Introduction to RAG	What RAG is and why it matters	Understand RAG use cases and trade-offs
Architecture patterns	System-level designs and integration points	Choose the right RAG architecture for your needs
Development setup	Project layout and tooling	Create a reproducible dev environment for RAG
Document ingestion	Parsing and normalization for DOCX, PDF, CSV, etc.	Build robust ingestion pipelines
Chunking & context	Fixed vs. semantic chunking, overlap strategies	Optimize chunks for retrieval quality
Retrieval methods	TF-IDF, BM25, semantic search, re-ranking	Implement and compare retrieval approaches
Embeddings & similarity	Transformer-based sentence embeddings, metrics	Select embedding strategies and similarity metrics
Vector databases	Why vector stores, evaluation criteria	Implement storage/retrieval using a vector DB (e.g., ChromaDB)
End-to-end build	Integration and deployment examples	Deliver a production-ready RAG pipeline

Key terms and concepts (quick reference)

Retrieval-Augmented Generation (RAG): Combining retrieved context with an LLM to generate grounded, source-backed answers.
Embeddings: Vector representations that capture semantic similarity of text fragments.
Vector database / vector store: Specialized storage for embeddings enabling efficient similarity search.
Chunking: Splitting documents into retrievable segments while preserving necessary context.
Re-ranking: Secondary scoring or ordering of candidate passages after initial retrieval.

Recommended resources and further reading

ChromaDB — Vector Database
Semantic Search and Embeddings Tutorials (examples and background)
Retrieval-Augmented Generation overview (research context)

Are you ready to master RAG and start building retrieval-powered assistants that accelerate decision-making and productivity? Let’s get started.

Introduction

Document Processing and Chunking

Keyword Search & Retrieval

Semantic Search & Embeddings

Vector Databases

Building the RAG Pipeline

Course Introduction

Watch Video