🔎

RAG & Vector Database Lab

GKE + Vertex AI Workbench

Overview

The RAG & Vector Database Lab is your sandbox for building knowledge-grounded AI applications. When an LLM needs to answer questions about your company's documents, customer data, or proprietary knowledge — that's RAG, and this is where you learn to build it. The lab provides pre-deployed vector databases (ChromaDB and Weaviate running on GKE), access to embedding APIs (Vertex AI and open-source models), document processing pipelines, and evaluation frameworks. You'll ingest real documents, build search pipelines, evaluate retrieval quality, and wire everything together into production-ready applications.

What You'll Do in This Lab

  • Ingest thousands of documents — PDFs, HTML, Markdown — with automated parsing and chunking
  • Generate embeddings using Vertex AI Embeddings API and open-source models
  • Store and query vectors in ChromaDB and Weaviate with metadata filtering
  • Build multi-step RAG pipelines with query decomposition and re-ranking
  • Build AI agents with tool use, memory, and multi-step reasoning
  • Evaluate RAG quality with RAGAS — faithfulness, relevance, and answer correctness

Lab Workflow

1

Launch

Start the lab. A Jupyter environment plus a dedicated ChromaDB/Weaviate instance on GKE spin up. Sample document corpora are pre-loaded.

2

Ingest

Process your document corpus using Unstructured — extract text from PDFs, chunk with your chosen strategy, generate embeddings.

3

Index

Store embeddings in your vector database. Configure metadata fields, distance metrics, and indexing parameters.

4

Query

Build retrieval pipelines with LangChain or LlamaIndex. Implement hybrid search (vector + keyword), re-ranking, and filtering.

5

Generate

Wire retrieval to an LLM. Build the full RAG chain — retrieve context, construct prompt, generate answer with citations.

6

Evaluate

Run RAGAS evaluation on your pipeline. Measure faithfulness, relevance, and correctness. Iterate on chunking and retrieval strategy.

Hardware & Environment

Compute n1-standard-8 (8 vCPU, 30 GB RAM)
Vector DB ChromaDB + Weaviate on dedicated GKE pods
Document Storage Cloud Storage bucket with sample corpora (10k+ documents)
Embedding APIs Vertex AI text-embedding-004, open-source sentence-transformers
LLM Access Vertex AI Gemini API, plus open-source models on connected serving labs
Session Length 2-4 hour sessions, vector DB persists between sessions

Pre-installed Tools

ChromaDB / Weaviate / Pinecone LangChain / LlamaIndex Vertex AI Embeddings API Unstructured (document parsing) RAGAS (evaluation)

Frequently asked questions about this lab

What is the RAG & Vector Database Lab? +
Pre-configured environment for building retrieval-augmented generation systems. Includes vector databases, embedding model APIs, document processing pipelines, and evaluation frameworks.
Which courses use this lab? +
This lab is included in: NLP & Large Language Model Engineering, Generative AI & Prompt Engineering, AI for Healthcare & Life Sciences.
What hardware does this lab run on? +
GKE + Vertex AI Workbench. Compute: n1-standard-8 (8 vCPU, 30 GB RAM); Vector DB: ChromaDB + Weaviate on dedicated GKE pods; Document Storage: Cloud Storage bucket with sample corpora (10k+ documents); Embedding APIs: Vertex AI text-embedding-004, open-source sentence-transformers.
What software comes pre-installed? +
Comes pre-loaded with ChromaDB / Weaviate / Pinecone, LangChain / LlamaIndex, Vertex AI Embeddings API, Unstructured (document parsing), RAGAS (evaluation). No local installs or dependency setup required — open your browser and start working.
Can I bring my own datasets and code into this lab? +
Yes. Datasets can be uploaded directly or synced from Google Cloud Storage. Notebooks and source files have built-in Git integration so you can push work to your own GitHub or GitLab repos.
Do I need to enroll in a course to use this lab? +
Yes. Lab environments are provisioned per-student as part of an AI Labs course enrollment. Browse the courses linked above to find programs that include this lab.

Related labs

Other AI Labs environments students typically use alongside this one.

Ready to Try This Lab?

Enroll in a course that uses this lab, or visit our Houston center for a hands-on demo.