GKE + Vertex AI Workbench
The RAG & Vector Database Lab is your sandbox for building knowledge-grounded AI applications. When an LLM needs to answer questions about your company's documents, customer data, or proprietary knowledge — that's RAG, and this is where you learn to build it. The lab provides pre-deployed vector databases (ChromaDB and Weaviate running on GKE), access to embedding APIs (Vertex AI and open-source models), document processing pipelines, and evaluation frameworks. You'll ingest real documents, build search pipelines, evaluate retrieval quality, and wire everything together into production-ready applications.
Start the lab. A Jupyter environment plus a dedicated ChromaDB/Weaviate instance on GKE spin up. Sample document corpora are pre-loaded.
Process your document corpus using Unstructured — extract text from PDFs, chunk with your chosen strategy, generate embeddings.
Store embeddings in your vector database. Configure metadata fields, distance metrics, and indexing parameters.
Build retrieval pipelines with LangChain or LlamaIndex. Implement hybrid search (vector + keyword), re-ranking, and filtering.
Wire retrieval to an LLM. Build the full RAG chain — retrieve context, construct prompt, generate answer with citations.
Run RAGAS evaluation on your pipeline. Measure faithfulness, relevance, and correctness. Iterate on chunking and retrieval strategy.
Other AI Labs environments students typically use alongside this one.
Single-GPU environment for training deep learning models, running computer vision pipelines, and experimenting with neural network architect…
Explore lab →High-memory GPU environment purpose-built for fine-tuning large language models. Supports full fine-tuning of 7B+ parameter models and param…
Explore lab →Standard Python development environment for data exploration, visualization, and introductory ML. Pre-configured with Python 3.11, NumPy, Pa…
Explore lab →Enroll in a course that uses this lab, or visit our Houston center for a hands-on demo.