GCP Compute Engine / Vertex AI Custom Training
The Multi-GPU Training Lab is our most powerful training environment — two NVIDIA A100 GPUs with 40 GB of VRAM each, connected via NVLink for ultra-fast inter-GPU communication. This is the same class of hardware used by AI research labs to train state-of-the-art models. You'll use this lab for distributed training exercises: scaling a training job across multiple GPUs using PyTorch FSDP or DeepSpeed, pre-training small transformer language models, training large diffusion models, and running compute-intensive research experiments. These labs are reserved for advanced course exercises where single-GPU training would be impractical.
Multi-GPU labs are scheduled sessions. Book a time slot from your course dashboard. Slots are 2-4 hours depending on the exercise.
At your scheduled time, the A100 instance launches automatically. You'll receive a JupyterLab link with the multi-GPU environment ready.
Set up your distributed training configuration — number of GPUs, batch size per GPU, gradient accumulation steps, and communication backend (NCCL).
Launch distributed training with torchrun or DeepSpeed launcher. Monitor both GPUs in real-time with TensorBoard and nvidia-smi.
Compare scaling efficiency: measure throughput (samples/sec) on 1 GPU vs 2 GPUs. Identify communication bottlenecks.
Save distributed model checkpoints to Cloud Storage. The lab auto-shuts down at the end of your time slot.
Other AI Labs environments students typically use alongside this one.
Single-GPU environment for training deep learning models, running computer vision pipelines, and experimenting with neural network architect…
Explore lab →GPU-accelerated environment for image and video processing. Pre-loaded with large-scale vision datasets, annotation tools, and state-of-the-…
Explore lab →Environment for optimizing and deploying AI models on edge hardware. Includes model compression tools, TensorRT optimization, and remote acc…
Explore lab →Enroll in a course that uses this lab, or visit our Houston center for a hands-on demo.