Vertex AI Workbench + Cloud IoT integration
Not every AI model runs in a data center. The Edge AI Lab teaches you to take models that were trained on powerful GPUs and compress, optimize, and deploy them on resource-constrained edge devices. The lab has two components: a cloud-based T4 GPU environment for running TensorRT optimizations, ONNX conversions, and quantization experiments, plus remote SSH/VNC access to physical NVIDIA Jetson Orin boards and Coral Edge TPU devices at our Houston training center. You'll measure real inference latency, power consumption, and accuracy on actual hardware — not just simulated benchmarks.
Start the cloud optimization environment (T4 GPU). A remote Jetson Orin or Coral device at our Houston lab is also reserved for you.
Load your trained model from a previous exercise. Export to ONNX format as the universal intermediate representation.
Run TensorRT optimization on the T4 — try FP16, INT8 with calibration data. Compare optimized vs original model size, speed, and accuracy.
SSH into your assigned Jetson Orin. Transfer the optimized model. Install the inference pipeline using provided deployment scripts.
Run inference on the Jetson. Measure real-world FPS, end-to-end latency, GPU/CPU utilization, and power draw.
For capstone exercises, connect a USB camera to the Jetson and run real-time inference with a live video stream.
Other AI Labs environments students typically use alongside this one.
Single-GPU environment for training deep learning models, running computer vision pipelines, and experimenting with neural network architect…
Explore lab →GPU-accelerated environment for image and video processing. Pre-loaded with large-scale vision datasets, annotation tools, and state-of-the-…
Explore lab →High-performance multi-GPU environment for distributed training, large model experiments, and advanced deep learning research. Supports data…
Explore lab →Enroll in a course that uses this lab, or visit our Houston center for a hands-on demo.