Edge AI & Optimization Lab

Overview

Not every AI model runs in a data center. The Edge AI Lab teaches you to take models that were trained on powerful GPUs and compress, optimize, and deploy them on resource-constrained edge devices. The lab has two components: a cloud-based T4 GPU environment for running TensorRT optimizations, ONNX conversions, and quantization experiments, plus remote SSH/VNC access to physical NVIDIA Jetson Orin boards and Coral Edge TPU devices at our Houston training center. You'll measure real inference latency, power consumption, and accuracy on actual hardware — not just simulated benchmarks.

What You'll Do in This Lab

Convert PyTorch and TensorFlow models to ONNX intermediate format
Optimize models with TensorRT — FP16, INT8 calibration, layer fusion
Quantize models for TensorFlow Lite and Coral Edge TPU deployment
Deploy optimized models to physical NVIDIA Jetson Orin via remote access
Benchmark inference: measure FPS, latency, power consumption, and accuracy on real hardware
Build complete edge pipelines — camera input → preprocessing → inference → output

Lab Workflow

1

Launch

Start the cloud optimization environment (T4 GPU). A remote Jetson Orin or Coral device at our Houston lab is also reserved for you.

2

Load Model

Load your trained model from a previous exercise. Export to ONNX format as the universal intermediate representation.

3

Optimize

Run TensorRT optimization on the T4 — try FP16, INT8 with calibration data. Compare optimized vs original model size, speed, and accuracy.

4

Deploy

SSH into your assigned Jetson Orin. Transfer the optimized model. Install the inference pipeline using provided deployment scripts.

5

Benchmark

Run inference on the Jetson. Measure real-world FPS, end-to-end latency, GPU/CPU utilization, and power draw.

6

Demo

For capstone exercises, connect a USB camera to the Jetson and run real-time inference with a live video stream.

Hardware & Environment

Cloud Optimization	n1-standard-8 + 1x NVIDIA T4 for TensorRT/ONNX optimization
Edge Device 1	NVIDIA Jetson Orin NX (8 GB, 6-core ARM, 1024 CUDA cores)
Edge Device 2	Google Coral Edge TPU USB Accelerator (4 TOPS INT8)
Remote Access	SSH + VNC to physical devices at Houston lab
Peripherals	USB cameras, GPIO sensors connected to Jetson boards
Session Length	2-3 hour scheduled slots (shared hardware)

Frequently asked questions about this lab

What is the Edge AI & Optimization Lab? +

Environment for optimizing and deploying AI models on edge hardware. Includes model compression tools, TensorRT optimization, and remote access to physical NVIDIA Jetson and Coral Edge TPU boards at our Houston lab.

Which courses use this lab? +

This lab is included in: Computer Vision & Visual AI, AI for Robotics & Edge Computing.

What hardware does this lab run on? +

Vertex AI Workbench + Cloud IoT integration. Cloud Optimization: n1-standard-8 + 1x NVIDIA T4 for TensorRT/ONNX optimization; Edge Device 1: NVIDIA Jetson Orin NX (8 GB, 6-core ARM, 1024 CUDA cores); Edge Device 2: Google Coral Edge TPU USB Accelerator (4 TOPS INT8); Remote Access: SSH + VNC to physical devices at Houston lab.

What software comes pre-installed? +

Comes pre-loaded with NVIDIA TensorRT, ONNX Runtime, TensorFlow Lite, Jetson SDK (remote access), Coral Edge TPU compiler. No local installs or dependency setup required — open your browser and start working.

Can I bring my own datasets and code into this lab? +

Yes. Datasets can be uploaded directly or synced from Google Cloud Storage. Notebooks and source files have built-in Git integration so you can push work to your own GitHub or GitLab repos.

Do I need to enroll in a course to use this lab? +

Yes. Lab environments are provisioned per-student as part of an AI Labs course enrollment. Browse the courses linked above to find programs that include this lab.

Related labs

Other AI Labs environments students typically use alongside this one.

Ready to Try This Lab?

Enroll in a course that uses this lab, or visit our Houston center for a hands-on demo.

Browse Courses View All Labs