
Running MMLU 5-shot on Nemotron Nano Omni with a DGX Spark
My first LLM eval end-to-end: scoring NVIDIA's Nemotron-3-Nano-Omni-30B (NVFP4) on MMLU 5-shot via vLLM on a DGX Spark, with interactive charts breaking down where it's strong, where it's weak, and how confident it is.
Last updated June 18, 2026

Deploying an AI model on GKE with NVIDIA NIM
An overview of my experience with the Google CodeLabs tutorial for deploying an AI model on GKE with NVIDIA NIM
Last updated February 20, 2026