THIS CHINESE TEACHER DEPLOYED A 70B AI MODEL LOCALLY ON THE NVIDIA DGX SPARK IN 3 COMMANDS - AND FILMED THE WHOLE THING
after first boot he opened the terminal, ran one command to check the CUDA version and GPU status, and had everything confirmed in seconds
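
the post doesn't name the command he ran. a rough sketch of that check, assuming it was nvidia-smi (which prints the driver version, CUDA version and GPU status in one go); the helper name gpu_status is made up for illustration:

```python
import shutil
import subprocess


def gpu_status() -> str:
    """Return the output of nvidia-smi, or a short note if it is not on PATH.

    Assumption: nvidia-smi is the check used; the post does not say.
    """
    if shutil.which("nvidia-smi") is None:
        return "nvidia-smi not found"
    result = subprocess.run(
        ["nvidia-smi"], capture_output=True, text=True, check=False
    )
    # Fall back to stderr so a driver error is still visible to the caller.
    return result.stdout or result.stderr


print(gpu_status())
```

on a machine without an NVIDIA driver this just prints the "not found" note instead of crashing.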
then typed ollama run nemotron - the DGX Spark automatically downloaded Nemotron 70B and ran it locally with zero additional configuration
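
once the model is running, the same local Ollama server can be queried from a script too. a minimal sketch, assuming Ollama is listening on its default port 11434 and the model tag is nemotron as in the command above; build_request and ask are illustrative names, not part of Ollama:

```python
import json
import urllib.request

# Assumption: Ollama is serving on its default local address.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_request(prompt: str, model: str = "nemotron") -> urllib.request.Request:
    """Build a non-streaming generate request for the local Ollama server."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def ask(prompt: str, model: str = "nemotron") -> str:
    """Send the prompt and return the generated text (needs Ollama running)."""
    with urllib.request.urlopen(build_request(prompt, model)) as resp:
        return json.loads(resp.read())["response"]
```

with the server up, ask("say hello in one sentence") should return the model's reply as a plain string; nothing leaves the machine.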
if the terminal looks too basic - he showed how to install AnyLM, connect Ollama as the provider and get a full chat interface running in under 2 minutes
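
"connecting Ollama as the provider" usually means pointing the chat app at the local Ollama address so it can see which models are installed. a small sketch of that lookup, assuming the default address http://localhost:11434; the function names are hypothetical and this is not the chat app's own code:

```python
import json
import urllib.request

# Assumption: default Ollama address; a chat front end would be given this URL.
OLLAMA_BASE_URL = "http://localhost:11434"


def parse_model_names(tags_payload: dict) -> list[str]:
    """Pull model names out of the JSON returned by Ollama's /api/tags."""
    return [m["name"] for m in tags_payload.get("models", [])]


def list_local_models(base_url: str = OLLAMA_BASE_URL) -> list[str]:
    """Ask the local Ollama server which models are installed (needs it running)."""
    with urllib.request.urlopen(f"{base_url}/api/tags") as resp:
        return parse_model_names(json.loads(resp.read()))
```

if list_local_models() includes the nemotron tag, a front end pointed at the same URL should be able to offer it in its model picker.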
most people think running a 70B model locally requires a data center - he did it on a box the size of a paperback with 3 commands
128GB memory, 70B model, zero cloud costs and a clean chat interface - from unboxing to fully running AI in one afternoon
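
why 128GB is enough for 70B: a back-of-the-envelope estimate of the memory the weights alone need at different precisions. the post doesn't say which quantization was downloaded, so the bit widths below are assumptions, and the estimate ignores KV cache and runtime overhead:

```python
def weight_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate weight memory in GB (1 GB = 1e9 bytes), weights only."""
    return params_billion * bits_per_weight / 8


for bits in (16, 8, 4):
    gb = weight_gb(70, bits)
    print(f"{bits}-bit: {gb:.0f} GB, fits in 128 GB: {gb < 128}")
# 16-bit: 140 GB, fits in 128 GB: False
# 8-bit: 70 GB, fits in 128 GB: True
# 4-bit: 35 GB, fits in 128 GB: True
```

so a 70B model only fits on this box in quantized form; at full 16-bit precision the weights alone would exceed the 128GB.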