2026.06.01 18:39

THIS CHINESE TEACHER DEPLOYED A 70B AI MODEL LOCALLY ON THE NVIDIA DGX SPARK IN 3 COMMANDS - AND FILMED THE WHOLE THING

After first boot, he opened the terminal and ran one command to check the CUDA version and GPU status, confirming everything in seconds.

Then he typed ollama run nemotron - the DGX Spark automatically downloaded Nemotron 70B and ran it locally with zero additional configuration.

If the terminal looks too basic, he also showed how to install AnyLM, connect Ollama as the provider, and get a full chat interface running in under 2 minutes.

Most people think running a 70B model locally requires a data center - he did it with 3 commands on a box the size of a paperback.

128GB memory, 70B model, zero cloud costs, and a clean chat interface - from unboxing to fully running AI in one afternoon.
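
For anyone who wants to try the terminal steps, here is a minimal sketch, not the exact commands from the video. The post does not name the command used for the CUDA and GPU check, so nvidia-smi is an assumption here (it prints the driver version, CUDA version, and GPU status). The check_tool helper is hypothetical and only guards against a missing tool. The ollama run nemotron line is the one quoted in the post.

```shell
#!/usr/bin/env bash

# Hypothetical helper: report whether a tool is installed before relying on it.
check_tool() {
  if command -v "$1" >/dev/null 2>&1; then
    echo "ok: $1"
  else
    echo "missing: $1"
    return 1
  fi
}

# Run the steps only when both tools are on PATH.
if check_tool nvidia-smi && check_tool ollama; then
  # Assumed check command: shows driver version, CUDA version, and GPU status.
  nvidia-smi

  # Command quoted in the post: downloads the model on first run,
  # then opens an interactive prompt in the terminal.
  ollama run nemotron
fi
```

On a machine without the NVIDIA driver or Ollama installed, the script prints which tool is missing and stops there instead of failing partway through.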
