Added a new demo to minGPT that trains a GPT on pixels of CIFAR-10 images instead of text. Quite powerful that one can run the same training code/model on both domains. Notebook: . Produced reasonable samples after ~only 30 minutes on an 8-GPU V100 node:
显示更多