b200
Here are 7 public repositories matching this topic...
Spheron Network is a decentralized GPU and cloud compute marketplace that aggregates enterprise-grade NVIDIA GPU capacity from certified Tier 3/4 data centers worldwide and exposes it through a single on-demand, per-minute billed interface. The marketplace covers H100, H200, B200, B300, A100, GH200, L40S, RTX PRO 6000, RTX 5090, RTX 4090, and…
-
Updated
May 25, 2026
Code for pre-training a GPT-2 model on (eight) NVIDIA DGX B200 GPUs and short tutorial on the topic. Uses Torch and HF Transformers. It can pre-train GPT-2 Small on 32 GB of data in around 2.5 hours. It handles dataset tokenization too.
-
Updated
Sep 6, 2025 - Python
Improve this page
Add a description, image, and links to the b200 topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the b200 topic, visit your repo's landing page and select "manage topics."