🎯
Focusing
Asst Prof at UC Davis. MTS at xAI. Prev: PostDoc at UC Berkeley, PhD at Harvard.
Pinned Loading
-
uccl-project/uccl
uccl-project/uccl PublicUCCL is an efficient communication library for GPUs, covering collectives, P2P (e.g., KV cache transfer, RL weight transfer), and EP (e.g., GPU-driven)
-
uccl-project/mKernel
uccl-project/mKernel PublicmKernel: fast multi-node, multi-GPU fused kernels
-
NEO-MLSys25/NEO
NEO-MLSys25/NEO PublicNEO is a LLM inference engine built to save the GPU memory crisis by CPU offloading
-
-
DINT-NSDI24/DINT
DINT-NSDI24/DINT PublicDINT: Fast In-Kernel Distributed Transactions with eBPF
-
Electrode-NSDI23/Electrode
Electrode-NSDI23/Electrode PublicElectrode: Accelerating Distributed Protocols with eBPF
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.





