🎯
Focusing
Master student at Zhejiang University, interested in data management, machine learning system
-
Zhejiang University
- China Mainland
- in/kevin-zeng-457625271
Highlights
- Pro
Pinned Loading
-
inclusionAI/cuLA
inclusionAI/cuLA PublicCUDA kernels for linear attention variants, written in CuTe DSL and CUTLASS C++.
-
SandAI-org/MagiAttention
SandAI-org/MagiAttention PublicA Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training
-
flashinfer-ai/flashinfer
flashinfer-ai/flashinfer PublicFlashInfer: Kernel Library for LLM Serving
-
hao-ai-lab/FastVideo
hao-ai-lab/FastVideo PublicA unified inference and post-training framework for accelerated video generation.
-
vllm-project/vllm
vllm-project/vllm PublicA high-throughput and memory-efficient inference and serving engine for LLMs
-
agent-gpu-skills
agent-gpu-skills PublicForked from slowlyC/agent-gpu-skills
Personal Extensions for Agentic GPU Programming Skills
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.


