mmiovski/deepseek-reimplementation

About

Small-scale reimplementation and efficiency study of DeepSeek-inspired Multi-head Latent Attention (MLA), Mixture-of-Experts (MoE) routing, load balancing, and multi-token prediction.

Releases

No releases published
