DeepSeek open-sources inference optimizations with 60–85% faster generation [pdf]
DeepSpec: a full-stack codebase for training and evaluating speculative decoding algorithms. Paper: DeepSpec/DSpark_paper.pdf (main branch, deepseek-ai/DeepSpec).