#speculative-decoding (3 entries)

Lectures (1)
L13: Reasoning 2/2

Papers (2)
Self-Distillation for Multi-Token Prediction
Fast Inference from Transformers via Speculative Decoding