Tools · MarkTechPost ·
Meet EAGLE 3.1: The Speculative Decoding Algorithm That Fixes Attention Drift in LLM Inference
The EAGLE team, vLLM, and TorchSpec have released EAGLE 3.1, a speculative decoding algorithm designed to address attention drift and instability in LLM inference.