Yichao Fu, Peter Bailis, Ion Stoica, Hao Zhang: Break the Sequential Dependency of LLM Inference Using Lookahead Decoding. ICML 2024: 14060-14079