Amphista: Accelerate LLM Inference with Bi-directional Multiple Drafting Heads in a Non-autoregressive Style Authors: Zeping Li, Xinlong Yang, Ziheng Gao, Ji Li…