Amphista: Accelerate LLM Inference with Bi-directional Multiple Drafting Heads in a Non-autoregressive Style Authors: Zeping Li, Xinlong Yang, Ziheng Gao, Ji Li… … Read More
New posts every day!
Amphista: Accelerate LLM Inference with Bi-directional Multiple Drafting Heads in a Non-autoregressive Style Authors: Zeping Li, Xinlong Yang, Ziheng Gao, Ji Li… … Read More
Copyright © 2023. All rights reserved