https://akillness.github.io/posts/accelerating-llms-by-graph-structure/
💡 Accelerating LLMs by 2x with Graph-structured Speculative Decoding. - Fodev JEO