xlite-dev/Awesome-LLM-Inference
๐A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.๐
0Active
On the radar โ signal detected
Stars
5.4k
Forks
415
Contributors
39
Language
Python
Score updated Jun 26, 2026
// SUBSCRIBE
The repos that moved this week, why they matter, and what to watch next. One email. No noise.