xlite-dev/Awesome-LLM-Inference
๐A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.๐
0Active
On the radar โ signal detected
Stars
5.1k
Forks
351
Contributors
33
Language
Python
Score updated Mar 26, 2026
// SUBSCRIBE
The repos that moved this week, why they matter, and what to watch next. One email. No noise.