GIT_FEED

xlite-dev/Awesome-LLM-Inference

๐Ÿ“šA curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.๐ŸŽ‰

View on GitHub
0Active

On the radar โ€” signal detected

Stars
5.1k
Forks
351
Contributors
33
Language
Python

Score updated Mar 26, 2026

// SUBSCRIBE

The repos that moved this week, why they matter, and what to watch next. One email. No noise.

โฏ