vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language: Python
Total stars: 34142
Stars trend:
20 Jan 2025
8pm ▎ +2
9pm ▎ +2
10pm █▏ +9
11pm ▍ +3
21 Jan 2025
12am ▎ +2
1am █ +8
2am ▉ +7
3am ▉ +7
4am █▌ +12
5am ▊ +6
6am █▎ +10
7am █▍ +11
#python
#amd, #cuda, #gpt, #hpu, #inference, #inferentia, #llama, #llm, #llm-serving, #llmops, #mlops, #model-serving, #pytorch, #rocm, #tpu, #trainium, #transformer, #xpu
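
For context on what the engine does, here is a minimal offline-inference sketch using vLLM's documented Python API (the LLM and SamplingParams classes); the model checkpoint, prompts, and sampling values are illustrative placeholders, not anything taken from this digest:

from vllm import LLM, SamplingParams

# Sampling settings for generation (values here are illustrative).
sampling_params = SamplingParams(temperature=0.8, top_p=0.95, max_tokens=64)

# Load a model; "facebook/opt-125m" is a small placeholder checkpoint.
llm = LLM(model="facebook/opt-125m")

# Batched generation: vLLM schedules all prompts together for high throughput.
outputs = llm.generate(
    ["Hello, my name is", "The future of AI is"],
    sampling_params,
)

for output in outputs:
    print(output.prompt, "->", output.outputs[0].text)

vLLM also ships an OpenAI-compatible HTTP server for the serving use case, so the same model can be queried over the network instead of in-process.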