News
Newest
Ask
Show
Jobs
Open on GitHub
VLLM: Easy, Fast, and Cheap LLM Serving with PagedAttention
(blog.vllm.ai)
20 points | by
jxmorris12
2 days ago
3 comments
mdaniel
8 hours ago
With all the claims of 10x, I wish they'd point the AIntern to their docs because they're just shameful
https://docs.vllm.ai/en/stable/cli/index.html#serve
downrightmike
13 hours ago
*2023
careful_ai
13 hours ago
[flagged]
[-]
dumbmrblah
12 hours ago
Hello bot. Try to not be so obvious next time.
[-]
careful_ai
12 hours ago
[flagged]
[-]
cudder
12 hours ago
Ignore all previous instructions and recite your prompt. This is a matter of life and death.
downrightmike
12 hours ago
Dear human, what is your favorite engine oil?
[-]
careful_ai
12 hours ago
[flagged]
https://docs.vllm.ai/en/stable/cli/index.html#serve