Issues: vllm-project/vllm

[Roadmap] vLLM Roadmap Q4 2024
#9006 opened Oct 1, 2024 by simon-mo
Open 26
vLLM's V1 Engine Architecture
#8779 opened Sep 24, 2024 by simon-mo
Open 10

Issues list

[Bug]: Getting error 500 while requesting to /v1/completions (label: bug)
#11654 opened Dec 31, 2024 by mamadmolla0
1 task done
[Bug]: NotImplementedError: No operator found for memory_efficient_attention_forward (label: bug)
#11653 opened Dec 31, 2024 by AnthonyX1an
1 task done
[New Model]: command-r7b (label: new model)
#11650 opened Dec 31, 2024 by Marcher-lam
1 task done
[Performance]: V1 vs V0 with multi-steps (label: performance)
#11649 opened Dec 31, 2024 by Desmond819
1 task done
[Usage][V1]: how to get logprobs for V1 engine? (label: usage)
#11634 opened Dec 30, 2024 by CypherSavage
[Usage]: async stream error (label: usage)
#11627 opened Dec 30, 2024 by lxb1202
1 task done
[Installation]: Request to include vllm==0.6.3.post1 for cuda 11.8 (label: installation)
#11623 opened Dec 30, 2024 by jxqhhh
1 task done
[Usage]: Can AsyncEngineArgs load multiple lora modules? (label: usage)
#11621 opened Dec 30, 2024 by Jimmy-L99
1 task done
vllm build failure on IBM ppc64le (label: installation)
#11616 opened Dec 30, 2024 by npanpaliya
1 task done
[Bug]: Nvidia DALI and VLLM (label: bug)
#11611 opened Dec 30, 2024 by conceptofmind
1 task done
[Bug]: Can Not load model Qwen2-VL-72B-Instruct in Vllm (label: bug)
#11608 opened Dec 30, 2024 by Tian14267
1 task done
[Bug]: AsyncEngine Backend loop is stopped (label: bug)
#11603 opened Dec 29, 2024 by DongZhaoXiong
1 task done
[Bug]: can not run with OpenGVLab/InternVL2_5-78B-MPO-AWQ (label: bug)
#11601 opened Dec 29, 2024 by bltcn
1 task done