-
Notifications
You must be signed in to change notification settings - Fork 620
Issues: sgl-project/sglang
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[Feature] FlashInfer new version integration
enhancement
New feature or request
flashinfer
high priority
#2620
opened Dec 27, 2024 by
zhyncs
2 tasks
[Bug] Launching Llama-3.2-11B-Vision-Instruct just hangs on generation
#2619
opened Dec 27, 2024 by
SuperMasterBlasterLaser
5 tasks done
[Bug] Cannot run bitsandbytes llama models
good first issue
Good for newcomers
help wanted
Extra attention is needed
#2600
opened Dec 26, 2024 by
merrymercy
[Bug] tuning deepseek v2/v3 fused_moe_triton crashed.
#2599
opened Dec 26, 2024 by
BBuf
5 tasks done
[Feature] DeepSeek V3 optimization
enhancement
New feature or request
high priority
performance
quant
LLM Quantization
#2591
opened Dec 26, 2024 by
zhyncs
5 of 15 tasks
[Bug] libcudart.so.12: cannot open shared object file: No such file or directory
#2584
opened Dec 26, 2024 by
githust66
5 tasks
[Feature] Proposal: Releasing SGLang memory when idle
feature
high priority
#2583
opened Dec 26, 2024 by
fzyzcjy
[Feature] Request to Include flashinfer as a Dependency for sglang Installation
dependencies
Pull requests that update a dependency file
high priority
#2578
opened Dec 25, 2024 by
richardodliu
2 tasks done
[Feature] Due to GIL issues, the overlap mode doesn't actually always bring benefits?
#2573
opened Dec 25, 2024 by
CSEEduanyu
2 tasks done
[Feature] (Willing to PR) Proposal: Drop-in fast replacement of New feature or request
feature
high priority
RLHF
Using SGLang for post training
PreTrainedModel.generate
collaboration
enhancement
#2569
opened Dec 24, 2024 by
fzyzcjy
2 tasks done
[Feature] Running multi-node offline engine inference ( via SLURM)
collaboration
feature
help wanted
Extra attention is needed
#2561
opened Dec 23, 2024 by
aflah02
2 tasks done
[Feature] Improve the Zero-Overhead Batch Scheduler performance for the small model
#2558
opened Dec 23, 2024 by
libratiger
2 tasks done
upgrade setuptools and wheel if you found "torch module not found" when installing
bug
Something isn't working
#2554
opened Dec 23, 2024 by
MiladInk
[Bug] Outlines version error for Grammar Backend
bug
Something isn't working
good first issue
Good for newcomers
#2550
opened Dec 23, 2024 by
zhaochenyang20
5 tasks done
[Feature] Set outlines and xgrammar as addtional dependency
enhancement
New feature or request
grammar-backend
#2549
opened Dec 23, 2024 by
zhaochenyang20
2 tasks done
[Feature] (Willing to PR) Avoid KV cache occupying GPU memory when not used
collaboration
feature
high priority
#2542
opened Dec 22, 2024 by
fzyzcjy
2 tasks done
[Bug] Eagle2 has an unstable sampling rate during multi concurrency。
#2537
opened Dec 21, 2024 by
coolhok
5 tasks done
[Feature] Add Docs For Quantization
good first issue
Good for newcomers
quant
LLM Quantization
#2531
opened Dec 20, 2024 by
binhtranmcs
Previous Next
ProTip!
Find all open issues with in progress development work with linked:pr.