sgl-project / sglang Public

Notifications You must be signed in to change notification settings
Fork 620
Star 6.8k

Code
Issues 137
Pull requests 35
Discussions
Actions
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Security
Insights

Issues: sgl-project/sglang

Development Roadmap (2024 Q4)

#1487 opened Sep 21, 2024 by Ying1123

Open 21

[Feature] DeepSeek V3 optimization

#2591 opened Dec 26, 2024 by zhyncs

Open 7

Labels 26 Milestones 0

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

137 Open 668 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Issues list

[Feature] add AMD CIs amd

#2621 opened Dec 27, 2024 by zhyncs

2 tasks

[Feature] FlashInfer new version integration enhancement

New feature or request

flashinfer high priority

#2620 opened Dec 27, 2024 by zhyncs

2 tasks

[Bug] Launching Llama-3.2-11B-Vision-Instruct just hangs on generation

#2619 opened Dec 27, 2024 by SuperMasterBlasterLaser

5 tasks done

[Bug] LLVM/Triton issue

#2613 opened Dec 27, 2024 by NilayYadav

5 tasks done

[Bug] OOM when setting return_logprob=True

#2607 opened Dec 27, 2024 by CSammyfd

5 tasks done

[torch.compile] Large cache size limit

#2604 opened Dec 26, 2024 by anijain2305

Use absolute paths in imports

#2602 opened Dec 26, 2024 by merrymercy

[Bug] Cannot run bitsandbytes llama models good first issue

Good for newcomers

help wanted

Extra attention is needed

#2600 opened Dec 26, 2024 by merrymercy

[Bug] tuning deepseek v2/v3 fused_moe_triton crashed.

#2599 opened Dec 26, 2024 by BBuf

5 tasks done

[Bug] Deepseek v3 doesn't work on mi300x

#2595 opened Dec 26, 2024 by ferrybaltimore

5 tasks done

[Feature] DeepSeek V3 optimization enhancement

New feature or request

high priority performance quant

LLM Quantization

#2591 opened Dec 26, 2024 by zhyncs

5 of 15 tasks

[Bug] libcudart.so.12: cannot open shared object file: No such file or directory

#2584 opened Dec 26, 2024 by githust66

5 tasks

[Feature] Proposal: Releasing SGLang memory when idle feature high priority

#2583 opened Dec 26, 2024 by fzyzcjy

[Feature] Request to Include flashinfer as a Dependency for sglang Installation dependencies

Pull requests that update a dependency file

high priority

#2578 opened Dec 25, 2024 by richardodliu

2 tasks done

[Feature] Due to GIL issues, the overlap mode doesn't actually always bring benefits?

#2573 opened Dec 25, 2024 by CSEEduanyu

2 tasks done

[Feature] (Willing to PR) Proposal: Drop-in fast replacement of PreTrainedModel.generate collaboration enhancement

New feature or request

feature high priority RLHF

Using SGLang for post training

#2569 opened Dec 24, 2024 by fzyzcjy

2 tasks done

[Feature] Running multi-node offline engine inference ( via SLURM) collaboration feature help wanted

Extra attention is needed

#2561 opened Dec 23, 2024 by aflah02

2 tasks done

lora speed enhancement

New feature or request

#2559 opened Dec 23, 2024 by qingzhong1

[Feature] Improve the Zero-Overhead Batch Scheduler performance for the small model

#2558 opened Dec 23, 2024 by libratiger

2 tasks done

upgrade setuptools and wheel if you found "torch module not found" when installing bug

Something isn't working

#2554 opened Dec 23, 2024 by MiladInk

[Bug] Outlines version error for Grammar Backend bug

Something isn't working

good first issue

Good for newcomers

#2550 opened Dec 23, 2024 by zhaochenyang20

5 tasks done

[Feature] Set outlines and xgrammar as addtional dependency enhancement

New feature or request

grammar-backend

#2549 opened Dec 23, 2024 by zhaochenyang20

2 tasks done

[Feature] (Willing to PR) Avoid KV cache occupying GPU memory when not used collaboration feature high priority

#2542 opened Dec 22, 2024 by fzyzcjy

2 tasks done

[Bug] Eagle2 has an unstable sampling rate during multi concurrency。

#2537 opened Dec 21, 2024 by coolhok

5 tasks done

[Feature] Add Docs For Quantization good first issue

Good for newcomers

quant

LLM Quantization

#2531 opened Dec 20, 2024 by binhtranmcs

Previous 1 2 3 4 5 6 Next

Previous Next

ProTip! Find all open issues with in progress development work with linked:pr.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly