Skip to content

Issues: huggingface/text-generation-inference

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

Implement logit_bias correctly
#2869 opened Dec 28, 2024 by juanwisz
Deepseek-v2
#2865 opened Dec 26, 2024 by ehartford
2 tasks done
Tool Calling using Vercel's AI SDK not working as intended
#2864 opened Dec 23, 2024 by kldzj
2 of 4 tasks
Dynamically serve LoRA modules
#2860 opened Dec 20, 2024 by rikardradovac
Entire system crashes when get to warm up model
#2853 opened Dec 17, 2024 by ad-astra-video
1 of 4 tasks
Cohere2 aka Cohere2ForCausalLM
#2843 opened Dec 14, 2024 by kno10
2 tasks done
Load model weight fast
#2836 opened Dec 13, 2024 by Zzzz1111
use pip install TGI3.0
#2832 opened Dec 12, 2024 by xiezhipeng-git
Security aspects in TGI
#2830 opened Dec 12, 2024 by vitalyshalumov
1 of 4 tasks
Error for Qwen2-VL-2B-Instruct using v3.0.0
#2823 opened Dec 11, 2024 by tobiasvanderwerff
2 of 4 tasks
Unkown compute for card nvidia-a100-80gb-pcie
#2822 opened Dec 11, 2024 by ferreroal
2 of 4 tasks
Failure when start the model using TGI 3
#2819 opened Dec 10, 2024 by hahmad2008
2 of 4 tasks
text-generation-inference False make install exception
#2805 opened Dec 6, 2024 by tangliangwu
1 of 4 tasks
integration-test failures on MI300
#2804 opened Dec 6, 2024 by itej89
2 of 4 tasks
Qwen2-VL-7B does not run properly
#2801 opened Dec 5, 2024 by jvhgit
2 of 4 tasks
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.