
Implement logit_bias correctly #2869

Open
juanwisz opened this issue Dec 28, 2024 · 0 comments
juanwisz commented Dec 28, 2024

Feature request

Currently, logit_bias is marked as unused in the router: https://github.com/huggingface/text-generation-inference/blob/main/router/src/lib.rs

The documentation also states that a JSON object is expected, mapping token IDs to a bias value between -100 and 100. This is misaligned with the code's typing, which asks for a vector of floats.
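
For reference, here is a minimal sketch of what the documented contract implies (a map from token ID to a clamped bias), not TGI's actual code. The struct and field names and the standalone bias application are illustrative only; a real fix would need to thread the bias through to the server-side logits processing.

```rust
// Sketch only: models the documented logit_bias shape and applies it to a logits buffer.
// Assumes serde / serde_json are available (the router already depends on them).
use std::collections::HashMap;
use serde::Deserialize;

#[derive(Deserialize, Debug)]
struct ChatRequest {
    // Documented shape: a JSON object mapping token ids to a bias in [-100, 100],
    // e.g. {"50256": -100.0} -- not a bare vector of floats.
    #[serde(default)]
    logit_bias: Option<HashMap<String, f32>>,
}

fn apply_logit_bias(logits: &mut [f32], bias: &HashMap<String, f32>) {
    for (token_id, value) in bias {
        if let Ok(id) = token_id.parse::<usize>() {
            if let Some(logit) = logits.get_mut(id) {
                // Clamp to the documented [-100, 100] range before adding.
                *logit += value.clamp(-100.0, 100.0);
            }
        }
    }
}

fn main() {
    let request: ChatRequest =
        serde_json::from_str(r#"{"logit_bias": {"2": -100.0, "5": 3.5}}"#).unwrap();
    let mut logits = vec![0.0_f32; 8];
    if let Some(bias) = &request.logit_bias {
        apply_logit_bias(&mut logits, bias);
    }
    println!("{:?}", logits);
}
```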

Motivation

logit_bias is an important parameter; the InferenceClient documentation in huggingface_hub states that it can be used, but it does not work.
Also see:
huggingface/huggingface_hub#2720

Your contribution

I can definitely help build a PR, but I will need details on what type of solution is expected.
