Add litellm inference #385
base: main
Conversation
…ce#382) Co-authored-by: Clémentine Fourrier <[email protected]>
Fixes a cache directory bug by using HF_HUB_CACHE instead of HF_HOME (a sketch of the distinction follows below). See the documentation: https://huggingface.co/docs/huggingface_hub/main/en/package_reference/environment_variables#hfhubcache
Co-authored-by: Nathan Habib <[email protected]> Co-authored-by: Clémentine Fourrier <[email protected]>
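For context, a minimal sketch of why this matters, assuming huggingface_hub's documented defaults (not code from this PR):

```python
import os

# HF_HUB_CACHE points directly at the hub cache; HF_HOME is one level up,
# so using HF_HOME as the cache path would miss the trailing "hub/" segment.
# The fallback values below follow the huggingface_hub documentation.
hf_home = os.environ.get("HF_HOME", os.path.expanduser("~/.cache/huggingface"))
cache_dir = os.environ.get("HF_HUB_CACHE", os.path.join(hf_home, "hub"))
print(cache_dir)
```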
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
Hi @JoelNiklaus! Is this PR ready or do you need any help with it?
From my side it is ready
Nice! I will merge main and resolve any conflicts, as we have made a big change to the way we call the CLI. Is that ok for you?
Sounds great, thanks!
I'm trying to use it; can you provide me with the command you use?
For example like this:
Should be good to go! Added a few logging fixes for convenience. @JoelNiklaus Tell me what you think, are you able to use it? :)
When testing it, I receive this error:
Hey @JoelNiklaus, I modified the way we pass the chat-templated messages to the litellm model.
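For illustration only (this is not the PR's actual diff): litellm accepts OpenAI-style chat messages, so passing chat-templated messages to a model looks roughly like this, using litellm's `completion()` API:

```python
import litellm

# OpenAI-style chat messages; a chat template produces a list in this shape.
messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "What is the capital of France?"},
]

# litellm infers the provider from the model string, e.g. "gpt-4o" (OpenAI).
response = litellm.completion(model="gpt-4o", messages=messages)
print(response.choices[0].message.content)
```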
Ok cool, thanks! Would it make sense to add this to the openai backend in the same way, or remove that backend altogether?
I don't think we need the openai backend anymore if we have the litellm backend; I'm pretty against having two ways of doing the same thing
Checked the litellm docs in depth and I agree. I think we'll want to keep inference endpoints, however (for leaderboards), even though it's slightly redundant
Tests also need to be fixed ^^
…val into add_litellm_inference
…val into add_litellm_inference
Co-authored-by: Albert Villanova del Moral <[email protected]>
Co-authored-by: Albert Villanova del Moral <[email protected]>
This PR enables running inference using any model provider supported by litellm.
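A minimal usage sketch of what this enables, assuming litellm's provider-prefix routing; the model names are illustrative (not taken from this PR) and require the matching API keys to be set (e.g. OPENAI_API_KEY, ANTHROPIC_API_KEY):

```python
import litellm

messages = [{"role": "user", "content": "Say hello."}]

# Switching providers is just a matter of changing the model string:
# a bare name routes to OpenAI, "anthropic/..." routes to Anthropic, etc.
for model in ("gpt-4o-mini", "anthropic/claude-3-haiku-20240307"):
    response = litellm.completion(model=model, messages=messages)
    print(model, "->", response.choices[0].message.content)
```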