Fix gguf loading via Transformers #2596

CL-ModelCloud · 2024-12-25T05:48:01Z

No description provided.

Qubitium · 2024-12-25T05:58:39Z

@baberabb This PR contains simple fix so HF Transformers can properly load gguf models via the gguf_file parameter.

Qubitium · 2024-12-25T09:15:52Z

@baberabb During our testing, we found gguf tokenizer load will fail is you override default use_fast=True with use_fast=False + gguf_file enabled. We can conclude gguf format is not compatbile with use_fast and added code bypass.

LSinev · 2024-12-28T07:51:58Z

If I may suggest, please update the documentation (*.md files) with description of this new option and provide some examples of usage (from terminal, at least).

hf support load gguf file

58348a2

CL-ModelCloud changed the title ~~hf support load gguf file~~ Fix gguf loading via Transformers Dec 25, 2024

CL-ModelCloud marked this pull request as ready for review December 25, 2024 05:56

CL-ModelCloud requested review from baberabb and lintangsutawika as code owners December 25, 2024 05:56

CL-ModelCloud marked this pull request as draft December 25, 2024 08:59

CL-ModelCloud and others added 4 commits December 25, 2024 17:02

code review

9d90d7d

code review

e425566

code clean up

026c70c

note about use_fast compat with gguf

fad22cd

CL-ModelCloud marked this pull request as ready for review December 25, 2024 09:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix gguf loading via Transformers #2596

Fix gguf loading via Transformers #2596

CL-ModelCloud commented Dec 25, 2024

Qubitium commented Dec 25, 2024

Qubitium commented Dec 25, 2024

LSinev commented Dec 28, 2024

Fix gguf loading via Transformers #2596

Are you sure you want to change the base?

Fix gguf loading via Transformers #2596

Conversation

CL-ModelCloud commented Dec 25, 2024

Qubitium commented Dec 25, 2024

Qubitium commented Dec 25, 2024

LSinev commented Dec 28, 2024