Gliner quantization #145
Unanswered
carloronsi asked this question in Q&A
Replies: 2 comments
- I'm also having a similar problem: when quantizing to 8-bit I'm measuring quite a strong drop in performance, more than I usually see for this type of model (a rough before/after check is sketched below).
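A minimal sketch of the kind of before/after check meant here, assuming the quantized model exposes GLiNER's `predict_entities` interface; the model name, sample text, labels, and threshold are placeholders, not settings from this thread:

```python
# Rough sketch: run the same text through the full-precision model and the
# quantized one, then compare the returned entities and their scores.
# Model name, text, labels, and threshold below are placeholders.
from gliner import GLiNER

text = "Steve Jobs co-founded Apple in Cupertino in 1976."
labels = ["person", "organization", "location", "date"]

original = GLiNER.from_pretrained("urchade/gliner_base")

# Loading the quantized model depends on your setup (e.g. an ONNX session);
# it is only assumed here to expose the same predict_entities interface.
models = {"original": original}

for name, model in models.items():
    entities = model.predict_entities(text, labels, threshold=0.5)
    print(name, [(e["text"], e["label"], round(e["score"], 2)) for e in entities])
```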
- Hi everyone! I've tried the quantization code from this notebook in the repo:
https://github.com/urchade/GLiNER/blob/main/examples/convert_to_onnx.ipynb
I followed it step by step, but the quantized model (last cell) does not return any entities, and if the threshold is lowered, texts are labeled incorrectly.
Am I missing something? Is there an additional step required to make the quantized model work?
Thank you all!
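For anyone comparing notes, a minimal sketch of the dynamic-quantization step along the lines of that notebook, using onnxruntime's `quantize_dynamic`. The file paths and the QUInt8 weight type are assumptions, not necessarily the notebook's exact settings:

```python
# Minimal sketch of 8-bit dynamic quantization, assuming the ONNX model has
# already been exported by the earlier cells of the notebook.
# Paths and weight type are placeholders, not the notebook's exact values.
from onnxruntime.quantization import QuantType, quantize_dynamic

quantize_dynamic(
    model_input="gliner_onnx/model.onnx",          # hypothetical path to the exported model
    model_output="gliner_onnx/model_quantized.onnx",
    weight_type=QuantType.QUInt8,                  # 8-bit weights; QInt8 is the other common choice
)

# Sanity check: the quantized graph should at least load and list its inputs.
import onnxruntime as ort

session = ort.InferenceSession("gliner_onnx/model_quantized.onnx")
print([inp.name for inp in session.get_inputs()])
```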