Fix vits low-precision dtype #35418

jiqing-feng · 2024-12-26T01:36:21Z

This PR fixed vits model dtype when torch_dtype=torch.float16.

To reproduce the error:

import torch
from transformers import pipeline

pipe = pipeline("text-to-speech", model="facebook/mms-tts-eng", torch_dtype=torch.float16)
output = pipe("Hello, my dog is cooler than you!")
print(output)

Traceback:

......
  File "/home/jiqing/transformers/src/transformers/models/vits/modeling_vits.py", line 916, in forward
    query_states = self.q_proj(hidden_states) * self.scaling
  File "/usr/local/lib/python3.10/dist-packages/torch/nn/modules/module.py", line 1736, in _wrapped_call_impl
    return self._call_impl(*args, **kwargs)
  File "/usr/local/lib/python3.10/dist-packages/torch/nn/modules/module.py", line 1747, in _call_impl
    return forward_call(*args, **kwargs)
  File "/usr/local/lib/python3.10/dist-packages/torch/nn/modules/linear.py", line 125, in forward
    return F.linear(input, self.weight, self.bias)
RuntimeError: mat1 and mat2 must have the same dtype, but got Float and Half

Signed-off-by: jiqing-feng <[email protected]>

fix vits dtype

e73927c

Signed-off-by: jiqing-feng <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix vits low-precision dtype #35418

Fix vits low-precision dtype #35418

jiqing-feng commented Dec 26, 2024 •

edited

Loading

Fix vits low-precision dtype #35418

Are you sure you want to change the base?

Fix vits low-precision dtype #35418

Conversation

jiqing-feng commented Dec 26, 2024 • edited Loading

jiqing-feng commented Dec 26, 2024 •

edited

Loading