I want to create a model that takes a screenshot of a web front page and outputs the corresponding HTML and JS code. As you can tell, the "input_ids" will be very long (more than 4096 tokens).
I was thinking of training a BLIP-2 model, but how can I train a model like this efficiently?
Thanks