Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fine tuning kosmos-2 #1562

Closed
FarzanRahmani opened this issue May 23, 2024 · 2 comments
Closed

Fine tuning kosmos-2 #1562

FarzanRahmani opened this issue May 23, 2024 · 2 comments

Comments

@FarzanRahmani
Copy link

Hi @pengzhiliang. I want to finetune kosmos-2 on a VQA task that answer is a single word (like a multi-class classification task) and I call this single word label. I only have question answer pairs but not bounding boxes. I was wondering that I should use <grounding> or not. I mean should I use <grounding> Question: Are there any <phrase>cats</phrase> in the image? Answer: label or Question: Are there any <phrase>cats</phrase> in the image? Answer: label. I am using Kosmos2ForConditionalGeneration.
and another question: is it rational to use Kosmos2ForConditionalGeneration for fine tuning or not?

@pengzhiliang
Copy link
Contributor

Thank you for your patience. @FarzanRahmani
If your downstream task does not involve bounding boxes, there's no need to use .
You can use it like this:
Question: {question} Answer: {answer}
or
Question: {question} Answer the question using a single word or phrase. Answer: {answer}

@FarzanRahmani
Copy link
Author

Thanks for your attention and answer. @pengzhiliang

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants