Fine tuning kosmos-2 #1562

FarzanRahmani · 2024-05-23T11:08:27Z

Hi @pengzhiliang. I want to finetune kosmos-2 on a VQA task that answer is a single word (like a multi-class classification task) and I call this single word label. I only have question answer pairs but not bounding boxes. I was wondering that I should use <grounding> or not. I mean should I use <grounding> Question: Are there any <phrase>cats</phrase> in the image? Answer: label or Question: Are there any <phrase>cats</phrase> in the image? Answer: label. I am using Kosmos2ForConditionalGeneration.
and another question: is it rational to use Kosmos2ForConditionalGeneration for fine tuning or not?

The text was updated successfully, but these errors were encountered:

pengzhiliang · 2024-06-20T13:34:53Z

Thank you for your patience. @FarzanRahmani
If your downstream task does not involve bounding boxes, there's no need to use .
You can use it like this:
Question: {question} Answer: {answer}
or
Question: {question} Answer the question using a single word or phrase. Answer: {answer}

FarzanRahmani · 2024-06-21T11:22:18Z

Thanks for your attention and answer. @pengzhiliang

FarzanRahmani closed this as completed Jun 21, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fine tuning kosmos-2 #1562

Fine tuning kosmos-2 #1562

FarzanRahmani commented May 23, 2024

pengzhiliang commented Jun 20, 2024

FarzanRahmani commented Jun 21, 2024

Fine tuning kosmos-2 #1562

Fine tuning kosmos-2 #1562

Comments

FarzanRahmani commented May 23, 2024

pengzhiliang commented Jun 20, 2024

FarzanRahmani commented Jun 21, 2024