Visual Question Answering (VQA)
https://blog.roboflow.com/what-is-vqa/
It’s taking an image and asking a question about it:
How is VQA evaled? Because there could be multiple variations of the answer that are correct?
I don’t know actually. Someone tell me please. https://chatgpt.com/share/68bf3c68-f8f0-8002-8c03-e2bb99ccc6e4