HuggingFaceM4/Idefics3-8B-Llama3 · How to use history prompts on the same image?

Aug 25, 2024

Is it possible to ask several prompts on the same image in a loop with respect to the assistant answer? In the example the messages use different images for each message.

HugoLaurencon

Aug 28, 2024

•

edited Aug 28, 2024

Yes it's totally possible (it was mostly trained this way during the SFT)

MotiHa

Sep 1, 2024

Could you give example?

HugoLaurencon

Sep 1, 2024

It's exactly like the given snippet

messages = [
    {
        "role": "user",
        "content": [
            {"type": "image"},
            {"type": "text", "text": "What do we see in this image?"},
        ]
    },
    {
        "role": "assistant",
        "content": [
            {"type": "text", "text": "In this image, we can see the city of New York, and more specifically the Statue of Liberty."},
        ]
    },
    {
        "role": "user",
        "content": [
            {"type": "image"},
            {"type": "text", "text": "And how about this image?"},
        ]
    },       
]

but you can remove the extra {"type": "image"} where you don't want to put an image