Format of dataset when fine-tuning with LoRA #239

coffeeBeansz · 2025-03-12T09:37:27Z

coffeeBeansz
Mar 12, 2025

Trying to use the lora.py script for fine-tuning. I have prepared a .json file with the following format:

{
    "images": [<image_path_1>, ...]
    "messages": [
            [
                    {
                          "role": "system", "content": [{"text": <system prompt>, "type": "text"}]
                    },
                    {
                           "role": "user", "content":
                                  [
                                         {"image": <image_path_1>, "type": "image"},
                                         {"text": <question>, "type": "text"}
                                  ]
                    },
                    {
                            "role": "assistant", "content": [{"text": <desired response>, "type": "text"}]
                    }
            ],
            ...
     ]
}

And then I have imported Dataset from dataset and used Dataset.from_dict() to convert it to a huggingface dataset.

My question is if this is the correct format of the data? Should I pass the image in the "messages" as well as in the "images"? Or do I only need to pass it in the "images" field? Also should I pass the local path or a PIL image?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Format of dataset when fine-tuning with LoRA #239

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Replies: 0 comments

Select a reply

Uh oh!

Uh oh!

Format of dataset when fine-tuning with LoRA #239

Uh oh!

Uh oh!

coffeeBeansz Mar 12, 2025

Replies: 0 comments

coffeeBeansz
Mar 12, 2025