Conversation

@ParagEkbote

Refs #2310

As described in the open tasks for benchmarking the LoRA methods, I have added a dataloader to the data.py script. Before integrating it in run.py, I have a few queries:

  1. Is default_data_collator ideal for benchmarking, or is something more advanced like DataCollatorWithPadding preferred? (A rough sketch of the two options is included below this list.)

  2. Are any tests needed to check for improvements in throughput or memory usage?
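
For reference, here is a rough sketch of the two collator options I have in mind; the `ds` and `tokenizer` names below are placeholders for whatever data.py already sets up, not the actual variable names in the script:

```python
# Rough sketch (not part of the PR) of the two collator options, assuming a
# tokenized dataset `ds` and a `tokenizer` are already available in data.py.
from torch.utils.data import DataLoader
from transformers import DataCollatorWithPadding, default_data_collator

# Option 1: default_data_collator simply stacks features, so every example
# must already be padded/truncated to the same length during preprocessing.
static_loader = DataLoader(ds, batch_size=8, collate_fn=default_data_collator)

# Option 2: DataCollatorWithPadding pads dynamically to the longest sequence
# in each batch, which usually wastes fewer tokens on variable-length data.
dynamic_collator = DataCollatorWithPadding(tokenizer=tokenizer)
dynamic_loader = DataLoader(ds, batch_size=8, collate_fn=dynamic_collator)
```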

Could you please review?

cc: @BenjaminBossan

@BenjaminBossan
Member

Thanks for the PR. Note that this may not be quite as easy to do anymore, since we added a BucketIterator for more efficient batching:

class BucketIterator:

Honestly, I'm not sure if adding the standard torch DataLoader is still very useful at this point. It does have some nice features like parallelization, but that's really not a huge deal for this dataset. And when it comes to padding, I remember it was a bit tricky to get completely right: the padding is currently done manually in multiple places. Possibly this could be helped with a better collator, but it would require thorough testing to ensure that the current logic is unchanged, and we don't have unit tests for this script.
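
For illustration only (this is not the actual BucketIterator implementation), the basic idea of bucketing is to group examples of similar length before batching, so that the padding needed per batch stays small:

```python
# Illustrative sketch only; the real BucketIterator in the benchmark script
# differs in detail. Grouping examples of similar length keeps the amount of
# padding per batch small.
def length_bucketed_batches(examples, batch_size):
    # examples: a list of dicts that each contain an "input_ids" sequence
    ordered = sorted(examples, key=lambda ex: len(ex["input_ids"]))
    for start in range(0, len(ordered), batch_size):
        yield ordered[start : start + batch_size]
```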

You could give it a try and see if you can simplify/improve the training code this way, but I think the expected value of working on this wouldn't be too big.

@ParagEkbote
Author

So, among the open tasks mentioned, which of the pending items are considered higher priority?

cc: @BenjaminBossan

@BenjaminBossan
Member

Hmm, good question. Perhaps it would be a good idea to remove the DataLoader item from the list and mention that if anyone wants to contribute, they should open an issue on PEFT first to ask for feedback.

Among the open tasks, I think that 1. ensuring that we get similar results with Trainer and 2. investigating AMP would be the most important. But these aren't really contributions to the code base; they're more about testing things out. Depending on the results, there might be follow-up work that leads to a code contribution, but there is no guarantee.

Another way to contribute would be by adding experiment settings, see the discussions at the end of #2310. This requires access to hardware that can run the experiments at a decent speed.

@ParagEkbote
Author

Can the vLLM addition and the clean-up of prints also be considered testing issues? Would you like me to open a separate PR specifying in the README which items are test issues and which are code issues?

cc: @BenjaminBossan

@BenjaminBossan
Member

Can the vLLM addition and the clean-up of prints also be considered testing issues?

Regarding vLLM, over time we have sped up generation quite a bit, so it's less of an issue now. As mentioned, for testing it's still a bit on the slow side. I think the main value of adding vLLM would be more of a learning experience; I'm not sure it's worth adding it as a dependency.

Regarding the prints, it's not super high value, and I'd have to check how best to stream them; it's more of a personal judgment call.

Would you like me to open a separate PR specifying in the README which items are test issues and which are code issues?

You could do a PR with that change, yes. Please also remove the DataLoader item and add a sentence about first opening an issue before starting the work. Thanks!

@ParagEkbote marked this pull request as a draft on December 17, 2025 at 10:52