-
Notifications
You must be signed in to change notification settings - Fork 317
Pull requests: huggingface/nanotron
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Use per-group initial_lr for constant warmup (fix muP custom-LR groups)
#404
opened Jun 5, 2026 by
lollinng
Loading…
chore: enable Dependabot weekly GitHub Actions bumps
dependabot
#403
opened May 26, 2026 by
hf-dependantbot-rollout
Bot
Loading…
Removed assertion for s3 datasets and handled string and object cases
#381
opened Jul 3, 2025 by
SulRash
Loading…
2 of 6 tasks
Fixed nanoset data stage handling during pretraining
#380
opened Jul 3, 2025 by
SulRash
Loading…
2 of 6 tasks
Fix issue while running tiny llama script on ADA 4000 gpu
#379
opened Jul 2, 2025 by
chetandhembre
Loading…
2 of 6 tasks
Extra name argument to select configuration of hf dataset
#378
opened Jun 30, 2025 by
SulRash
Loading…
1 of 6 tasks
[feature] Add debug_dataloader_samples utility to preview decoded dataloader samples (#184)
#368
opened May 26, 2025 by
garongkim
Loading…
6 tasks
Previous Next
ProTip!
Updated in the last three days: updated:>2026-06-07.