Issues: Lightning-AI/pytorch-lightning

Issues list

Gradient checkpointing and ddp do not work together (labels: bug, needs triage, ver: 2.4.x)
#20395 opened Nov 4, 2024 by rubenweitzman
Error if SLURM_NTASKS != SLURM_NTASKS_PER_NODE (labels: bug, needs triage, ver: 2.4.x)
#20391 opened Nov 4, 2024 by guarin
Major performance degradation when multiple metrics/losses (labels: bug, needs triage, ver: 2.4.x)
#20388 opened Nov 3, 2024 by EtayLivne
FSDP with HYBRID_SHARD loss doesn't improve with more nodes (labels: bug, needs triage, ver: 2.4.x)
#20385 opened Nov 2, 2024 by zaptrem
Custom TQDMProgressBar changes not reflected (labels: bug, needs triage, ver: 2.4.x)
#20384 opened Nov 1, 2024 by oseymour
Optimize fit_loop() to reduce train_dataloader()'s memory footprint (labels: feature, needs triage)
#20382 opened Nov 1, 2024 by guillaume-rochette-oxb
Fabric and FFCV? (labels: feature, needs triage)
#20380 opened Nov 1, 2024 by richardrl
Deepspeed ZERO MiCS support (labels: feature, needs triage)
#20378 opened Oct 31, 2024 by hehepig4
Custom Subcommand without Model arg (labels: bug, needs triage, ver: 2.4.x)
#20374 opened Oct 29, 2024 by enrico-stauss
FSDP checkpoint loading fails (labels: bug, needs triage, ver: 2.4.x)
#20373 opened Oct 29, 2024 by Nilabhra
metrics csv in ddp mode (labels: bug, needs triage, ver: 2.2.x)
#20371 opened Oct 29, 2024 by ruyanyinian
Wandb 1.x step handling (labels: feature, needs triage)
#20368 opened Oct 28, 2024 by edmcman
Training stuck at the first iter can't get corresponding pid (labels: bug, needs triage)
#20367 opened Oct 28, 2024 by yejr0229
Tuner.scale_batch_size(max_val=1024) (labels: feature, needs triage)
#20364 opened Oct 24, 2024 by edmcman
Resume training from checkpoints (labels: docs, needs triage)
#20361 opened Oct 23, 2024 by ArkashJ
load data sequence is confusing (labels: bug, needs triage, ver: 2.4.x)
#20360 opened Oct 22, 2024 by workhours
load data sequence is confusing (labels: bug, needs triage, ver: 2.4.x)
#20359 opened Oct 22, 2024 by workhours
load data sequence is confusing (labels: bug, needs triage, ver: 2.4.x)
#20358 opened Oct 22, 2024 by workhours
Type annotation for BasePredictionWriter subclass (labels: bug, needs triage, ver: 2.4.x)
#20356 opened Oct 22, 2024 by saiden89
LearningRateFinder creates errors for schedulers in val stage (labels: bug, needs triage, ver: 2.4.x)
#20355 opened Oct 21, 2024 by DeanLa
Gradient accumulation calculation may be incorrect (labels: bug, needs triage, ver: 2.4.x)
#20350 opened Oct 19, 2024 by tyler-rt
Add support S3 as a storage option for profiling results (labels: feature, needs triage)
#20348 opened Oct 18, 2024 by kimminw00
Can't resume automatically a job, ckpt_path="hpc" throws ValueError from the start (labels: bug, needs triage, ver: 2.4.x)
#20347 opened Oct 18, 2024 by F-Barto
tensorboard step and self.global_step do not correspond under accumulate_grad (labels: bug, needs triage, ver: 2.4.x)
#20346 opened Oct 18, 2024 by wuzhiyue111