-
Notifications
You must be signed in to change notification settings - Fork 3.4k
Issues: Lightning-AI/pytorch-lightning
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Gradient checkpointing and ddp do not work together
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20395
opened Nov 4, 2024 by
rubenweitzman
Error if SLURM_NTASKS != SLURM_NTASKS_PER_NODE
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20391
opened Nov 4, 2024 by
guarin
Major performance degradation when multiple metrics/losses
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20388
opened Nov 3, 2024 by
EtayLivne
FSDP with HYBRID_SHARD loss doesn't improve with more nodes
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20385
opened Nov 2, 2024 by
zaptrem
Custom TQDMProgressBar changes not reflected
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20384
opened Nov 1, 2024 by
oseymour
Optimize Is an improvement or enhancement
needs triage
Waiting to be triaged by maintainers
fit_loop()
to reduce train_dataloader()
's memory footprint
feature
#20382
opened Nov 1, 2024 by
guillaume-rochette-oxb
Fabric and FFCV?
feature
Is an improvement or enhancement
needs triage
Waiting to be triaged by maintainers
#20380
opened Nov 1, 2024 by
richardrl
Deepspeed ZERO MiCS support
feature
Is an improvement or enhancement
needs triage
Waiting to be triaged by maintainers
#20378
opened Oct 31, 2024 by
hehepig4
Custom Subcommand without Model arg
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20374
opened Oct 29, 2024 by
enrico-stauss
FSDP checkpoint loading fails
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20373
opened Oct 29, 2024 by
Nilabhra
metrics csv in ddp mode
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.2.x
#20371
opened Oct 29, 2024 by
ruyanyinian
FutureWarning: Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
torch.cuda.amp.custom_bwd(args...)
is deprecated. Please use torch.amp.custom_bwd(args..., device_type='cuda')
instead.
bug
#20370
opened Oct 28, 2024 by
loretoparisi
Wandb 1.x step handling
feature
Is an improvement or enhancement
needs triage
Waiting to be triaged by maintainers
#20368
opened Oct 28, 2024 by
edmcman
Training stuck at the first iter can't get corresponding pid
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
#20367
opened Oct 28, 2024 by
yejr0229
Tuner.scale_batch_size(max_val=1024)
feature
Is an improvement or enhancement
needs triage
Waiting to be triaged by maintainers
#20364
opened Oct 24, 2024 by
edmcman
Resume training from checkpoints
docs
Documentation related
needs triage
Waiting to be triaged by maintainers
#20361
opened Oct 23, 2024 by
ArkashJ
load data sequence is confusing
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20360
opened Oct 22, 2024 by
workhours
load data sequence is confusing
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20359
opened Oct 22, 2024 by
workhours
load data sequence is confusing
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20358
opened Oct 22, 2024 by
workhours
Type annotation for Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
BasePredictionWriter
subclass
bug
#20356
opened Oct 22, 2024 by
saiden89
LearningRateFinder creates errors for schedulers in Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
val
stage
bug
#20355
opened Oct 21, 2024 by
DeanLa
Gradient accumulation calcluation may be incorrect
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20350
opened Oct 19, 2024 by
tyler-rt
Add support S3 as a storage option for profiling results
feature
Is an improvement or enhancement
needs triage
Waiting to be triaged by maintainers
#20348
opened Oct 18, 2024 by
kimminw00
Can't resume automatically a job, ckpt_path="hpc" throws ValueError from the start
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20347
opened Oct 18, 2024 by
F-Barto
tensorboard step and self.global_step do not correspond under accumulate_grad
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20346
opened Oct 18, 2024 by
wuzhiyue111
Previous Next
ProTip!
Exclude everything labeled
bug
with -label:bug.