Issues: Lightning-AI/pytorch-lightning
Issues list
Everything prints fine, but the loss doesn't descend
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.3.x
#20344
opened Oct 15, 2024 by
2catycm
Improve how argument passing via CLI and config file is handled with regard to argument linking
feature
Is an improvement or enhancement
needs triage
Waiting to be triaged by maintainers
#20341
opened Oct 14, 2024 by
MrWhatZitToYaa
DDP and BackboneFinetuning: model weights get out of sync when unfreezing layers for training
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20340
opened Oct 13, 2024 by
ksikka
PyTorchProfiler: not showing CPU memory used even with profile_memory=True
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20339
opened Oct 13, 2024 by
Jack12xl
restore_training_state before on_fit_start?
feature
Is an improvement or enhancement
needs triage
Waiting to be triaged by maintainers
#20338
opened Oct 12, 2024 by
lampuiho
LightningCLI doesn't fail when config.yaml contains invalid arguments
bug
#20337
opened Oct 11, 2024 by
adosar
Unreadable font color theme of YAML files
docs
Documentation related
needs triage
Waiting to be triaged by maintainers
#20335
opened Oct 10, 2024 by
MrWhatZitToYaa
Stream outputs from Trainer.predict()
feature
Is an improvement or enhancement
needs triage
Waiting to be triaged by maintainers
#20334
opened Oct 10, 2024 by
Turakar
Add a Chinese version of README
docs
Documentation related
needs triage
Waiting to be triaged by maintainers
#20332
opened Oct 10, 2024 by
nocoding03
Support A Variable Number of Batches
feature
Is an improvement or enhancement
needs triage
Waiting to be triaged by maintainers
#20330
opened Oct 9, 2024 by
e-yi
DeepSpeed Strategy doesn't set num_checkpoints while using activation partitions
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20329
opened Oct 9, 2024 by
Gforky
RuntimeError when running basic GAN model (from tutorial at lightning.ai) with DDP
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20328
opened Oct 9, 2024 by
pranavrao-qure
Custom PyTorch BatchSampler does not work well with PyTorch Lightning
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20326
opened Oct 9, 2024 by
dadwadw233
Add list to torch.Tensor injection in yaml config
feature
Is an improvement or enhancement
needs triage
Waiting to be triaged by maintainers
#20324
opened Oct 7, 2024 by
fguiotte
best-k-metrics in ModelCheckpoint
feature
Is an improvement or enhancement
needs triage
Waiting to be triaged by maintainers
#20321
opened Oct 5, 2024 by
gonzachiar
Import error on shutdown/KeyboardInterrupt if run from Jupyter Lab notebook cell
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20317
opened Oct 3, 2024 by
asigalov61
Model Checkpointing + FSDP causes Cuda OOM
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.3.x
#20312
opened Oct 1, 2024 by
profPlum
Save save_hyperparameters no longer respects linked arguments.
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.3.x
ver: 2.4.x
#20311
opened Sep 30, 2024 by
Erotemic
hparams not loaded when loading checkpoint via LightningCLI
bug
#20310
opened Sep 30, 2024 by
YouRik
The problem shows: version incompatibility from v1.3.x to v2.4
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20308
opened Sep 27, 2024 by
sunhan3787
Trainer's .init_module() context does not initialize model on target device
bug
#20307
opened Sep 27, 2024 by
jin-zhe
NCCL backend fails during multi-node, multi-GPU training
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20306
opened Sep 26, 2024 by
raketenolli
the example that shows "The LightningModule also has access to the Hyperparameters" is not correct
docs
Documentation related
needs triage
Waiting to be triaged by maintainers
#20303
opened Sep 26, 2024 by
XinleiRen
RichProgressBar: refresh_rate doesn't affect metric_component
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20300
opened Sep 24, 2024 by
marios1861
Inconsistent memory usage compared to Hugging Face Trainer when using DeepSpeed
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20299
opened Sep 24, 2024 by
mickeysun0104