You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I observed the 'nan' loss when using 'RTX A6000 ada' as gpu and attempting to train the ByteFormer by using the config file 'examples/byteformer/imagenet_file_encodings/encoding_type=TIFF.yaml'.
There were still observed nan loss when changing the gpu device to 'RTX 4090'.
I wonder whether you didn't see the 'nan' loss when training the ByteFormer using ImageNet as training set.