[NeurIPS 2025] Code and experiments for the paper Mysteries of the Deep, which studies how intermediate representations in CLIP, ViT, and SigLIP enable more robust zero-shot OOD detection across datasets.
We use 14 datasets as OOD test data and evaluate 21 widely used models on 8 NLU tasks. Our findings confirm that OOD accuracy in NLP tasks deserves more attention: in every setting, performance drops significantly relative to ID accuracy.