Evaluating the Robustness of Neural Networks: An Extreme Value...
Feb 15, 2018 · Our analysis yields a novel robustness metric called CLEVER, which is short for Cross Lipschitz Extreme Value for nEtwork Robustness. The proposed CLEVER score is …
579 In this paper, we have proposed a novel counter- factual framework CLEVER for debiasing fact- checking models. Unlike existing works, CLEVER is augmentation-free and mitigates …
te the CLEVER scores for the same set of images and attack targets. To the best of our knowledge, CLEVER is the first attack-independent robustness score that is capable of …
Leaving the barn door open for Clever Hans: Simple features …
Dec 31, 2023 · The integrity of AI benchmarks is fundamental to accurately assess the capabilities of AI systems. The internal validity of these benchmarks - i.e., making sure they …
Abstract Transformer-based large language models (LLMs) provide a powerful foundation for natural language tasks in large-scale customer-facing applications. However, studies that …
Weakly-Supervised Affordance Grounding Guided by Part-Level...
Jan 22, 2025 · In this work, we focus on the task of weakly supervised affordance grounding, where a model is trained to identify affordance regions on objects using human-object …
en prediction objectives for basic graph navigation tasks. In particular, 114 the work identifies a Clever-Hans cheat based on shortcuts in teacher forced training similar to theo- 15 retical …
Submissions | OpenReview
Jan 22, 2025 · Leaving the barn door open for Clever Hans: Simple features predict LLM benchmark answers Lorenzo Pacchiardi, Marko Tesic, Lucy G Cheke, Jose Hernandez-Orallo …
Learnable Representative Coefficient Image Denoiser for...
Dec 31, 2023 · Fully characterizing the spatial-spectral priors of hyperspectral images (HSIs) is crucial for HSI denoising tasks. Recently, HSI denoising models based on representative …
LLaVA-OneVision: Easy Visual Task Transfer | OpenReview
Feb 9, 2025 · We present LLaVA-OneVision, a family of open large multimodal models (LMMs) developed by consolidating our insights into data, models, and visual representations in the …