About 161,000 results
Open links in new tab
  1. Evaluating the Robustness of Neural Networks: An Extreme Value...

    Feb 15, 2018 · Our analysis yields a novel robustness metric called CLEVER, which is short for Cross Lipschitz Extreme Value for nEtwork Robustness. The proposed CLEVER score is …

  2. 579 In this paper, we have proposed a novel counter- factual framework CLEVER for debiasing fact- checking models. Unlike existing works, CLEVER is augmentation-free and mitigates …

  3. te the CLEVER scores for the same set of images and attack targets. To the best of our knowledge, CLEVER is the first attack-independent robustness score that is capable of …

  4. Leaving the barn door open for Clever Hans: Simple features …

    Dec 31, 2023 · The integrity of AI benchmarks is fundamental to accurately assess the capabilities of AI systems. The internal validity of these benchmarks - i.e., making sure they …

  5. Abstract Transformer-based large language models (LLMs) provide a powerful foundation for natural language tasks in large-scale customer-facing applications. However, studies that …

  6. Weakly-Supervised Affordance Grounding Guided by Part-Level...

    Jan 22, 2025 · In this work, we focus on the task of weakly supervised affordance grounding, where a model is trained to identify affordance regions on objects using human-object …

  7. en prediction objectives for basic graph navigation tasks. In particular, 114 the work identifies a Clever-Hans cheat based on shortcuts in teacher forced training similar to theo- 15 retical …

  8. Submissions | OpenReview

    Jan 22, 2025 · Leaving the barn door open for Clever Hans: Simple features predict LLM benchmark answers Lorenzo Pacchiardi, Marko Tesic, Lucy G Cheke, Jose Hernandez-Orallo …

  9. Learnable Representative Coefficient Image Denoiser for...

    Dec 31, 2023 · Fully characterizing the spatial-spectral priors of hyperspectral images (HSIs) is crucial for HSI denoising tasks. Recently, HSI denoising models based on representative …

  10. LLaVA-OneVision: Easy Visual Task Transfer | OpenReview

    Feb 9, 2025 · We present LLaVA-OneVision, a family of open large multimodal models (LMMs) developed by consolidating our insights into data, models, and visual representations in the …