Pinned
- is-my-problem-new (Public): Semantic search for competitive programming problems
- safety-research/impossiblebench (Public): Official Inspect Implementation for "ImpossibleBench: Measuring LLMs' Propensity of Exploiting Test Cases"
- random_transformers (Public): Official code for "Algorithmic Capabilities of Random Transformers" (NeurIPS 2024)
- WeightWatch (Public): Official Repository of Paper "Watch the Weights: Unsupervised monitoring and control of fine-tuned LLMs"
Python 12