Skip to content
@sunblaze-ucb

sunblaze-ucb

Popular repositories Loading

  1. cybergym cybergym Public

    CyberGym is a large-scale, high-quality cybersecurity evaluation framework designed to rigorously assess the capabilities of AI agents on real-world vulnerability analysis tasks.

    Python 412 59

  2. Intuitor Intuitor Public

    [ICLR 2026] Learning to Reason without External Rewards

    Python 409 44

  3. rl-generalization rl-generalization Public

    Modifiable OpenAI Gym environments for studying generalization in RL

    Python 90 14

  4. verina verina Public

    Verina (Verifiable Code Generation Arena) is a high-quality benchmark enabling a comprehensive and modular evaluation of code, specification, and proof generation as well as their compositions.

    Lean 72 11

  5. blackbox-attacks blackbox-attacks Public

    Code used in 'Exploring the Space of Black-box Attacks on Deep Neural Networks' (https://arxiv.org/abs/1712.09491)

    Python 63 13

  6. dpml-benchmark dpml-benchmark Public

    This repository contains the codes for first large-scale investigation of Differentially Private Convex Optimization algorithms.

    Python 63 18

Repositories

Showing 10 of 52 repositories

Top languages

Loading…

Most used topics

Loading…