1mocat
🍸
Enjoy


noob

Python 1 Updated Jun 9, 2026

This is the official code for "Black-box Optimization of LLM Outputs by Asking for Directions"

Python 8 3 Updated Oct 21, 2025

TRACE: Trajectory Recovery for Continuous Mechanism Evolution in Causal Representation Learning

Python 3 Updated May 4, 2026

[ICLR 2026] Official implementation for "RedCodeAgent: Automatic Red-teaming Agent against Diverse Code Agents"

Python 11 2 Updated Apr 24, 2026

[NeurIPS'24] RedCode: Risky Code Execution and Generation Benchmark for Code Agents

Python 81 11 Updated Apr 24, 2026

Official Repository for The Paper: Safety Alignment Should Be Made More Than Just a Few Tokens Deep

Python 183 13 Updated Apr 23, 2025
Python 4 Updated Aug 24, 2025

Source code for the paper "MACRec: A Multi-Agent Collaboration Framework for Recommendation" at SIGIR 2024

Python 117 11 Updated Feb 7, 2026

Yunjue Agent: A Fully Reproducible, Zero-Start In-Situ Self-Evolving Agent System for Open-Ended Tasks

Python 502 54 Updated Mar 6, 2026

Create beautiful slides on the web using a coding agent's frontend skills

JavaScript 21,719 1,775 Updated Jun 13, 2026
Python 26 5 Updated Sep 7, 2025

Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation.

Python 883 71 Updated Jun 11, 2026
Python 352 55 Updated Mar 26, 2026

An Open Foundation Model and Benchmark to Accelerate Generative Recommendation

Python 820 117 Updated May 18, 2026

[ICLR 2026] Taming large-scale few-step training with self-adversarial flows! 👏🏻

Python 533 27 Updated Feb 24, 2026

[ICLR 2024] The official implementation of our ICLR 2024 paper "AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language Models".

Python 445 63 Updated Jan 22, 2025
Jupyter Notebook 15 1 Updated Sep 24, 2025

This repository provides a benchmark for prompt injection attacks and defenses in LLMs

Python 459 73 Updated Oct 29, 2025

[ICML 2025] UDora: A Unified Red Teaming Framework against LLM Agents

Python 37 7 Updated Jun 24, 2025

Agentic AI research papers, benchmarks, frameworks, and tools curated across 24 domains.

150 4 Updated Jun 13, 2026

Comprehensive Assessment of Trustworthiness in Multimodal Foundation Models

Jupyter Notebook 29 2 Updated Mar 15, 2025

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 21,970 4,076 Updated Jun 15, 2026

A modern cookiecutter template for deep learning projects with PyTorch Lightning that uses uv for dependency management

Python 38 11 Updated Aug 10, 2025

Build and deploy stateful agents across federated resources

Python 102 14 Updated Jun 11, 2026

Evaluating Agent Safety in Realistic, High-Risk Simulations

Python 29 17 Updated Nov 15, 2025

A benchmark for LLMs on complicated tasks in the terminal

Python 2,356 542 Updated Jan 22, 2026

R-Judge: Benchmarking Safety Risk Awareness for LLM Agents (EMNLP Findings 2024)

Python 105 9 Updated Jan 11, 2026
Python 22 2 Updated Jun 18, 2025

🔮Reasoning for Safer Code Generation; 🥇Winner Solution of Amazon Nova AI Challenge 2025

Python 39 3 Updated Aug 24, 2025