Bayi-Hu
🎯 Focusing

Organizations

@git-disl

Python 1 Updated Aug 24, 2024
Jupyter Notebook 282 116 Updated Apr 25, 2025

A fully functional pump.fun / letsbonk.fun trading and sniping bot not relying on any 3rd party APIs

Python 856 309 Updated Nov 23, 2025

This is the official code for the paper "Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation"

Python 53 4 Updated Feb 2, 2025

[NeurIPS 2024] Fast Best-of-N Decoding via Speculative Rejection

Python 52 8 Updated Oct 29, 2024

[NeurIPS 2024] Efficient LLM Jailbreak via Adaptive Dense-to-sparse Constrained Optimization

Python 7 4 Updated Mar 3, 2025
Python 9 2 Updated Mar 2, 2023

A collection of benchmarks and datasets for evaluating LLMs.

535 34 Updated Jul 13, 2024
Jupyter Notebook 9 2 Updated Jan 2, 2025

A survey on harmful fine-tuning attacks for large language models

227 6 Updated Nov 20, 2025

This is the official code for the paper "Booster: Tackling Harmful Fine-tuning for Large Language Models via Attenuating Harmful Perturbation" (ICLR2025 Oral).

Shell 33 1 Updated Mar 22, 2025

This is the official code for the paper "Lazy Safety Alignment for Large Language Models against Harmful Fine-tuning" (NeurIPS2024)

Python 25 Updated Sep 10, 2024

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Python 896 50 Updated Sep 30, 2025

Trained models & code to predict toxic comments on all 3 Jigsaw Toxic Comment Challenges. Built using ⚡ Pytorch Lightning and 🤗 Transformers. For access to our API, please email us at contact@unita…

Python 1,156 136 Updated Oct 6, 2025

Code accompanying the paper Pretraining Language Models with Human Preferences

Python 180 13 Updated Feb 13, 2024

A toolkit for optimizing machine learning models for practical applications

Python 31 4 Updated Mar 6, 2025

Reference implementation for DPO (Direct Preference Optimization)

Python 2,812 233 Updated Aug 11, 2024

AI-powered pokemon bot on showdown

Python 13 1 Updated Oct 18, 2019

Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.

531 36 Updated Nov 17, 2025

Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM

Jupyter Notebook 1,456 178 Updated Mar 21, 2025

SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks

Python 323 29 Updated Oct 22, 2024

Code and datasets for "Character-LLM: A Trainable Agent for Role-Playing"

Python 601 44 Updated Oct 29, 2024

Repository for the paper "Will GPT-4 Run DOOM?"

Python 24 4 Updated Nov 27, 2024

The first autonomous computer program that can do anything to earn money without human operators.

Python 144 16 Updated Nov 3, 2025

The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvement, and skill curatio…

Python 2,357 232 Updated Nov 7, 2024

The Code Repo for Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization

Python 129 10 Updated Sep 2, 2024

Chat凉宫春日, an open-source role-playing chatbot by Cheng Li, Ziang Leng, and others.

Jupyter Notebook 2,045 181 Updated Aug 13, 2024

[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning

Python 2,996 289 Updated Jan 14, 2025