Bayi-Hu
🎯
Focusing

Organizations

@git-disl

79 results for source starred repositories
Python 1 Updated Aug 24, 2024
Jupyter Notebook 291 118 Updated Apr 25, 2025

A fully functional pump.fun / letsbonk.fun trading and sniping bot not relying on any 3rd party APIs

Python 889 317 Updated Nov 23, 2025

This is the official code for the paper "Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation"

Python 53 4 Updated Feb 2, 2025

[NeurIPS 2024] Fast Best-of-N Decoding via Speculative Rejection

Python 55 8 Updated Oct 29, 2024

[NeurIPS 2024] Efficient LLM Jailbreak via Adaptive Dense-to-sparse Constrained Optimization

Python 7 4 Updated Mar 3, 2025
Python 9 2 Updated Mar 2, 2023

A collection of benchmarks and datasets for evaluating LLMs.

550 33 Updated Jul 13, 2024
Jupyter Notebook 9 2 Updated Jan 2, 2025

A survey on harmful fine-tuning attacks for large language models

232 7 Updated Jan 9, 2026

This is the official code for the paper "Booster: Tackling Harmful Fine-tuning for Large Language Models via Attenuating Harmful Perturbation" (ICLR2025 Oral).

Shell 36 1 Updated Mar 22, 2025

This is the official code for the paper "Lazy Safety Alignment for Large Language Models against Harmful Fine-tuning" (NeurIPS2024)

Python 25 Updated Sep 10, 2024

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Python 905 49 Updated Sep 30, 2025

Trained models & code to predict toxic comments on all 3 Jigsaw Toxic Comment Challenges. Built using ⚡ Pytorch Lightning and 🤗 Transformers. For access to our API, please email us at contact@unita…

Python 1,182 138 Updated Jan 5, 2026

Code accompanying the paper Pretraining Language Models with Human Preferences

Python 180 13 Updated Feb 13, 2024

A toolkit for optimizing machine learning models for practical applications

Python 31 4 Updated Mar 6, 2025

Reference implementation for DPO (Direct Preference Optimization)

Python 2,849 234 Updated Aug 11, 2024

AI-powered pokemon bot on showdown

Python 13 1 Updated Oct 18, 2019

Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.

543 35 Updated Nov 17, 2025

Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM

Jupyter Notebook 1,463 177 Updated Mar 21, 2025

SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks

Python 324 30 Updated Oct 22, 2024

Code and datasets for "Character-LLM: A Trainable Agent for Role-Playing"

Python 610 46 Updated Oct 29, 2024

Repository for the paper "Will GPT-4 Run DOOM?"

Python 24 4 Updated Nov 27, 2024

The first autonomous computer program that can do anything to earn money without human operators.

Python 151 16 Updated Nov 3, 2025

The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvement, and skill curatio…

Python 2,444 253 Updated Nov 7, 2024

The Code Repo for Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization

Python 128 10 Updated Sep 2, 2024

Chat凉宫春日, an open-source role-playing chatbot by Cheng Li, Ziang Leng, and others.

Jupyter Notebook 2,059 182 Updated Aug 13, 2024

[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning

Python 3,058 297 Updated Jan 14, 2025