Skip to content
View Rishubi's full-sized avatar
  • Tsinghua University

Block or report Rishubi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Reproducing R1 for Code with Reliable Rewards

Python 12 2 Updated Apr 9, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,972 285 Updated May 15, 2025

DataSciBench: An LLM Agent Benchmark for Data Science

Python 55 8 Updated Jan 21, 2026

Official PyTorch implementation for "Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data" (ICLR 2025)

Python 82 10 Updated May 30, 2025

[AAAI'2025] Official PyTorch implementation of the paper "Identity-Text Video Corpus Grounding".

Python 10 1 Updated Dec 17, 2025

Python3 control flow graph generator

Python 213 36 Updated Aug 7, 2022
Python 311 22 Updated Aug 18, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 70,099 8,571 Updated Apr 12, 2026

深度学习经典、新论文逐段精读

32,865 2,782 Updated Mar 22, 2025
Python 11 2 Updated May 28, 2024

SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]

Python 18,986 2,051 Updated Apr 13, 2026

A curated list of awesome Deep Reinforcement Learning resources.

884 84 Updated Jul 13, 2025

OpenChat: Advancing Open-source Language Models with Imperfect Data

Python 5,478 435 Updated Sep 13, 2024

MindSpore + 🤗Huggingface: Run any Transformers/Diffusers model on MindSpore with seamless compatibility and acceleration.

Python 917 272 Updated Mar 8, 2026

Community for applying LLMs to robotics and a robot simulator with ChatGPT integration

Python 2,094 221 Updated Jan 20, 2024

Teaching Assistants Competency 助教素养

8 1 Updated Mar 4, 2026

Python Implementation of Reinforcement Learning: An Introduction

Python 14,620 4,955 Updated Aug 9, 2024

MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDING

Python 89 11 Updated Mar 24, 2024

CTF framework and exploit development library

Python 13,367 1,827 Updated Apr 14, 2026

Playing Hollow Knight with reinforcement learning.

Python 117 15 Updated Aug 14, 2023

Instruction Tuning with GPT-4

HTML 4,334 309 Updated Jun 11, 2023

Existing Literature about Machine Unlearning

961 118 Updated Aug 29, 2025

Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"

Python 1,480 92 Updated May 31, 2023

Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks

Python 210 19 Updated Jan 13, 2024

Monte Carlo tree search in JAX

Python 2,608 209 Updated Sep 2, 2025

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 24,146 3,211 Updated Aug 15, 2024

A modular RL library to fine-tune language models to human preferences

Python 2,385 203 Updated Mar 1, 2024

LaTeX Thesis Template for Tsinghua University

TeX 5,251 1,144 Updated Apr 4, 2026
Next