Rishubi
  • Tsinghua University


Reproducing R1 for Code with Reliable Rewards

Python 11 2 Updated Apr 9, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,930 286 Updated May 15, 2025

DataSciBench: An LLM Agent Benchmark for Data Science

Python 37 3 Updated Sep 1, 2025

Official PyTorch implementation for "Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data" (ICLR 2025)

Python 66 7 Updated May 30, 2025

[AAAI'2025] Official PyTorch implementation of the paper "Identity-Text Video Corpus Grounding".

Python 9 1 Updated Mar 7, 2025

Python3 control flow graph generator

Python 205 35 Updated Aug 7, 2022
Python 246 18 Updated Aug 18, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 61,849 7,480 Updated Nov 5, 2025

Paragraph-by-paragraph close readings of classic and new deep learning papers

31,846 2,738 Updated Mar 22, 2025
Python 11 1 Updated May 28, 2024

SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]

Python 17,707 1,864 Updated Nov 3, 2025

A curated list of awesome Deep Reinforcement Learning resources.

826 79 Updated Jul 13, 2025

OpenChat: Advancing Open-source Language Models with Imperfect Data

Python 5,439 428 Updated Sep 13, 2024

Easy-to-use and high-performance NLP and LLM framework based on MindSpore, compatible with models and datasets of 🤗Huggingface.

Jupyter Notebook 895 257 Updated Nov 5, 2025

Community for applying LLMs to robotics and a robot simulator with ChatGPT integration

Python 2,056 215 Updated Jan 20, 2024

Teaching Assistants Competency (助教素养)

8 1 Updated Nov 28, 2024

Python Implementation of Reinforcement Learning: An Introduction

Python 14,394 4,957 Updated Aug 9, 2024

MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDING

Python 89 12 Updated Mar 24, 2024

CTF framework and exploit development library

Python 13,018 1,783 Updated Oct 27, 2025

Playing Hollow Knight with reinforcement learning.

Python 102 14 Updated Aug 14, 2023

Instruction Tuning with GPT-4

HTML 4,337 306 Updated Jun 11, 2023

Existing Literature about Machine Unlearning

922 112 Updated Aug 29, 2025

Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"

Python 1,444 88 Updated May 31, 2023

Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks

Python 209 19 Updated Jan 13, 2024

Monte Carlo tree search in JAX

Python 2,552 209 Updated Sep 2, 2025

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 22,892 2,996 Updated Aug 15, 2024

A modular RL library to fine-tune language models to human preferences

Python 2,362 203 Updated Mar 1, 2024

LaTeX Thesis Template for Tsinghua University

TeX 5,019 1,128 Updated Oct 19, 2025