Rishubi

Xiao Xia Rishubi

63 followers · 50 following

Tsinghua University

Achievements

Stars

KodCode-AI / code-r1

Forked from ganler/code-r1

Reproducing R1 for Code with Reliable Rewards

Python 12 2 Updated Apr 9, 2025

deepseek-ai / open-infra-index

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,976 286 Updated May 15, 2025

THUDM / DataSciBench

DataSciBench: An LLM Agent Benchmark for Data Science

Python 55 8 Updated Jan 21, 2026

ML-GSAI / RADD

Official PyTorch implementation for "Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data" (ICLR 2025)

Python 82 10 Updated May 30, 2025

huangb23 / Identity-Text-Video-Corpus-Grounding

[AAAI'2025] Official PyTorch implementation of the paper "Identity-Text Video Corpus Grounding".

Python 10 1 Updated Dec 17, 2025

coetaur0 / staticfg

Python3 control flow graph generator

Python 213 36 Updated Aug 7, 2022

THUDM / Android-Lab

Python 312 22 Updated Aug 18, 2025

hiyouga / LlamaFactory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 70,228 8,597 Updated Apr 12, 2026

mli / paper-reading

深度学习经典、新论文逐段精读

32,878 2,783 Updated Mar 22, 2025

SenseLLM / ReflectionCoder

Python 11 2 Updated May 28, 2024

SWE-agent / SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]

Python 19,000 2,050 Updated Apr 13, 2026

kengz / awesome-deep-rl

A curated list of awesome Deep Reinforcement Learning resources.

885 84 Updated Jul 13, 2025

imoneoi / openchat

OpenChat: Advancing Open-source Language Models with Imperfect Data

Python 5,478 436 Updated Sep 13, 2024

mindspore-lab / mindnlp

MindSpore + 🤗Huggingface: Run any Transformers/Diffusers model on MindSpore with seamless compatibility and acceleration.

Python 917 272 Updated Mar 8, 2026

microsoft / PromptCraft-Robotics

Community for applying LLMs to robotics and a robot simulator with ChatGPT integration

Python 2,096 221 Updated Jan 20, 2024

THUCSTAC / TAC

Teaching Assistants Competency 助教素养

8 1 Updated Mar 4, 2026

ShangtongZhang / reinforcement-learning-an-introduction

Python Implementation of Reinforcement Learning: An Introduction

Python 14,624 4,956 Updated Aug 9, 2024

Felixgithub2017 / MMCU

MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDING

Python 89 11 Updated Mar 24, 2024

Gallopsled / pwntools

CTF framework and exploit development library

Python 13,372 1,829 Updated Apr 14, 2026

seermer / HollowKnight_RL

Playing Hollow Knight with reinforcement learning.

Python 117 15 Updated Aug 14, 2023

Instruction-Tuning-with-GPT-4 / GPT-4-LLM

Instruction Tuning with GPT-4

HTML 4,335 309 Updated Jun 11, 2023

jjbrophy47 / machine_unlearning

Existing Literature about Machine Unlearning

962 118 Updated Aug 29, 2025

nyu-mll / ILF-for-code-generation

Python 80 8 Updated Mar 24, 2025

thu-ml / unidiffuser

Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"

Python 1,480 92 Updated May 31, 2023

LAION-AI / Open-Instruction-Generalist

Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks

Python 210 19 Updated Jan 13, 2024

chenfei-wu / TaskMatrix

Python 34,159 3,238 Updated Jan 6, 2024

google-deepmind / mctx

Monte Carlo tree search in JAX

Python 2,610 209 Updated Sep 2, 2025

karpathy / minGPT

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 24,172 3,214 Updated Aug 15, 2024

allenai / RL4LMs

A modular RL library to fine-tune language models to human preferences

Python 2,386 203 Updated Mar 1, 2024

tuna / thuthesis

LaTeX Thesis Template for Tsinghua University

TeX 5,257 1,143 Updated Apr 4, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Xiao Xia Rishubi

Achievements

Achievements

Block or report Rishubi

Stars

KodCode-AI / code-r1

deepseek-ai / open-infra-index

THUDM / DataSciBench

ML-GSAI / RADD

huangb23 / Identity-Text-Video-Corpus-Grounding

coetaur0 / staticfg

THUDM / Android-Lab

hiyouga / LlamaFactory

mli / paper-reading

SenseLLM / ReflectionCoder

SWE-agent / SWE-agent

kengz / awesome-deep-rl

imoneoi / openchat

mindspore-lab / mindnlp

microsoft / PromptCraft-Robotics

THUCSTAC / TAC

ShangtongZhang / reinforcement-learning-an-introduction

Felixgithub2017 / MMCU

Gallopsled / pwntools

seermer / HollowKnight_RL

Instruction-Tuning-with-GPT-4 / GPT-4-LLM

jjbrophy47 / machine_unlearning

nyu-mll / ILF-for-code-generation

thu-ml / unidiffuser

LAION-AI / Open-Instruction-Generalist

chenfei-wu / TaskMatrix

google-deepmind / mctx

karpathy / minGPT

allenai / RL4LMs

tuna / thuthesis