Rishubi

Xiao Xia Rishubi

59 followers · 48 following

Tsinghua University

Achievements

Stars

KodCode-AI / code-r1

Forked from ganler/code-r1

Reproducing R1 for Code with Reliable Rewards

Python 11 2 Updated Apr 9, 2025

deepseek-ai / open-infra-index

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,929 286 Updated May 15, 2025

THUDM / DataSciBench

DataSciBench: An LLM Agent Benchmark for Data Science

Python 37 3 Updated Sep 1, 2025

ML-GSAI / RADD

Official PyTorch implementation for "Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data" (ICLR 2025)

Python 66 7 Updated May 30, 2025

huangb23 / Identity-Text-Video-Corpus-Grounding

[AAAI'2025] Official PyTorch implementation of the paper "Identity-Text Video Corpus Grounding".

Python 9 1 Updated Mar 7, 2025

coetaur0 / staticfg

Python3 control flow graph generator

Python 205 35 Updated Aug 7, 2022

THUDM / Android-Lab

Python 247 18 Updated Aug 18, 2025

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 62,015 7,496 Updated Nov 6, 2025

mli / paper-reading

深度学习经典、新论文逐段精读

31,868 2,740 Updated Mar 22, 2025

SenseLLM / ReflectionCoder

Python 11 1 Updated May 28, 2024

SWE-agent / SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]

Python 17,727 1,868 Updated Nov 3, 2025

kengz / awesome-deep-rl

A curated list of awesome Deep Reinforcement Learning resources.

826 79 Updated Jul 13, 2025

imoneoi / openchat

OpenChat: Advancing Open-source Language Models with Imperfect Data

Python 5,439 429 Updated Sep 13, 2024

mindspore-lab / mindnlp

Easy-to-use and high-performance NLP and LLM framework based on MindSpore, compatible with models and datasets of 🤗Huggingface.

Jupyter Notebook 895 257 Updated Nov 5, 2025

microsoft / PromptCraft-Robotics

Community for applying LLMs to robotics and a robot simulator with ChatGPT integration

Python 2,056 215 Updated Jan 20, 2024

THUCSTAC / TAC

Teaching Assistants Competency 助教素养

8 1 Updated Nov 28, 2024

ShangtongZhang / reinforcement-learning-an-introduction

Python Implementation of Reinforcement Learning: An Introduction

Python 14,400 4,958 Updated Aug 9, 2024

Felixgithub2017 / MMCU

MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDING

Python 89 12 Updated Mar 24, 2024

Gallopsled / pwntools

CTF framework and exploit development library

Python 13,019 1,784 Updated Nov 6, 2025

seermer / HollowKnight_RL

Playing Hollow Knight with reinforcement learning.

Python 104 14 Updated Aug 14, 2023

Instruction-Tuning-with-GPT-4 / GPT-4-LLM

Instruction Tuning with GPT-4

HTML 4,338 306 Updated Jun 11, 2023

jjbrophy47 / machine_unlearning

Existing Literature about Machine Unlearning

921 112 Updated Aug 29, 2025

nyu-mll / ILF-for-code-generation

Python 80 8 Updated Mar 24, 2025

thu-ml / unidiffuser

Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"

Python 1,445 88 Updated May 31, 2023

LAION-AI / Open-Instruction-Generalist

Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks

Python 209 19 Updated Jan 13, 2024

chenfei-wu / TaskMatrix

Python 34,355 3,272 Updated Jan 6, 2024

google-deepmind / mctx

Monte Carlo tree search in JAX

Python 2,554 209 Updated Sep 2, 2025

karpathy / minGPT

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 22,903 3,001 Updated Aug 15, 2024

allenai / RL4LMs

A modular RL library to fine-tune language models to human preferences

Python 2,363 203 Updated Mar 1, 2024

tuna / thuthesis

LaTeX Thesis Template for Tsinghua University

TeX 5,020 1,127 Updated Oct 19, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Xiao Xia Rishubi

Achievements

Achievements

Block or report Rishubi

Stars

KodCode-AI / code-r1

deepseek-ai / open-infra-index

THUDM / DataSciBench

ML-GSAI / RADD

huangb23 / Identity-Text-Video-Corpus-Grounding

coetaur0 / staticfg

THUDM / Android-Lab

hiyouga / LLaMA-Factory

mli / paper-reading

SenseLLM / ReflectionCoder

SWE-agent / SWE-agent

kengz / awesome-deep-rl

imoneoi / openchat

mindspore-lab / mindnlp

microsoft / PromptCraft-Robotics

THUCSTAC / TAC

ShangtongZhang / reinforcement-learning-an-introduction

Felixgithub2017 / MMCU

Gallopsled / pwntools

seermer / HollowKnight_RL

Instruction-Tuning-with-GPT-4 / GPT-4-LLM

jjbrophy47 / machine_unlearning

nyu-mll / ILF-for-code-generation

thu-ml / unidiffuser

LAION-AI / Open-Instruction-Generalist

chenfei-wu / TaskMatrix

google-deepmind / mctx

karpathy / minGPT

allenai / RL4LMs

tuna / thuthesis