Rishubi

Xiao Xia Rishubi

59 followers · 48 following

Tsinghua University

Achievements

Stars

27 stars written in Python

labmlai / annotated_deep_learning_paper_implementations

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python · 64,148 stars · 6,502 forks · Updated Sep 19, 2025

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python · 61,919 stars · 7,490 forks · Updated Nov 6, 2025

chenfei-wu / TaskMatrix

Python · 34,354 stars · 3,272 forks · Updated Jan 6, 2024

karpathy / minGPT

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python · 22,896 stars · 2,998 forks · Updated Aug 15, 2024

SWE-agent / SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]

Python · 17,710 stars · 1,867 forks · Updated Nov 3, 2025

ShangtongZhang / reinforcement-learning-an-introduction

Python Implementation of Reinforcement Learning: An Introduction

Python · 14,395 stars · 4,957 forks · Updated Aug 9, 2024

Gallopsled / pwntools

CTF framework and exploit development library

Python · 13,019 stars · 1,783 forks · Updated Oct 27, 2025

imoneoi / openchat

OpenChat: Advancing Open-source Language Models with Imperfect Data

Python · 5,439 stars · 428 forks · Updated Sep 13, 2024

google-deepmind / alphatensor

Python · 2,791 stars · 256 forks · Updated Apr 22, 2024

google-deepmind / mctx

Monte Carlo tree search in JAX

Python · 2,554 stars · 209 forks · Updated Sep 2, 2025

allenai / RL4LMs

A modular RL library to fine-tune language models to human preferences

Python · 2,363 stars · 203 forks · Updated Mar 1, 2024

microsoft / PromptCraft-Robotics

Community for applying LLMs to robotics and a robot simulator with ChatGPT integration

Python · 2,056 stars · 215 forks · Updated Jan 20, 2024

atomiechen / THU-PPT-Theme

Tsinghua-themed PPT template

Python · 1,458 stars · 94 forks · Updated Sep 26, 2025

thu-ml / unidiffuser

Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"

Python · 1,445 stars · 88 forks · Updated May 31, 2023

THUDM / Android-Lab

Python · 246 stars · 18 forks · Updated Aug 18, 2025

LAION-AI / Open-Instruction-Generalist

Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks

Python · 209 stars · 19 forks · Updated Jan 13, 2024

coetaur0 / staticfg

Python3 control flow graph generator

Python · 205 stars · 35 forks · Updated Aug 7, 2022

seermer / HollowKnight_RL

Playing Hollow Knight with reinforcement learning.

Python · 104 stars · 14 forks · Updated Aug 14, 2023

Felixgithub2017 / MMCU

Measuring Massive Multitask Chinese Understanding

Python · 89 stars · 12 forks · Updated Mar 24, 2024

nyu-mll / ILF-for-code-generation

Python · 80 stars · 8 forks · Updated Mar 24, 2025

guangyliu / LatentOps

Source code of LatentOps

Python · 78 stars · 9 forks · Updated Oct 23, 2023

ML-GSAI / RADD

Official PyTorch implementation for "Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data" (ICLR 2025)

Python · 66 stars · 7 forks · Updated May 30, 2025

THUDM / DataSciBench

DataSciBench: An LLM Agent Benchmark for Data Science

Python · 37 stars · 3 forks · Updated Sep 1, 2025

KodCode-AI / code-r1

Forked from ganler/code-r1

Reproducing R1 for Code with Reliable Rewards

Python · 11 stars · 2 forks · Updated Apr 9, 2025

SenseLLM / ReflectionCoder

Python · 11 stars · 1 fork · Updated May 28, 2024

THUDM / IGB

Source code and dataset for IJCAI 2022 paper "Rethinking the Setting of Semi-supervised Learning on Graphs"

Python · 10 stars · 1 fork · Updated May 31, 2022

huangb23 / Identity-Text-Video-Corpus-Grounding

[AAAI 2025] Official PyTorch implementation of the paper "Identity-Text Video Corpus Grounding".

Python · 9 stars · 1 fork · Updated Mar 7, 2025