Skip to content
View Rishubi's full-sized avatar
  • Tsinghua University

Block or report Rishubi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
27 stars written in Python
Clear filter

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 64,148 6,502 Updated Sep 19, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 61,919 7,490 Updated Nov 6, 2025

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 22,896 2,998 Updated Aug 15, 2024

SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]

Python 17,710 1,867 Updated Nov 3, 2025

Python Implementation of Reinforcement Learning: An Introduction

Python 14,395 4,957 Updated Aug 9, 2024

CTF framework and exploit development library

Python 13,019 1,783 Updated Oct 27, 2025

OpenChat: Advancing Open-source Language Models with Imperfect Data

Python 5,439 428 Updated Sep 13, 2024

Monte Carlo tree search in JAX

Python 2,554 209 Updated Sep 2, 2025

A modular RL library to fine-tune language models to human preferences

Python 2,363 203 Updated Mar 1, 2024

Community for applying LLMs to robotics and a robot simulator with ChatGPT integration

Python 2,056 215 Updated Jan 20, 2024

清华主题PPT模板

Python 1,458 94 Updated Sep 26, 2025

Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"

Python 1,445 88 Updated May 31, 2023
Python 246 18 Updated Aug 18, 2025

Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks

Python 209 19 Updated Jan 13, 2024

Python3 control flow graph generator

Python 205 35 Updated Aug 7, 2022

Playing Hollow Knight with reinforcement learning.

Python 104 14 Updated Aug 14, 2023

MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDING

Python 89 12 Updated Mar 24, 2024

Source code of LatentOps

Python 78 9 Updated Oct 23, 2023

Official PyTorch implementation for "Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data" (ICLR 2025)

Python 66 7 Updated May 30, 2025

DataSciBench: An LLM Agent Benchmark for Data Science

Python 37 3 Updated Sep 1, 2025

Reproducing R1 for Code with Reliable Rewards

Python 11 2 Updated Apr 9, 2025
Python 11 1 Updated May 28, 2024

Source code and dataset for IJCAI 2022 paper "Rethinking the Setting of Semi-supervised Learning on Graphs"

Python 10 1 Updated May 31, 2022

[AAAI'2025] Official PyTorch implementation of the paper "Identity-Text Video Corpus Grounding".

Python 9 1 Updated Mar 7, 2025