
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 2,710 225 Updated Apr 14, 2026

My learning notes for ML SYS.

Python 6,566 448 Updated Jun 18, 2026

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 2,005 138 Updated Nov 7, 2025

Solve puzzles. Improve your pytorch.

Jupyter Notebook 4,162 378 Updated Jul 15, 2024

Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning

Jupyter Notebook 412 39 Updated Dec 15, 2024

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 22,089 4,110 Updated Jun 23, 2026

Minimal reproduction of DeepSeek R1-Zero

Python 13,177 1,585 Updated Feb 27, 2026

[EMNLP-2024] Build multimodal language agents for fast prototype and production

Python 2,660 289 Updated Mar 19, 2025

AllenAI's post-training codebase

Python 3,763 548 Updated Jun 22, 2026

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

Python 9,672 973 Updated Jun 17, 2026

Python 69 3 Updated Jun 2, 2026

An instruction data generation system for multimodal language models.

Jupyter Notebook 37 1 Updated Jan 31, 2025

Extend existing LLMs way beyond the original training length with constant memory usage, without retraining

Python 736 45 Updated Apr 10, 2024

🍃 MINT-1T: A one trillion token multimodal interleaved dataset.

833 19 Updated Jul 31, 2024

[COLM-2024] List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs

Python 145 4 Updated Aug 23, 2024

Official repo for Detecting, Explaining, and Mitigating Memorization in Diffusion Models (ICLR 2024)

Python 80 9 Updated Apr 3, 2024

Package to optimize Adversarial Attacks against (Large) Language Models with Varied Objectives

Python 71 6 Updated Feb 22, 2024

An open-source framework for training large multimodal models.

Python 4,107 320 Updated Aug 31, 2024

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 11,238 1,107 Updated Jun 2, 2026

A Next-Generation Training Engine Built for Ultra-Large MoE Models

Python 5,150 425 Updated Jun 23, 2026

Consistency Distilled Diff VAE

Python 2,213 80 Updated Nov 7, 2023

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 60,027 10,338 Updated Nov 12, 2025

Ongoing research training transformer models at scale

Python 16,800 4,109 Updated Jun 23, 2026

Official repository of NEFTune: Noisy Embeddings Improves Instruction Finetuning

Python 412 19 Updated May 17, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 42,565 4,864 Updated Jun 23, 2026

Denoising Diffusion Implicit Models

Python 1,831 233 Updated Jul 26, 2024

GLIDE: a diffusion-based text-conditional image synthesis model

Python 3,688 501 Updated Mar 8, 2024

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 39,486 4,791 Updated May 1, 2026

A framework for few-shot evaluation of language models.

Python 13,031 3,353 Updated Jun 22, 2026