azshue

Manli Shu azshue

Research and code

48 followers · 26 following

Salesforce Research
Palo Alto
https://azshue.github.io/

Achievements

x2 x2

Achievements

x2 x2

Organizations

Stars

mll-lab-nu / RAGEN

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 2,710 225 Updated Apr 14, 2026

zhaochenyang20 / Awesome-ML-SYS-Tutorial

My learning notes for ML SYS.

Python 6,566 448 Updated Jun 18, 2026

cambrian-mllm / cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 2,005 138 Updated Nov 7, 2025

srush / Tensor-Puzzles

Solve puzzles. Improve your pytorch.

Jupyter Notebook 4,162 378 Updated Jul 15, 2024

RL4VLM / RL4VLM

Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning

Jupyter Notebook 412 39 Updated Dec 15, 2024

verl-project / verl

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 22,089 4,110 Updated Jun 23, 2026

Jiayi-Pan / TinyZero

Minimal reproduction of DeepSeek R1-Zero

Python 13,177 1,585 Updated Feb 27, 2026

om-ai-lab / OmAgent

[EMNLP-2024] Build multimodal language agents for fast prototype and production

Python 2,660 289 Updated Mar 19, 2025

allenai / open-instruct

AllenAI's post-training codebase

Python 3,763 548 Updated Jun 22, 2026

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

Python 9,672 973 Updated Jun 17, 2026

SalesforceAIResearch / LATTE

Python 69 3 Updated Jun 2, 2026

JieyuZ2 / ProVision

A instruction data generation system for multimodal language models.

Jupyter Notebook 37 1 Updated Jan 31, 2025

tomaarsen / attention_sinks

Extend existing LLMs way beyond the original training length with constant memory usage, without retraining

Python 736 45 Updated Apr 10, 2024

salesforce / Hierarchical_Point_Attention

Python 10 1 Updated Jun 2, 2026

mlfoundations / MINT-1T

🍃 MINT-1T: A one trillion token multimodal interleaved dataset.

833 19 Updated Jul 31, 2024

zzxslp / SoM-LLaVA

[COLM-2024] List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs

Python 145 4 Updated Aug 23, 2024

YuxinWenRick / diffusion_memorization

Official repo for Detecting, Explaining, and Mitigating Memorization in Diffusion Models (ICLR 2024)

Python 80 9 Updated Apr 3, 2024

JonasGeiping / carving

Package to optimize Adversarial Attacks against (Large) Language Models with Varied Objectives

Python 71 6 Updated Feb 22, 2024

mlfoundations / open_flamingo

An open-source framework for training large multimodal models.

Python 4,107 320 Updated Aug 31, 2024

salesforce / LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 11,238 1,107 Updated Jun 2, 2026

InternLM / xtuner

A Next-Generation Training Engine Built for Ultra-Large MoE Models

Python 5,150 425 Updated Jun 23, 2026

openai / consistencydecoder

Consistency Distilled Diff VAE

Python 2,213 80 Updated Nov 7, 2023

karpathy / nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 60,027 10,338 Updated Nov 12, 2025

NVIDIA / Megatron-LM

Ongoing research training transformer models at scale

Python 16,800 4,109 Updated Jun 23, 2026

neelsjain / NEFTune

Official repository of NEFTune: Noisy Embeddings Improves Instruction Finetuning

Python 412 19 Updated May 17, 2024

deepspeedai / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 42,565 4,864 Updated Jun 23, 2026

ermongroup / ddim

Denoising Diffusion Implicit Models

Python 1,831 233 Updated Jul 26, 2024

openai / glide-text2im

GLIDE: a diffusion-based text-conditional image synthesis model

Python 3,688 501 Updated Mar 8, 2024

lm-sys / FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 39,486 4,791 Updated May 1, 2026

EleutherAI / lm-evaluation-harness

A framework for few-shot evaluation of language models.

Python 13,031 3,353 Updated Jun 22, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Manli Shu azshue

Achievements

Achievements

Organizations

Block or report azshue

Stars

mll-lab-nu / RAGEN

zhaochenyang20 / Awesome-ML-SYS-Tutorial

cambrian-mllm / cambrian

srush / Tensor-Puzzles

RL4VLM / RL4VLM

verl-project / verl

Jiayi-Pan / TinyZero

om-ai-lab / OmAgent

allenai / open-instruct

OpenRLHF / OpenRLHF

SalesforceAIResearch / LATTE

JieyuZ2 / ProVision

tomaarsen / attention_sinks

salesforce / Hierarchical_Point_Attention

mlfoundations / MINT-1T

zzxslp / SoM-LLaVA

YuxinWenRick / diffusion_memorization

JonasGeiping / carving

mlfoundations / open_flamingo

salesforce / LAVIS

InternLM / xtuner

openai / consistencydecoder

karpathy / nanoGPT

NVIDIA / Megatron-LM

neelsjain / NEFTune

deepspeedai / DeepSpeed

ermongroup / ddim

openai / glide-text2im

lm-sys / FastChat

EleutherAI / lm-evaluation-harness