Ja1Zhou

🏠

Working from home

Jay (Zhejian) Zhou Ja1Zhou

🏠

Working from home

B.S. @ PKU, CS PhD Student @ USC

30 followers · 70 following

USC
https://ja1zhou.github.io/

Achievements

Highlights

Lists (14)

Sort

Stars

175 stars written in Python

Clear filter

vwxyzjn / cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 8,233 887 Updated Jul 8, 2025

lucidrains / PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Python 7,868 681 Updated Oct 11, 2025

Future-House / paper-qa

High accuracy RAG for answering questions from scientific documents with citations

Python 7,816 783 Updated Nov 6, 2025

zai-org / GLM-130B

GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)

Python 7,681 607 Updated Jul 25, 2023

google / latexify_py

A library to generate LaTeX expression from Python code.

Python 7,578 397 Updated Feb 13, 2025

bigcode-project / starcoder

Home of StarCoder: fine-tuning & inference!

Python 7,472 529 Updated Feb 27, 2024

mli / autocut

用文本编辑器剪视频

Python 7,448 774 Updated Oct 5, 2024

AntixK / PyTorch-VAE

A Collection of Variational Autoencoders (VAE) in PyTorch.

Python 7,418 1,176 Updated Mar 21, 2025

mit-han-lab / streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 7,112 391 Updated Jul 11, 2024

timothybrooks / instruct-pix2pix

Python 6,831 574 Updated Mar 3, 2024

pdfminer / pdfminer.six

Community maintained fork of pdfminer - we fathom PDF

Python 6,777 1,009 Updated May 6, 2025

google-research / text-to-text-transfer-transformer

Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

Python 6,452 787 Updated Nov 6, 2025

yenchenlin / nerf-pytorch

A PyTorch implementation of NeRF (Neural Radiance Fields) that reproduces the results.

Python 5,942 1,123 Updated Jul 25, 2024

aiwaves-cn / agents

An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents

Python 5,753 450 Updated Sep 26, 2024

meta-pytorch / captum

Model interpretability and understanding for PyTorch

Python 5,457 547 Updated Nov 2, 2025

OpenBMB / ToolBench

[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.

Python 5,326 451 Updated May 21, 2025

CarperAI / trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,723 483 Updated Jan 8, 2024

xlang-ai / OpenAgents

[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild

Python 4,605 500 Updated Nov 18, 2024

yizhongw / self-instruct

Aligning pretrained language models with instruction data generated by themselves.

Python 4,518 522 Updated Mar 27, 2023

rom1504 / img2dataset

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Python 4,208 361 Updated Oct 19, 2025

microsoft / LMOps

General technology for enabling AI capabilities w/ LLMs and MLLMs

Python 4,165 342 Updated Jun 30, 2025

openai / transformer-debugger

Python 4,102 242 Updated Jun 4, 2024

amazon-science / mm-cot

Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)

Python 3,978 331 Updated Jun 12, 2024

microsoft / torchscale

Foundation Architecture for (M)LLMs

Python 3,119 221 Updated Apr 11, 2024

salesforce / CodeT5

Home of CodeT5: Open Code LLMs for Code Understanding and Generation

Python 3,080 485 Updated Jan 20, 2024

enhuiz / vall-e

An unofficial PyTorch implementation of the audio LM VALL-E

Python 2,990 412 Updated May 10, 2023

deepseek-ai / DeepSeek-Math

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Python 2,962 551 Updated Apr 15, 2024

noahshinn / reflexion

[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning

Python 2,932 281 Updated Jan 14, 2025

THUDM / AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

Python 2,916 208 Updated Oct 14, 2025

google-research / t5x

Python 2,906 336 Updated Nov 6, 2025

Previous Next

Jay (Zhejian) Zhou Ja1Zhou

Highlights

Lists (14)

🤖 agent

🎤 audio

🥇 Awesome Lists

💬 ChatGPT

🤖 Code

💥 compilers

🌟 Diffusion

💯 Math

🤩 Multi-modal

⭐ Packages

🍀 RL

🔨 Tools

👓 Vision

🕸️ wasm

Stars