Ja1Zhou

🏠

Working from home

Jay (Zhejian) Zhou Ja1Zhou

🏠

Working from home

B.S. @ PKU, CS PhD Student @ USC

30 followers · 70 following

USC
https://ja1zhou.github.io/

Achievements

Highlights

Lists (14)

Sort

Stars

175 stars written in Python

Clear filter

thunlp / UltraChat

Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)

Python 2,708 136 Updated Mar 13, 2024

microsoft / CodeBERT

CodeBERT

Python 2,662 502 Updated Jul 9, 2023

AbanteAI / archive-old-cli-mentat

Python 2,565 242 Updated Jan 7, 2025

stanford-crfm / helm

Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparen…

Python 2,534 340 Updated Oct 21, 2025

opendilab / PPOxFamily

PPO x Family DRL Tutorial Course（决策智能入门级公开课：8节课帮你盘清算法理论，理顺代码逻辑，玩转决策AI应用实践）

Python 2,425 204 Updated Mar 13, 2025

allenai / RL4LMs

A modular RL library to fine-tune language models to human preferences

Python 2,363 203 Updated Mar 1, 2024

lucidrains / lion-pytorch

🦁 Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in Pytorch

Python 2,170 56 Updated Nov 27, 2024

openai / prm800k

800,000 step-level correctness labels on LLM solutions to MATH problems

Python 2,067 122 Updated Jun 1, 2023

microsoft / PromptCraft-Robotics

Community for applying LLMs to robotics and a robot simulator with ChatGPT integration

Python 2,056 215 Updated Jan 20, 2024

lucidrains / toolformer-pytorch

Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI

Python 2,055 129 Updated Jul 22, 2024

neulab / prompt2model

prompt2model - Generate Deployable Models from Natural Language Instructions

Python 2,007 182 Updated Dec 29, 2024

salesforce / ALBEF

Code for ALBEF: a new vision-language pre-training method

Python 1,730 222 Updated Sep 20, 2022

evalplus / evalplus

Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024

Python 1,620 179 Updated Oct 2, 2025

gururise / AlpacaDataCleaned

Alpaca dataset from Stanford, cleaned and curated

Python 1,579 150 Updated Apr 14, 2023

google-research / FLAN

Python 1,550 161 Updated Oct 29, 2025

samim23 / polymath

Convert any music library into a music production sample-library with ML

Python 1,550 121 Updated Aug 17, 2024

vturrisi / solo-learn

solo-learn: a library of self-supervised methods for visual representation learning powered by Pytorch Lightning

Python 1,526 196 Updated Oct 20, 2025

THUDM / AgentTuning

AgentTuning: Enabling Generalized Agent Abilities for LLMs

Python 1,467 106 Updated Oct 31, 2023

bigscience-workshop / Megatron-DeepSpeed

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 1,425 228 Updated Mar 20, 2024

Yifan-Song793 / RestGPT

An LLM-based autonomous agent controlling real-world applications via RESTful APIs

Python 1,386 103 Updated Jun 7, 2024

PKU-TANGENT / nlp-tutorial

NLP新手入门教程

Python 1,383 131 Updated Oct 23, 2022

poloclub / diffusiondb

A large-scale text-to-image prompt gallery dataset based on Stable Diffusion

Python 1,323 73 Updated Jul 11, 2024

NVlabs / prismer

The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".

Python 1,304 73 Updated Jan 17, 2024

tysam-code / hlb-CIFAR10

Train to 94% on CIFAR-10 in <6.3 seconds on a single A100. Or ~95.79% in ~110 seconds (or less!)

Python 1,289 79 Updated Dec 18, 2024

srush / MiniChain

A tiny library for coding with large language models.

Python 1,236 74 Updated Jul 10, 2024

Genesis-Embodied-AI / RoboGen

A generative and self-guided robotic agent that endlessly propose and master new skills.

Python 1,094 102 Updated May 31, 2024

openai / summarize-from-feedback

Code for "Learning to summarize from human feedback"

Python 1,052 153 Updated Sep 5, 2023

microsoft / VideoX

VideoX: a collection of video cross-modal models

Python 1,047 164 Updated Jun 3, 2024

allenai / natural-instructions

Expanding natural instructions

Python 1,022 197 Updated Dec 11, 2023

allenai / mmc4

MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.

Python 942 39 Updated Mar 19, 2025

Previous Next

Jay (Zhejian) Zhou Ja1Zhou

Highlights

Lists (14)

🤖 agent

🎤 audio

🥇 Awesome Lists

💬 ChatGPT

🤖 Code

💥 compilers

🌟 Diffusion

💯 Math

🤩 Multi-modal

⭐ Packages

🍀 RL

🔨 Tools

👓 Vision

🕸️ wasm

Stars