Skip to content
View Ja1Zhou's full-sized avatar
🏠
Working from home
🏠
Working from home

Highlights

  • Pro

Block or report Ja1Zhou

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
175 stars written in Python
Clear filter

Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)

Python 2,708 136 Updated Mar 13, 2024

CodeBERT

Python 2,662 502 Updated Jul 9, 2023

Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparen…

Python 2,534 340 Updated Oct 21, 2025

PPO x Family DRL Tutorial Course(决策智能入门级公开课:8节课帮你盘清算法理论,理顺代码逻辑,玩转决策AI应用实践 )

Python 2,425 204 Updated Mar 13, 2025

A modular RL library to fine-tune language models to human preferences

Python 2,363 203 Updated Mar 1, 2024

🦁 Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in Pytorch

Python 2,170 56 Updated Nov 27, 2024

800,000 step-level correctness labels on LLM solutions to MATH problems

Python 2,067 122 Updated Jun 1, 2023

Community for applying LLMs to robotics and a robot simulator with ChatGPT integration

Python 2,056 215 Updated Jan 20, 2024

Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI

Python 2,055 129 Updated Jul 22, 2024

prompt2model - Generate Deployable Models from Natural Language Instructions

Python 2,007 182 Updated Dec 29, 2024

Code for ALBEF: a new vision-language pre-training method

Python 1,730 222 Updated Sep 20, 2022

Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024

Python 1,620 179 Updated Oct 2, 2025

Alpaca dataset from Stanford, cleaned and curated

Python 1,579 150 Updated Apr 14, 2023
Python 1,550 161 Updated Oct 29, 2025

Convert any music library into a music production sample-library with ML

Python 1,550 121 Updated Aug 17, 2024

solo-learn: a library of self-supervised methods for visual representation learning powered by Pytorch Lightning

Python 1,526 196 Updated Oct 20, 2025

AgentTuning: Enabling Generalized Agent Abilities for LLMs

Python 1,467 106 Updated Oct 31, 2023

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 1,425 228 Updated Mar 20, 2024

An LLM-based autonomous agent controlling real-world applications via RESTful APIs

Python 1,386 103 Updated Jun 7, 2024

NLP新手入门教程

Python 1,383 131 Updated Oct 23, 2022

A large-scale text-to-image prompt gallery dataset based on Stable Diffusion

Python 1,323 73 Updated Jul 11, 2024

The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".

Python 1,304 73 Updated Jan 17, 2024

Train to 94% on CIFAR-10 in <6.3 seconds on a single A100. Or ~95.79% in ~110 seconds (or less!)

Python 1,289 79 Updated Dec 18, 2024

A tiny library for coding with large language models.

Python 1,236 74 Updated Jul 10, 2024

A generative and self-guided robotic agent that endlessly propose and master new skills.

Python 1,094 102 Updated May 31, 2024

Code for "Learning to summarize from human feedback"

Python 1,052 153 Updated Sep 5, 2023

VideoX: a collection of video cross-modal models

Python 1,047 164 Updated Jun 3, 2024

Expanding natural instructions

Python 1,022 197 Updated Dec 11, 2023

MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.

Python 942 39 Updated Mar 19, 2025