Skip to content
View Ja1Zhou's full-sized avatar
🏠
Working from home
🏠
Working from home

Highlights

  • Pro

Block or report Ja1Zhou

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
175 stars written in Python
Clear filter

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 8,233 887 Updated Jul 8, 2025

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Python 7,868 681 Updated Oct 11, 2025

High accuracy RAG for answering questions from scientific documents with citations

Python 7,816 783 Updated Nov 6, 2025

GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)

Python 7,681 607 Updated Jul 25, 2023

A library to generate LaTeX expression from Python code.

Python 7,578 397 Updated Feb 13, 2025

Home of StarCoder: fine-tuning & inference!

Python 7,472 529 Updated Feb 27, 2024

用文本编辑器剪视频

Python 7,448 774 Updated Oct 5, 2024

A Collection of Variational Autoencoders (VAE) in PyTorch.

Python 7,418 1,176 Updated Mar 21, 2025

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 7,112 391 Updated Jul 11, 2024

Community maintained fork of pdfminer - we fathom PDF

Python 6,777 1,009 Updated May 6, 2025

Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

Python 6,452 787 Updated Nov 6, 2025

A PyTorch implementation of NeRF (Neural Radiance Fields) that reproduces the results.

Python 5,942 1,123 Updated Jul 25, 2024

An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents

Python 5,753 450 Updated Sep 26, 2024

Model interpretability and understanding for PyTorch

Python 5,457 547 Updated Nov 2, 2025

[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.

Python 5,326 451 Updated May 21, 2025

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,723 483 Updated Jan 8, 2024

[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild

Python 4,605 500 Updated Nov 18, 2024

Aligning pretrained language models with instruction data generated by themselves.

Python 4,518 522 Updated Mar 27, 2023

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Python 4,208 361 Updated Oct 19, 2025

General technology for enabling AI capabilities w/ LLMs and MLLMs

Python 4,165 342 Updated Jun 30, 2025

Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)

Python 3,978 331 Updated Jun 12, 2024

Foundation Architecture for (M)LLMs

Python 3,119 221 Updated Apr 11, 2024

Home of CodeT5: Open Code LLMs for Code Understanding and Generation

Python 3,080 485 Updated Jan 20, 2024

An unofficial PyTorch implementation of the audio LM VALL-E

Python 2,990 412 Updated May 10, 2023

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Python 2,962 551 Updated Apr 15, 2024

[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning

Python 2,932 281 Updated Jan 14, 2025

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

Python 2,916 208 Updated Oct 14, 2025
Python 2,906 336 Updated Nov 6, 2025