Ja1Zhou

🏠

Working from home

Jay (Zhejian) Zhou Ja1Zhou

B.S. @ PKU, CS PhD Student @ USC

30 followers · 70 following

USC
https://ja1zhou.github.io/

Stars

yanring / Megatron-MoE-ModelZoo

Best practices for training DeepSeek, Mixtral, Qwen and other MoE models using Megatron Core.

Python 123 26 Updated Nov 6, 2025

ISEEKYAN / mbridge

Bridge Megatron-Core to Hugging Face/Reinforcement Learning

Python 147 31 Updated Nov 6, 2025

guoriyue / 3dgs-warp-scratch

Build 3D Gaussian Splatting from scratch with NVIDIA Warp in Python — CPU/GPU compatible, with a clean and minimalist design focused on learning modern graphics.

Python 229 15 Updated Sep 28, 2025

Trae1ounG / Awesome-Parametric-Knowledge-in-LLMs

Must-read papers and blogs about parametric knowledge mechanism in LLMs.

29 Updated May 9, 2025

LCLM-Horizon / A-Comprehensive-Survey-For-Long-Context-Language-Modeling

A Comprehensive Survey on Long Context Language Modeling

198 15 Updated Jul 8, 2025

WeiminXiong / MPO

MPO: Boosting LLM Agents with Meta Plan Optimization (EMNLP 2025 Findings)

Python 73 5 Updated Aug 20, 2025

Xnhyacinth / Awesome-LLM-Long-Context-Modeling

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

1,807 76 Updated Nov 6, 2025

IAAR-Shanghai / Awesome-Attention-Heads

An awesome repository & A comprehensive survey on interpretability of LLM attention heads.

TeX 379 12 Updated Mar 2, 2025

nerfies / nerfies.github.io

JavaScript 3,683 1,577 Updated Jun 21, 2024

hijkzzz / Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,838 373 Updated Oct 17, 2025

zhijing-jin / nlp-phd-global-equality

A repo for open resources & information for people to succeed in PhD in CS & career in AI / NLP

952 77 Updated Sep 22, 2024

ars22 / scaling-LLM-math-synthetic-data

Code and data used in the paper: "Training on Incorrect Synthetic Data via RL Scales LLM Math Reasoning Eight-Fold"

31 Updated Jun 16, 2024

EleutherAI / sparsify

Sparsify transformers with SAEs and transcoders

Python 652 86 Updated Nov 3, 2025

JShollaj / awesome-llm-interpretability

A curated list of Large Language Model (LLM) Interpretability resources.

1,437 102 Updated Jun 22, 2025

OpenHands / OpenHands

🙌 OpenHands: Code Less, Make More

Python 64,747 7,870 Updated Nov 6, 2025

ineo6 / hosts

The latest GitHub hosts. Fixes GitHub images failing to display and speeds up GitHub web browsing.

TypeScript 5,246 446 Updated Nov 6, 2025

OpenBMB / OlympiadBench

[ACL 2024] Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems.

Python 173 9 Updated Jun 8, 2025

protagolabs / odyssey-math

Jupyter Notebook 83 10 Updated Jan 25, 2025

adam-maj / tiny-gpu

A minimal GPU design in Verilog to learn how GPUs work from the ground up

SystemVerilog 8,855 697 Updated Aug 18, 2024

dwzhu-pku / LongEmbed

LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)

Python 144 9 Updated Nov 9, 2024

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 29,074 3,476 Updated Jan 26, 2025

evalplus / evalplus

Rigorous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024

Python 1,618 179 Updated Oct 2, 2025

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,328 808 Updated Oct 31, 2025