Skip to content
View Ja1Zhou's full-sized avatar
🏠
Working from home
🏠
Working from home

Highlights

  • Pro

Block or report Ja1Zhou

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Best practices for training DeepSeek, Mixtral, Qwen and other MoE models using Megatron Core.

Python 123 26 Updated Nov 4, 2025

Bridge Megatron-Core to Hugging Face/Reinforcement Learning

Python 147 30 Updated Nov 4, 2025

Build 3D Gaussian Splatting from scratch with NVIDIA Warp in Python — CPU/GPU compatible, with a clean and minimalist design focused on learning modern graphics.

Python 229 15 Updated Sep 28, 2025

Must-read papers and blogs about parametric knowledge mechanism in LLMs.

29 Updated May 9, 2025

A Comprehensive Survey on Long Context Language Modeling

198 15 Updated Jul 8, 2025

MPO: Boosting LLM Agents with Meta Plan Optimization (EMNLP 2025 Findings)

Python 73 5 Updated Aug 20, 2025

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

1,805 76 Updated Oct 31, 2025

An awesome repository & A comprehensive survey on interpretability of LLM attention heads.

TeX 379 12 Updated Mar 2, 2025
JavaScript 3,680 1,573 Updated Jun 21, 2024

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,838 373 Updated Oct 17, 2025

A repo for open resources & information for people to succeed in PhD in CS & career in AI / NLP

952 77 Updated Sep 22, 2024

Code and data used in the paper: "Training on Incorrect Synthetic Data via RL Scales LLM Math Reasoning Eight-Fold"

31 Updated Jun 16, 2024

Sparsify transformers with SAEs and transcoders

Python 651 86 Updated Nov 3, 2025

A curated list of Large Language Model (LLM) Interpretability resources.

1,436 102 Updated Jun 22, 2025

🙌 OpenHands: Code Less, Make More

Python 64,715 7,866 Updated Nov 5, 2025

GitHub最新hosts。解决GitHub图片无法显示,加速GitHub网页浏览。

TypeScript 5,245 446 Updated Nov 5, 2025

[ACL 2024]Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems.

Python 173 9 Updated Jun 8, 2025
Jupyter Notebook 83 10 Updated Jan 25, 2025

A minimal GPU design in Verilog to learn how GPUs work from the ground up

SystemVerilog 8,847 697 Updated Aug 18, 2024

LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)

Python 144 9 Updated Nov 9, 2024

The official Meta Llama 3 GitHub site

Python 29,072 3,474 Updated Jan 26, 2025

Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024

Python 1,618 179 Updated Oct 2, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,319 807 Updated Oct 31, 2025

WebAssembly (Wasm) Build and Bindings for llama.cpp

JavaScript 284 28 Updated Jul 23, 2024

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Python 2,961 551 Updated Apr 15, 2024

State-of-the-art bilingual open-sourced Math reasoning LLMs.

Python 525 36 Updated Oct 22, 2024

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

Python 12,223 1,117 Updated Sep 26, 2025

LLMs as Copilots for Theorem Proving in Lean

C++ 1,183 114 Updated Oct 15, 2025

A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.

Jupyter Notebook 2,951 194 Updated Nov 3, 2025
Next