Skip to content
View kunyuan's full-sized avatar
  • Flatiron Insititute
  • New York City

Organizations

@numericalEFT

Block or report kunyuan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

个人构建MoE大模型:从预训练到DPO的完整实践

Python 2,079 156 Updated Dec 16, 2025

Implementation for FP8/INT8 Rollout for RL training without performence drop.

Python 281 18 Updated Nov 7, 2025

Automated tool for running Python programs in a streamlined manner

JavaScript 328 18 Updated Oct 22, 2025

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,678 76 Updated May 11, 2025

GTP engine and self-play learning in Go

C++ 4,311 646 Updated Nov 8, 2025

Go engine with no human-provided knowledge, modeled after the AlphaGo Zero paper.

C++ 5,548 1,020 Updated May 2, 2024

A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games

Python 163 36 Updated Oct 26, 2024

Minimal reproduction of DeepSeek R1-Zero

Python 12,501 1,530 Updated Apr 24, 2025

A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations

Python 16,231 1,186 Updated Dec 20, 2025

minimal-cost for training 0.5B R1-Zero

Python 792 101 Updated May 14, 2025

Democratizing Reinforcement Learning for LLMs

Python 4,882 467 Updated Dec 21, 2025

High Dimensional Integration with GPU

Python 2 Updated Oct 31, 2025

PyMatching: A Python/C++ library for decoding quantum error correcting codes with minimum-weight perfect matching.

C++ 280 53 Updated Nov 27, 2025

Robust and fast Monte Carlo algorithm for high dimension integration

Julia 50 4 Updated Dec 8, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 81,393 12,166 Updated Dec 21, 2025

Teaching Addition to Small Transformers

Python 17 2 Updated Nov 28, 2023

《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被70多个国家的500多所大学用于教学。

Python 74,504 12,028 Updated Jul 30, 2024

Awesome resources on normalizing flows.

Python 1,597 131 Updated Jul 7, 2025

Machine learning algorithms for many-body quantum systems

Python 648 206 Updated Dec 16, 2025

AllocCheck

Julia 246 11 Updated Nov 4, 2025

Julia functional programming infrastructures and metaprogramming facilities

Julia 419 38 Updated Sep 9, 2025

Toolbox for Green's functions on Matsubara grids

Julia 23 4 Updated Dec 4, 2025
Julia 3 Updated Nov 11, 2025

Create any job from Julia functions

Julia 6 Updated Dec 15, 2025

A simple workflow engine powered by Julia

Julia 7 Updated Dec 10, 2025

Express: a high-level, extensible workflow framework for accelerating ab initio calculations for the materials science community

Julia 29 1 Updated Nov 26, 2025

phq: a Fortran code to compute phonon quasiparticle properties and dispersions

Fortran 14 6 Updated Aug 16, 2019

Interface and Julia implementation of exchange-correlation functionals

Julia 8 9 Updated Nov 24, 2025

Julia bindings to the libxc library for exchange-correlation functionals

Julia 23 8 Updated Nov 24, 2025
Next