-
ByteDance Seed
- Beijing, China
-
03:43
(UTC +08:00) - tongyx361.github.io
- @tongyx361
-
verl Public
Forked from verl-project/verlverl: Volcano Engine Reinforcement Learning for LLMs
Python Apache License 2.0 UpdatedMay 26, 2026 -
claude-code Public
Forked from ultraworkers/claw-codeAn independent Python feature port of Claude Code, entirely rewritting from scratch using oh-my-codex. Educational Purpose only.
Python UpdatedMar 31, 2026 -
codex Public
Forked from openai/codexLightweight coding agent that runs in your terminal
Rust Apache License 2.0 UpdatedMar 16, 2026 -
-
slime Public
Forked from THUDM/slimeslime is an LLM post-training framework for RL Scaling.
Python Apache License 2.0 UpdatedMar 5, 2026 -
rllm Public
Forked from rllm-org/rllmDemocratizing Reinforcement Learning for LLMs
Python Apache License 2.0 UpdatedJan 10, 2026 -
verl-recipe Public
Forked from verl-project/verl-recipeA set of examples based on verl for end-to-end RL training recipes.
Python UpdatedJan 5, 2026 -
tinker-cookbook Public
Forked from thinking-machines-lab/tinker-cookbookPost-training with Tinker
Python Apache License 2.0 UpdatedDec 18, 2025 -
tensordict Public
Forked from pytorch/tensordictTensorDict is a pytorch dedicated tensor container.
Python MIT License UpdatedDec 6, 2025 -
TransformerEngine Public
Forked from NVIDIA/TransformerEngineA library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory…
Python Apache License 2.0 UpdatedOct 31, 2025 -
Megatron-LM Public
Forked from NVIDIA/Megatron-LMOngoing research training transformer models at scale
Python Other UpdatedOct 30, 2025 -
transformers Public
Forked from huggingface/transformers🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python Apache License 2.0 UpdatedOct 23, 2025 -
flash-attention Public
Forked from Dao-AILab/flash-attentionFast and memory-efficient exact attention
-
metagen Public
Flexible meta-generation library
-
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedAug 22, 2025 -
pytorch Public
Forked from pytorch/pytorchTensors and Dynamic neural networks in Python with strong GPU acceleration
Python Other UpdatedAug 6, 2025 -
sonetsim Public
A Social Network Simulator based on Large Language Model Agents.
-
-
oasis Public
Forked from camel-ai/oasis🏝️ OASIS: Open Agent Social Interaction Simulations with One Million Agents. https://oasis.camel-ai.org
-
tongyx361.github.io Public
(Shawn) Yuxuan Tong's Homepage
Python Apache License 2.0 UpdatedJun 6, 2025 -
-
-
psp-lab2-recog-button-audio Public
Lab 2 *Recognition of Button Audio* in course *Principles of Signal Processing* by Prof. Jia Jia at DSCT, THU
Python MIT License UpdatedJan 14, 2025 -
nbdev-template-tongyx361 Public template
nbdev template customed by Yuxuan Tong
-
psp-lab3-zero-phase-filter Public
Lab 2 *Zero-Phase Filter* in course *Principles of Signal Processing* by Prof. Jia Jia at DSCT, THU
Jupyter Notebook MIT License UpdatedJan 11, 2025 -
VinePPO Public
Forked from McGill-NLP/VinePPOCode for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"
Python MIT License UpdatedJan 8, 2025 -
OpenRLHF Public
Forked from OpenRLHF/OpenRLHFAn Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
-
-
Qwen2.5-Math Public
Forked from QwenLM/Qwen2.5-MathA series of math-specific large language models of our Qwen2 series.
Python UpdatedNov 28, 2024 -
psp-lab1-viz-fourier-series Public
Lab 1 *Visualization of Fourier Series* in course *Principles of Signal Processing* by Prof. Jia Jia at DSCT, THU
Jupyter Notebook MIT License UpdatedNov 1, 2024