Stars
An Open Phone Agent Model & Framework. Unlocking the AI Phone for Everyone
A curated list of cutting-edge research papers and resources on Long Chain-of-Thought (CoT) Reasoning with Tools.
Code for "Adam-mini: Use Fewer Learning Rates To Gain More" (https://arxiv.org/abs/2406.16793)
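The core idea behind Adam-mini is to replace Adam's per-coordinate second-moment estimate with a single shared scalar per parameter block, cutting optimizer memory roughly in half. A minimal sketch of one block update is below; the function name, state layout, and hyperparameters are illustrative assumptions, not the repo's actual API.

```python
import numpy as np

def adam_mini_step(param, grad, state, lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
    """One sketch update for a single parameter block.

    Unlike Adam, `v` is a single scalar shared by the whole block
    (tracking the mean squared gradient), which is the core Adam-mini idea.
    Hypothetical helper, not the repository's real interface.
    """
    state["m"] = beta1 * state["m"] + (1 - beta1) * grad          # per-coordinate momentum
    state["v"] = beta2 * state["v"] + (1 - beta2) * float(np.mean(grad ** 2))  # one scalar per block
    return param - lr * state["m"] / (np.sqrt(state["v"]) + eps)
```

Because `v` is a scalar, the second-moment state costs O(1) per block instead of O(d) per coordinate.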
Code for the paper "Preserving Diversity in Supervised Fine-tuning of Large Language Models"
Recent research papers about Foundation Models for Combinatorial Optimization
The LLMOPT project offers a comprehensive set of resources, including the model, dataset, training framework, and inference code, enabling users to fully utilize LLMOPT.
A simple, well-styled PPO implementation, based on my Medium series: https://medium.com/@eyyu/coding-ppo-from-scratch-with-pytorch-part-1-4-613dfc1b14c8.
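At the heart of any PPO implementation is the clipped surrogate objective from Schulman et al. (2017). The sketch below shows just that loss term in NumPy for clarity; the function name and signature are illustrative, not taken from the linked repo, which uses PyTorch.

```python
import numpy as np

def ppo_clip_loss(ratio, advantage, eps=0.2):
    """Clipped surrogate objective: L = -E[min(r*A, clip(r, 1-eps, 1+eps)*A)].

    `ratio` is pi_new(a|s) / pi_old(a|s) per sample; `advantage` is the
    estimated advantage. Clipping removes the incentive to move the policy
    ratio outside [1-eps, 1+eps]. Negated because optimizers minimize.
    """
    unclipped = ratio * advantage
    clipped = np.clip(ratio, 1.0 - eps, 1.0 + eps) * advantage
    return -np.minimum(unclipped, clipped).mean()
```

For example, with `ratio=1.5` and `advantage=1.0`, the clipped term caps the objective at `1.2 * 1.0`, so the loss is `-1.2`.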
[ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data generation pipeline!
ORLM: Training Large Language Models for Optimization Modeling
Multilingual Medicine: Model, Dataset, Benchmark, Code
Retrieval-Augmented Theorem Provers for Lean
Implementation of Nougat: Neural Optical Understanding for Academic Documents
This dataset code generates mathematical question-answer pairs from a range of question types at roughly school-level difficulty.
This is a PyTorch reimplementation of influence functions from the ICML 2017 best paper "Understanding Black-box Predictions via Influence Functions" by Pang Wei Koh and Percy Liang.
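The quantity that paper computes is the influence of a training point z on the test loss: I(z, z_test) = -∇L(z_test)ᵀ H⁻¹ ∇L(z), where H is the Hessian of the training loss at the learned parameters. A minimal NumPy sketch of that formula (with an explicit, precomputed Hessian; the real repo approximates H⁻¹v with LiSSA-style iterations rather than a direct solve):

```python
import numpy as np

def influence(grad_test, hessian, grad_train):
    """I(z, z_test) = -grad_test^T H^{-1} grad_train.

    Illustrative helper, assuming gradients and the (positive-definite)
    Hessian are already computed; solves H x = grad_train instead of
    forming H^{-1} explicitly.
    """
    return -grad_test @ np.linalg.solve(hessian, grad_train)
```

A negative value means upweighting z would decrease the test loss (z is helpful for z_test); a positive value means it is harmful.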
Everything you need about Active Learning (AL).
Tool for data extraction and interacting with Lean programmatically.
Fast Inference Solutions for BLOOM
deepspeedai/Megatron-DeepSpeed
Forked from NVIDIA/Megatron-LM. Ongoing research on training transformer language models at scale, including BERT & GPT-2.
800,000 step-level correctness labels on LLM solutions to MATH problems