An Open Phone Agent Model & Framework. Unlocking the AI Phone for Everyone

Python 23,338 3,690 Updated Feb 6, 2026

A curated list of cutting-edge research papers and resources on Long Chain-of-Thought (CoT) Reasoning with Tools.

45 4 Updated Dec 17, 2025

Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793

Python 452 16 Updated May 13, 2025
Python 71 5 Updated Oct 23, 2025

Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models)

Python 51 5 Updated May 12, 2025

Democratizing Reinforcement Learning for LLMs

Python 5,097 499 Updated Feb 11, 2026
Python 14 1 Updated Jan 27, 2025

Recent research papers about Foundation Models for Combinatorial Optimization

465 36 Updated Feb 10, 2026

The LLMOPT project offers a comprehensive set of resources, including the model, dataset, training framework, and inference code, enabling users to fully utilize LLMOPT.

Python 122 15 Updated Nov 19, 2025

A simple and well-styled PPO implementation. Based on my Medium series: https://medium.com/@eyyu/coding-ppo-from-scratch-with-pytorch-part-1-4-613dfc1b14c8.

Python 1,209 157 Updated Oct 1, 2024

[ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data generation pipeline!

Python 826 68 Updated Mar 17, 2025

ORLM: Training Large Language Models for Optimization Modeling

Python 233 35 Updated Sep 18, 2025

Multilingual Medicine: Model, Dataset, Benchmark, Code

Python 199 8 Updated Oct 15, 2024

Retrieval-Augmented Theorem Provers for Lean

Python 316 69 Updated Jan 30, 2025

Formal to Formal Mathematics Benchmark

Objective-C++ 415 49 Updated Aug 16, 2023

Google Drive CLI Client

Rust 1,997 135 Updated Aug 3, 2024

Implementation of Nougat Neural Optical Understanding for Academic Documents

Python 9,834 624 Updated Feb 21, 2025
Python 130 6 Updated Jul 8, 2024

This dataset code generates mathematical question and answer pairs, from a range of question types at roughly school-level difficulty.

Python 1,933 264 Updated Dec 23, 2024

This is a PyTorch reimplementation of Influence Functions from the ICML 2017 best paper: Understanding Black-box Predictions via Influence Functions by Pang Wei Koh and Percy Liang.

Python 344 74 Updated Oct 29, 2023
Jupyter Notebook 807 182 Updated Dec 29, 2020
Python 671 87 Updated Nov 1, 2024

Everything you need about Active Learning (AL).

966 85 Updated Jun 1, 2024
Python 1,559 159 Updated Feb 5, 2026

AllenAI's post-training codebase

Python 3,574 496 Updated Feb 11, 2026

Tool for data extraction and interacting with Lean programmatically.

Python 761 117 Updated Jan 18, 2026

Fast Inference Solutions for BLOOM

Python 566 112 Updated Oct 9, 2024

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 2,226 370 Updated Aug 14, 2025

800,000 step-level correctness labels on LLM solutions to MATH problems

Python 2,092 122 Updated Jun 1, 2023