Skip to content
View hiyouga's full-sized avatar
🕊️
咕咕咕
🕊️
咕咕咕

Organizations

@the-seeds

Block or report hiyouga

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 250 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

A Gym for Agentic LLMs

Python 282 12 Updated Oct 9, 2025

A JAX-native LLM Post-Training Library

Python 1,592 137 Updated Oct 9, 2025

Asyncer, async and await, focused on developer experience.

Python 2,168 74 Updated Oct 8, 2025

Typer, build great CLIs. Easy to code. Based on Python type hints.

Python 18,037 782 Updated Oct 8, 2025

Official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508)

Python 51 2 Updated Oct 7, 2025

Quickly rewrite git repository history (filter-branch replacement)

Python 10,838 858 Updated Oct 1, 2025

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

C++ 3,316 241 Updated Oct 9, 2025

The Postgres development platform. Supabase gives you a dedicated Postgres database to build your web, mobile, and AI applications.

TypeScript 89,768 10,120 Updated Oct 9, 2025

Lightweight coding agent that runs in your terminal

Rust 46,598 5,585 Updated Oct 9, 2025

DataFlex is a data-centric training framework that enhances model performance by either selecting the most influential samples, optimizing their weights, or adjusting their mixing ratios.

Python 25 6 Updated Oct 5, 2025
Python 20 10 Updated Sep 29, 2025

Code for the paper "ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction?"

Python 28 7 Updated Jun 18, 2025

Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse environments

Python 703 160 Updated Oct 7, 2025

Checkpoint-engine is a simple middleware to update model weights in LLM inference engines

Python 757 54 Updated Sep 30, 2025

[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning

Jupyter Notebook 496 46 Updated Oct 20, 2024

Easy Data Preparation with latest LLMs-based Operators and Pipelines.

Python 1,370 90 Updated Oct 9, 2025

Trae Agent is an LLM-based agent for general purpose software engineering tasks.

Python 9,624 994 Updated Sep 24, 2025

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 1,195 71 Updated Oct 8, 2025

LLM inference in C/C++

C++ 87,382 13,260 Updated Oct 9, 2025

[Preprint] On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification.

Python 465 17 Updated Sep 24, 2025

Renderer for the harmony response format to be used with gpt-oss

Rust 3,877 208 Updated Aug 15, 2025

Collection of scripts and notebooks for OpenAI's latest GPT OSS models

Jupyter Notebook 454 48 Updated Aug 25, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 18,737 1,830 Updated Oct 6, 2025

The official code implementation of the ACL2025 paper “A Text is Worth Several Tokens: Text Embedding from LLMs Secretly Aligns Well with The Key Tokens”. Text Embedding from LLMs Secretly Aligns W…

Python 15 Updated Jul 12, 2025

A lightweight, local-first, and free experiment tracking library from Hugging Face 🤗

Python 931 61 Updated Oct 8, 2025

✨ Agentic Reinforced Policy Optimization

Python 638 29 Updated Sep 17, 2025

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 78,382 8,489 Updated Oct 9, 2025

Qwen Code is a coding agent that lives in the digital world.

TypeScript 14,029 1,109 Updated Oct 9, 2025

Text-audio foundation model from Boson AI

Python 7,415 536 Updated Sep 15, 2025

一个面向多模态大模型训练的智能数据集构建与评估平台

TypeScript 124 11 Updated Sep 30, 2025
Next