Skip to content
View hiyouga's full-sized avatar
🕊️
咕咕咕
🕊️
咕咕咕

Organizations

@the-seeds

Block or report hiyouga

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

A fully automated HTTPS server powered by Nginx, Let's Encrypt and Docker.

Ruby 4,648 297 Updated Mar 25, 2025

LlamaFactory integration with Berkeley Function Calling Leaderboard

Python 5 Updated Oct 31, 2025

A high-level multi-agent development framework built on LangGraph, combining CrewAI’s intuitive concepts with enterprise-grade features, ready-to-use templates, and full-stack UI for rapid producti…

Python 98 5 Updated Nov 4, 2025

PatentWriterAgent Demo

Mermaid 370 83 Updated Oct 28, 2025

A high-performance distributed deep learning system targeting large-scale and automated distributed training.

Python 327 39 Updated Jul 28, 2025

Pokee Deep Research Model Open Source Repo

Python 1,647 1,018 Updated Oct 22, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,147 1,302 Updated Nov 10, 2025

One second to read GitHub code with VS Code.

TypeScript 23,229 899 Updated Oct 30, 2025

An Open-Source AI Chatbot Framework for GitHub Repository Analysis

Python 8 2 Updated Aug 9, 2025

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,195 167 Updated Nov 13, 2025

A Gym for Agentic LLMs

Python 358 22 Updated Nov 10, 2025

A JAX-native LLM Post-Training Library

Python 1,830 162 Updated Nov 14, 2025

Asyncer, async and await, focused on developer experience.

Python 2,238 77 Updated Nov 13, 2025

Typer, build great CLIs. Easy to code. Based on Python type hints.

Python 18,285 801 Updated Nov 13, 2025

Official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508)

Python 54 2 Updated Oct 17, 2025

Quickly rewrite git repository history (filter-branch replacement)

Python 11,083 873 Updated Nov 1, 2025

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

C++ 3,918 313 Updated Nov 14, 2025

The Postgres development platform. Supabase gives you a dedicated Postgres database to build your web, mobile, and AI applications.

TypeScript 92,709 10,656 Updated Nov 14, 2025

Lightweight coding agent that runs in your terminal

Rust 50,470 6,286 Updated Nov 14, 2025

DataFlex is a data-centric training framework that enhances model performance by either selecting the most influential samples, optimizing their weights, or adjusting their mixing ratios.

Python 31 9 Updated Nov 10, 2025

Code for the paper "ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction?"

Python 29 7 Updated Jun 18, 2025

Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse environments

Python 744 168 Updated Nov 14, 2025

Checkpoint-engine is a simple middleware to update model weights in LLM inference engines

Python 820 65 Updated Nov 12, 2025

[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning

Jupyter Notebook 502 44 Updated Oct 20, 2024

Easy Data Preparation with latest LLMs-based Operators and Pipelines.

Python 1,469 101 Updated Nov 14, 2025

Trae Agent is an LLM-based agent for general purpose software engineering tasks.

Python 10,000 1,034 Updated Sep 24, 2025

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 1,291 94 Updated Nov 13, 2025

LLM inference in C/C++

C++ 89,743 13,686 Updated Nov 14, 2025

[Preprint] On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification.

Python 497 20 Updated Nov 5, 2025

Renderer for the harmony response format to be used with gpt-oss

Rust 4,003 225 Updated Nov 5, 2025
Next