Skip to content
View hiyouga's full-sized avatar
🕊️
咕咕咕
🕊️
咕咕咕

Organizations

@the-seeds

Block or report hiyouga

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

A fully automated HTTPS server powered by Nginx, Let's Encrypt and Docker.

Ruby 4,635 298 Updated Mar 25, 2025

LlamaFactory integration with Berkeley Function Calling Leaderboard

Python 5 Updated Oct 31, 2025

A high-level multi-agent development framework built on LangGraph, combining CrewAI’s intuitive concepts with enterprise-grade features, ready-to-use templates, and full-stack UI for rapid producti…

Python 93 4 Updated Nov 4, 2025

PatentWriterAgent Demo

Mermaid 357 80 Updated Oct 28, 2025

A high-performance distributed deep learning system targeting large-scale and automated distributed training.

Python 326 39 Updated Jul 28, 2025

Pokee Deep Research Model Open Source Repo

Python 1,608 1,009 Updated Oct 22, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 16,962 1,294 Updated Nov 3, 2025

One second to read GitHub code with VS Code.

TypeScript 23,217 897 Updated Oct 30, 2025

An Open-Source AI Chatbot Framework for GitHub Repository Analysis

Python 8 2 Updated Aug 9, 2025

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,159 160 Updated Nov 6, 2025

A Gym for Agentic LLMs

Python 347 20 Updated Oct 30, 2025

A JAX-native LLM Post-Training Library

Python 1,712 150 Updated Nov 6, 2025

Asyncer, async and await, focused on developer experience.

Python 2,227 77 Updated Nov 4, 2025

Typer, build great CLIs. Easy to code. Based on Python type hints.

Python 18,220 800 Updated Nov 3, 2025

Official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508)

Python 53 2 Updated Oct 17, 2025

Quickly rewrite git repository history (filter-branch replacement)

Python 11,034 872 Updated Nov 1, 2025

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

C++ 3,867 301 Updated Nov 6, 2025

The Postgres development platform. Supabase gives you a dedicated Postgres database to build your web, mobile, and AI applications.

TypeScript 92,048 10,511 Updated Nov 6, 2025

Lightweight coding agent that runs in your terminal

Rust 49,911 6,171 Updated Nov 6, 2025

DataFlex is a data-centric training framework that enhances model performance by either selecting the most influential samples, optimizing their weights, or adjusting their mixing ratios.

Python 30 9 Updated Nov 4, 2025

Code for the paper "ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction?"

Python 29 7 Updated Jun 18, 2025

Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse environments

Python 738 164 Updated Nov 6, 2025

Checkpoint-engine is a simple middleware to update model weights in LLM inference engines

Python 806 61 Updated Nov 4, 2025

[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning

Jupyter Notebook 498 45 Updated Oct 20, 2024

Easy Data Preparation with latest LLMs-based Operators and Pipelines.

Python 1,443 100 Updated Nov 6, 2025

Trae Agent is an LLM-based agent for general purpose software engineering tasks.

Python 9,907 1,024 Updated Sep 24, 2025

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 1,272 92 Updated Nov 6, 2025

LLM inference in C/C++

C++ 89,236 13,583 Updated Nov 6, 2025

[Preprint] On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification.

Python 491 20 Updated Nov 5, 2025
Next