hiyouga

🕊️

咕咕咕

Yaowei Zheng hiyouga

🕊️

咕咕咕

No code All live

5.8k followers · 96 following

Millennium Science School
Beijing, China
15:52 (UTC +08:00)
@llamafactory_ai
https://huggingface.co/hiyouga

Achievements

x4 x4 x3 x4

Achievements

x4 x4 x3 x4

Organizations

Lists (2)

Sort

🔮 Future ideas

🚀 My stack

Starred repositories

SteveLTN / https-portal

A fully automated HTTPS server powered by Nginx, Let's Encrypt and Docker.

Ruby 4,648 297 Updated Mar 25, 2025

XueyiC / Llama-Factory-BFCL

LlamaFactory integration with Berkeley Function Calling Leaderboard

Python 5 Updated Oct 31, 2025

01-ai / langcrew

A high-level multi-agent development framework built on LangGraph, combining CrewAI’s intuitive concepts with enterprise-grade features, ready-to-use templates, and full-stack UI for rapid producti…

Python 98 5 Updated Nov 4, 2025

ninehills / PatentWriterAgent

PatentWriterAgent Demo

Mermaid 370 83 Updated Oct 28, 2025

PKU-DAIR / Hetu

Forked from Hsword/Hetu

A high-performance distributed deep learning system targeting large-scale and automated distributed training.

Python 327 39 Updated Jul 28, 2025

Pokee-AI / PokeeResearchOSS

Pokee Deep Research Model Open Source Repo

Python 1,647 1,018 Updated Oct 22, 2025

Alibaba-NLP / DeepResearch

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,147 1,302 Updated Nov 10, 2025

conwnet / github1s

One second to read GitHub code with VS Code.

TypeScript 23,229 899 Updated Oct 30, 2025

oGYCo / GithubBot

An Open-Source AI Chatbot Framework for GitHub Repository Analysis

Python 8 2 Updated Aug 9, 2025

NovaSky-AI / SkyRL

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,195 167 Updated Nov 13, 2025

axon-rl / gem

A Gym for Agentic LLMs

Python 358 22 Updated Nov 10, 2025

google / tunix

A JAX-native LLM Post-Training Library

Python 1,830 162 Updated Nov 14, 2025

fastapi / asyncer

Asyncer, async and await, focused on developer experience.

Python 2,238 77 Updated Nov 13, 2025

fastapi / typer

Typer, build great CLIs. Easy to code. Based on Python type hints.

Python 18,285 801 Updated Nov 13, 2025

complex-reasoning / RPG

Official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508)

Python 54 2 Updated Oct 17, 2025

newren / git-filter-repo

Quickly rewrite git repository history (filter-branch replacement)

Python 11,083 873 Updated Nov 1, 2025

tile-ai / tilelang

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

C++ 3,918 313 Updated Nov 14, 2025

supabase / supabase

The Postgres development platform. Supabase gives you a dedicated Postgres database to build your web, mobile, and AI applications.

TypeScript 92,709 10,656 Updated Nov 14, 2025

openai / codex

Lightweight coding agent that runs in your terminal

Rust 50,470 6,286 Updated Nov 14, 2025

OpenDCAI / DataFlex

DataFlex is a data-centric training framework that enhances model performance by either selecting the most influential samples, optimizing their weights, or adjusting their mixing ratios.

Python 31 9 Updated Nov 10, 2025

canyuchen / ClinicalBench

Code for the paper "ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction?"

Python 29 7 Updated Jun 18, 2025

NousResearch / atropos

Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse environments

Python 744 168 Updated Nov 14, 2025

MoonshotAI / checkpoint-engine

Checkpoint-engine is a simple middleware to update model weights in LLM inference engines

Python 820 65 Updated Nov 12, 2025

princeton-nlp / LESS

[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning

Jupyter Notebook 502 44 Updated Oct 20, 2024

OpenDCAI / DataFlow

Easy Data Preparation with latest LLMs-based Operators and Pipelines.

Python 1,469 101 Updated Nov 14, 2025

bytedance / trae-agent

Trae Agent is an LLM-based agent for general purpose software engineering tasks.

Python 10,000 1,034 Updated Sep 24, 2025

ByteDance-Seed / VeOmni

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 1,291 94 Updated Nov 13, 2025

ggml-org / llama.cpp

LLM inference in C/C++

C++ 89,743 13,686 Updated Nov 14, 2025

yongliang-wu / DFT

[Preprint] On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification.

Python 497 20 Updated Nov 5, 2025

openai / harmony

Renderer for the harmony response format to be used with gpt-oss

Rust 4,003 225 Updated Nov 5, 2025