-
Millennium Science School
- Beijing, China
-
23:41
(UTC +08:00) - @llamafactory_ai
- https://huggingface.co/hiyouga
Lists (2)
Sort Name ascending (A-Z)
Starred repositories
A fully automated HTTPS server powered by Nginx, Let's Encrypt and Docker.
LlamaFactory integration with Berkeley Function Calling Leaderboard
A high-level multi-agent development framework built on LangGraph, combining CrewAI’s intuitive concepts with enterprise-grade features, ready-to-use templates, and full-stack UI for rapid producti…
PKU-DAIR / Hetu
Forked from Hsword/HetuA high-performance distributed deep learning system targeting large-scale and automated distributed training.
Pokee Deep Research Model Open Source Repo
Tongyi Deep Research, the Leading Open-source Deep Research Agent
One second to read GitHub code with VS Code.
An Open-Source AI Chatbot Framework for GitHub Repository Analysis
SkyRL: A Modular Full-stack RL Library for LLMs
Asyncer, async and await, focused on developer experience.
Typer, build great CLIs. Easy to code. Based on Python type hints.
Official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508)
Quickly rewrite git repository history (filter-branch replacement)
Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels
The Postgres development platform. Supabase gives you a dedicated Postgres database to build your web, mobile, and AI applications.
Lightweight coding agent that runs in your terminal
DataFlex is a data-centric training framework that enhances model performance by either selecting the most influential samples, optimizing their weights, or adjusting their mixing ratios.
Code for the paper "ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction?"
Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse environments
Checkpoint-engine is a simple middleware to update model weights in LLM inference engines
[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning
Easy Data Preparation with latest LLMs-based Operators and Pipelines.
Trae Agent is an LLM-based agent for general purpose software engineering tasks.
VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo
[Preprint] On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification.