AceCoooool
😴 lazy.

Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.

Python 1,042 145 Updated Apr 3, 2026

Text and code embeddings research from CodeFuse: C2LLM, D2LLM, E2LLM, F2LLM

Python 419 55 Updated Mar 26, 2026

Standardized environment infrastructure for Agentic AI development.

Python 291 35 Updated Mar 25, 2026

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,731 291 Updated Apr 3, 2026

An Open-Source Asynchronous Coding Agent

Python 9,078 1,037 Updated Apr 3, 2026

🚀 PR Agent - The Original Open-Source PR Reviewer. This repo is not the Qodo free tier! Try the free version on our website.

Python 10,754 1,401 Updated Apr 2, 2026

Kimi K2 is the large language model series developed by Moonshot AI team

10,591 806 Updated Jan 21, 2026

Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.

TypeScript 59,844 6,085 Updated Apr 3, 2026

Trae Agent is an LLM-based agent for general purpose software engineering tasks.

Python 11,237 1,217 Updated Feb 5, 2026

slime is an LLM post-training framework for RL Scaling.

Python 5,109 688 Updated Apr 3, 2026

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

Python 4,975 446 Updated Apr 3, 2026

[NeurIPS 2025] A Graph-based LLM Framework for Real-world SE Tasks

Python 528 55 Updated Sep 19, 2025

[ICML '24] R2E: Turn any GitHub Repository into a Programming Agent Environment

Python 144 14 Updated Apr 20, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...)…

Python 13,516 1,323 Updated Apr 3, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,424 3,572 Updated Apr 3, 2026

My learning notes for ML SYS.

Python 5,846 381 Updated Apr 3, 2026

SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]

Python 18,915 2,047 Updated Mar 30, 2026

Fully open reproduction of DeepSeek-R1

Python 25,967 2,410 Updated Apr 2, 2026

Efficient Triton Kernels for LLM Training

Python 6,258 509 Updated Apr 3, 2026

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)

Python 9,298 911 Updated Apr 3, 2026

A generative world for general-purpose robotics & embodied AI learning.

Python 28,407 2,647 Updated Apr 3, 2026

Xiaomi Home Integration for Home Assistant

Python 21,594 1,143 Updated Jan 28, 2026

Code for the Molmo Vision-Language Model

Python 896 93 Updated Dec 12, 2024

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 5,043 338 Updated Mar 17, 2026

[ICCV 2025, Highlight] ZIM: Zero-Shot Image Matting for Anything

Python 408 30 Updated Aug 28, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,699 2,235 Updated Feb 1, 2025

PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)

Jupyter Notebook 2,093 352 Updated Jul 14, 2024

Led by 小彭老师: a Chinese-language encyclopedia of modern C++

Typst 1,021 76 Updated Mar 21, 2026

Next-Token Prediction is All You Need

Python 2,389 95 Updated Jan 12, 2026

Official Code for ECCV 2024 paper — One-Shot Diffusion Mimicker for Handwritten Text Generation

Python 544 57 Updated Oct 15, 2025