zhxieml

1660 results for source starred repositories

OS-Sentinel

Python 32 1 Updated Nov 4, 2025

Open-Source LLM-Based Data Analysis Agents

Python 49 4 Updated Oct 17, 2025

A Python module to repair invalid JSON from LLMs

Python 3,859 151 Updated Nov 1, 2025

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Python 120 5 Updated Nov 5, 2025

KernelBench: Can LLMs Write GPU Kernels? - Benchmark with Torch -> CUDA (+ more DSLs)

Python 645 80 Updated Nov 5, 2025

This repository contains the toolkit for replicating results from our technical report.

Python 159 17 Updated Sep 3, 2025

Official repo of Toucan: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments

Python 163 7 Updated Oct 7, 2025
Python 95 4 Updated Nov 4, 2025

A scalable asynchronous reinforcement learning implementation with in-flight weight updates.

Python 279 27 Updated Nov 5, 2025

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 2,197 136 Updated Nov 5, 2025

The best ChatGPT that $100 can buy.

Python 35,800 4,128 Updated Nov 5, 2025

Best practices for training DeepSeek, Mixtral, Qwen and other MoE models using Megatron Core.

Python 123 26 Updated Nov 4, 2025

The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.

Python 1,414 201 Updated Oct 31, 2025

An extremely fast Python linter and code formatter, written in Rust.

Rust 43,583 1,600 Updated Nov 6, 2025
Python 59 8 Updated Sep 29, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…

Python 10,900 946 Updated Nov 6, 2025

A Gym for Agentic LLMs

Python 347 20 Updated Oct 30, 2025

Weave is a toolkit for developing AI-powered applications, built by Weights & Biases.

Python 1,016 134 Updated Nov 6, 2025

Post-training with Tinker

Python 1,437 113 Updated Nov 5, 2025

Accompanying material for the sleep-time compute paper

Python 117 13 Updated Apr 30, 2025

Supporting code for the blog post on modular manifolds.

Python 100 14 Updated Sep 26, 2025

Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation.

Python 704 55 Updated Sep 24, 2025

A sophisticated multi-step reasoning pipeline powered by the Datarus-R1-14B-Preview model

Python 223 3 Updated Aug 21, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 15,993 1,265 Updated Oct 27, 2025

Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike static benchmarks, this platform introduces evolving environment…

Python 338 42 Updated Oct 31, 2025

[ICLR 2025] Automated Design of Agentic Systems

Python 1,444 221 Updated Jan 28, 2025

[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Python 1,210 100 Updated Oct 6, 2025

Training LLMs to reason and analyze data with notebooks

Python 49 4 Updated Sep 10, 2025