Skip to content
View emrekuruu's full-sized avatar

Block or report emrekuruu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen3.6, GPT-OSS, Llama, and more!

Python 9,972 892 Updated Jun 12, 2026

Get up and running with Kimi-K2.6, GLM-5.1, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.

Go 174,190 16,621 Updated Jun 15, 2026

Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.

Python 66,525 5,962 Updated Jun 15, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 82,874 18,065 Updated Jun 15, 2026

Evaluate and improve models and agents using environments

Python 982 179 Updated Jun 15, 2026

Scalable toolkit for efficient model reinforcement

Python 1,729 423 Updated Jun 15, 2026

R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning

Python 717 46 Updated Aug 5, 2025

πŸš€ EvoAgentX: Building a Self-Evolving Ecosystem of AI Agents

Python 3,071 272 Updated May 24, 2026

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 4,938 442 Updated Nov 13, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 19,397 1,489 Updated Feb 27, 2026

Develop. Preview. Ship.

TypeScript 15,666 3,630 Updated Jun 14, 2026

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 21,966 4,075 Updated Jun 15, 2026

A Collection of Papers about Memory for Language Agents

569 44 Updated Jun 12, 2026

An openclaw plugin for autonomous multi-agent job searching.

Python 2 Updated Apr 12, 2026

Realistic Multi-Agent Fire Evacuation Simulator with LLM-Powered Human Behavior (Mesa Framework) πŸ† 3rd Place β€” Agentic Hackathon Zurich (DeepMind x Vercel x ASL) πŸ†

Python 3 1 Updated Mar 1, 2026

A multi-hop multimodal RAG system to chat with your PDFs locally, using iterative retrieval and grounded answers from page-level evidence.

Python 1 Updated Mar 15, 2026
Jupyter Notebook 1 Updated Feb 22, 2026