OpenAI Guardrails Python (Preview)

Python 59 5 Updated Oct 9, 2025

Pretraining data reconstruction scripts for Apertus

Python 92 4 Updated Oct 9, 2025

Tech Report of the Apertus LLM Suite

118 4 Updated Sep 18, 2025

Response format to be used with Apertus

Python 6 1 Updated Sep 1, 2025

Generate audiobooks from EPUBs, PDFs and text with synchronized captions.

Python 3,676 206 Updated Oct 7, 2025

Open-source benchmark evaluating the performance of web operators/agents

Python 44 7 Updated Apr 11, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 18,750 1,834 Updated Oct 6, 2025

A library for making RepE control vectors

Jupyter Notebook 649 50 Updated Sep 24, 2025

Official GitHub repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]

Python 142 7 Updated Sep 20, 2024

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 4,432 416 Updated Oct 9, 2025

Temporal Python SDK

Python 808 131 Updated Oct 9, 2025

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python 7,542 574 Updated Oct 9, 2025

SkyRL: A Modular Full-stack RL Library for LLMs

Python 994 129 Updated Oct 9, 2025

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 3,285 278 Updated Oct 4, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 12,243 1,509 Updated Apr 24, 2025

Python 356 27 Updated Jun 10, 2025

Scalable RL solution for advanced reasoning of language models

Python 1,749 100 Updated Mar 18, 2025

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 7,141 610 Updated Oct 9, 2025

A lightweight LMM-based Document Parsing Model

Python 6,034 409 Updated Oct 9, 2025

Code and Data for Tau-Bench

Python 869 135 Updated Aug 28, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 59,873 7,343 Updated Oct 9, 2025

Solve Visual Understanding with Reinforced VLMs

Python 5,597 356 Updated Aug 29, 2025

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

Python 11,864 1,080 Updated Sep 26, 2025

Environments for LLM Reinforcement Learning

Python 3,269 376 Updated Oct 8, 2025

Nano vLLM

Python 7,007 891 Updated Aug 31, 2025

A playbook for systematically maximizing the performance of deep learning models.

29,232 2,390 Updated Jun 18, 2024

🤗 smolagents: a barebones library for agents that think in code.

Python 23,289 2,043 Updated Oct 9, 2025

🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.

Python 525 40 Updated Oct 2, 2025

Inference and training library for high-quality TTS models.

Python 5,436 579 Updated Dec 10, 2024

verl: Volcano Engine Reinforcement Learning for LLMs

Python 14,119 2,517 Updated Oct 9, 2025