Skip to content
View Murgio's full-sized avatar
💭
I may be slow to respond.
💭
I may be slow to respond.

Block or report Murgio

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

HRM-Text is a 1B text generation model based on the HRM architecture, strengthened by task completion and latent space reasoning.

Python 1,309 119 Updated May 27, 2026

Stable Looped Models and their Scaling Laws

Python 161 11 Updated May 17, 2026

An alignment auditing agent capable of quickly exploring alignment hypothesis

Python 1,232 198 Updated Jun 13, 2026

LiteRT-LM is Google's production-ready, high-performance, open-source inference framework for deploying Large Language Models on edge devices.

C++ 5,584 577 Updated Jun 13, 2026

Fully automatic censorship removal for language models

Python 24,465 2,625 Updated Jun 13, 2026

Research code base for Automatic Textbook Formalization

Python 154 10 Updated Mar 31, 2026

slime is an LLM post-training framework for RL Scaling.

Python 6,112 894 Updated Jun 13, 2026

An interface library for RL post training with environments.

Python 2,183 393 Updated Jun 13, 2026

A JAX Research Toolkit for Visualizing, Manipulating, and Understanding Gemma Models with Multi-modal Support based on Penzai.

Jupyter Notebook 93 10 Updated Jan 13, 2026

Super basic implementation (gist-like) of RLMs with REPL environments.

Python 797 134 Updated Jan 7, 2026

Like catnip, a highly addictive agentic coding tool

Go 486 39 Updated May 20, 2026

Code for tuning Smart Tab Grouping models for Firefox

Jupyter Notebook 21 4 Updated Dec 4, 2025

Clean, reusable paper implementations for trending papers on alphaXiv

Python 174 19 Updated Mar 17, 2026
Python 83 17 Updated Feb 18, 2026

OpenTelemetry Instrumentation for AI Observability

Python 1,023 258 Updated Jun 11, 2026

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python 2,446 486 Updated Jun 9, 2026

OpenAI Guardrails - Python

Python 214 34 Updated Mar 28, 2026

Pretraining data reconstruction scripts for Apertus

Python 127 12 Updated Oct 27, 2025

Tech Report of the Apertus LLM

133 5 Updated Mar 9, 2026

Response format to be used with apertus

Python 12 1 Updated Dec 3, 2025

Generate audiobooks from EPUBs, PDFs and text with synchronized captions.

Python 4,825 317 Updated May 25, 2026

Opensource benchmark evaluating web operators/agents performance

Python 49 7 Updated Apr 11, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 20,161 2,095 Updated Jun 9, 2026

A library for making RepE control vectors

Jupyter Notebook 732 64 Updated Sep 24, 2025

Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]

Python 148 7 Updated Sep 20, 2024

Democratizing Reinforcement Learning for LLMs

Python 5,609 577 Updated Jun 13, 2026

Temporal Python SDK

Python 1,093 195 Updated Jun 12, 2026

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen3.6, GPT-OSS, Llama, and more!

Python 9,968 892 Updated Jun 12, 2026

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,994 351 Updated Jun 14, 2026

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 4,929 442 Updated Nov 13, 2025
Next