Skip to content
View Murgio's full-sized avatar
💭
I may be slow to respond.
💭
I may be slow to respond.

Block or report Murgio

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

HRM-Text is a 1B text generation model based on the HRM architecture, strengthened by task completion and latent space reasoning.

Python 1,456 135 Updated Jun 17, 2026

Stable Looped Models and their Scaling Laws

Python 166 11 Updated May 17, 2026

An alignment auditing agent capable of quickly exploring alignment hypothesis

Python 1,234 199 Updated Jun 18, 2026

LiteRT-LM is Google's production-ready, high-performance, open-source inference framework for deploying Large Language Models on edge devices.

C++ 5,638 587 Updated Jun 19, 2026

Fully automatic censorship removal for language models

Python 25,164 2,708 Updated Jun 19, 2026

Research code base for Automatic Textbook Formalization

Python 155 10 Updated Mar 31, 2026

slime is an LLM post-training framework for RL Scaling.

Python 6,253 910 Updated Jun 19, 2026

An interface library for RL post training with environments.

Python 2,318 397 Updated Jun 19, 2026

A JAX Research Toolkit for Visualizing, Manipulating, and Understanding Gemma Models with Multi-modal Support based on Penzai.

Jupyter Notebook 93 10 Updated Jan 13, 2026

Super basic implementation (gist-like) of RLMs with REPL environments.

Python 804 135 Updated Jan 7, 2026

Like catnip, a highly addictive agentic coding tool

Go 488 39 Updated May 20, 2026

Code for tuning Smart Tab Grouping models for Firefox

Jupyter Notebook 21 4 Updated Dec 4, 2025

Clean, reusable paper implementations for trending papers on alphaXiv

Python 175 19 Updated Mar 17, 2026
Python 83 17 Updated Feb 18, 2026

OpenTelemetry Instrumentation for AI Observability

Python 1,037 258 Updated Jun 18, 2026

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python 2,454 492 Updated Jun 9, 2026

OpenAI Guardrails - Python

Python 215 35 Updated Mar 28, 2026

Pretraining data reconstruction scripts for Apertus

Python 129 13 Updated Oct 27, 2025

Tech Report of the Apertus LLM

133 5 Updated Mar 9, 2026

Response format to be used with apertus

Python 12 1 Updated Dec 3, 2025

Generate audiobooks from EPUBs, PDFs and text with synchronized captions.

Python 4,859 320 Updated May 25, 2026

Opensource benchmark evaluating web operators/agents performance

Python 49 7 Updated Apr 11, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 20,174 2,096 Updated Jun 9, 2026

A library for making RepE control vectors

Jupyter Notebook 733 64 Updated Sep 24, 2025

Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]

Python 148 7 Updated Sep 20, 2024

Democratizing Reinforcement Learning for LLMs

Python 5,630 577 Updated Jun 18, 2026

Temporal Python SDK

Python 1,103 198 Updated Jun 19, 2026

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen3.6, GPT-OSS, Llama, and more!

Python 10,000 892 Updated Jun 18, 2026

SkyRL: A Modular Full-stack RL Library for LLMs

Python 2,008 356 Updated Jun 19, 2026

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 4,968 445 Updated Nov 13, 2025
Next