- Cloud
- hesiod.ch
- https://muriz.ch
Lists (1)
Sort Name ascending (A-Z)
Stars
HRM-Text is a 1B text generation model based on the HRM architecture, strengthened by task completion and latent space reasoning.
Stable Looped Models and their Scaling Laws
An alignment auditing agent capable of quickly exploring alignment hypothesis
LiteRT-LM is Google's production-ready, high-performance, open-source inference framework for deploying Large Language Models on edge devices.
Fully automatic censorship removal for language models
Research code base for Automatic Textbook Formalization
slime is an LLM post-training framework for RL Scaling.
An interface library for RL post training with environments.
A JAX Research Toolkit for Visualizing, Manipulating, and Understanding Gemma Models with Multi-modal Support based on Penzai.
Super basic implementation (gist-like) of RLMs with REPL environments.
Code for tuning Smart Tab Grouping models for Firefox
Clean, reusable paper implementations for trending papers on alphaXiv
OpenTelemetry Instrumentation for AI Observability
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
Pretraining data reconstruction scripts for Apertus
Generate audiobooks from EPUBs, PDFs and text with synchronized captions.
Opensource benchmark evaluating web operators/agents performance
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
A library for making RepE control vectors
Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen3.6, GPT-OSS, Llama, and more!
SkyRL: A Modular Full-stack RL Library for LLMs
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL