Skip to content
View amimem's full-sized avatar

Block or report amimem

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

A data-driven, fast driving simulator for multi-agent coordination under partial observability.

Jupyter Notebook 37 6 Updated Aug 29, 2024

Benchmarking Goal-Oriented Software Engineering

Python 72 7 Updated Dec 24, 2025

Codebase for the rational policy gradient algorithm and paper.

Python 10 Updated Nov 13, 2025

RewardBench: the first evaluation tool for reward models.

Python 670 92 Updated Jun 12, 2025

[NeurIPS 2025 & ICLR 2025 Financial AI Best Paper Award] A multi-agent framework that leverages LLMs to simulate socio-economic systems

Python 102 15 Updated Dec 2, 2025

Language modeling that treats text as images, leveraging visual structure for enhanced understanding.

Python 12 3 Updated Dec 17, 2025

Large multi-modal models (L3M) pre-training.

Python 223 13 Updated Sep 22, 2025

LeanRL is a fork of CleanRL, where selected PyTorch scripts optimized for performance using compile and cudagraphs.

Python 665 28 Updated Aug 22, 2025

High throughput synchronous and asynchronous reinforcement learning

Python 962 143 Updated Nov 14, 2025

Async RL Training at Scale

Python 956 166 Updated Dec 24, 2025

The official ElevenLabs MCP server

Python 1,111 188 Updated Nov 17, 2025

FlexAttention based, minimal vllm-style inference engine for fast Gemma 2 inference.

Python 326 19 Updated Nov 2, 2025

🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL

Python 376 44 Updated Oct 29, 2025

Open source interpretability artefacts for R1.

Jupyter Notebook 165 10 Updated Apr 21, 2025

FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae, Traycer AI…

102,151 27,180 Updated Dec 19, 2025

Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse environments

Python 780 182 Updated Dec 25, 2025

A reinforcement learning codebase focusing on the emergence of cooperation and alignment in multi-agent AI systems.

Python 134 47 Updated Dec 25, 2025

A compilation of the best multi-agent papers

TeX 1,123 93 Updated Dec 12, 2025

Automating the Search for Artificial Life with Foundation Models!

Jupyter Notebook 448 52 Updated Oct 23, 2025
Python 2 Updated Oct 21, 2025

Textbook on reinforcement learning from human feedback

TeX 1,365 119 Updated Dec 24, 2025

Understanding the interplay between memorization and generalization in neural networks, featuring MAT, a learning algorithm to enhance robustness by mitigating spurious correlations.

Python 40 1 Updated Dec 19, 2024

Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.

Go 158,223 14,006 Updated Dec 24, 2025

Really Fast End-to-End Jax RL Implementations

Python 1,006 82 Updated Sep 9, 2024

LLM-Merging: Building LLMs Efficiently through Merging

Jupyter Notebook 208 44 Updated Sep 24, 2024

BenchMARL is a library for benchmarking Multi-Agent Reinforcement Learning (MARL). BenchMARL allows to quickly compare different MARL algorithms, tasks, and models while being systematically ground…

Python 543 108 Updated Nov 10, 2025

A library for generative social simulation

Python 1,124 246 Updated Dec 17, 2025

A course on aligning smol models.

Jupyter Notebook 6,553 2,298 Updated Nov 10, 2025
Next