Amin Memarian amimem

7 followers · 14 following

Lists (6)

Competition

Dev

Edu

HF

LLMs

RL

36 repositories

Starred repositories

Emerge-Lab / nocturne_lab

Forked from facebookresearch/nocturne

A data-driven, fast driving simulator for multi-agent coordination under partial observability.

Jupyter Notebook 37 6 Updated Aug 29, 2024

CodeClash-ai / CodeClash

Benchmarking Goal-Oriented Software Engineering

Python 72 7 Updated Dec 24, 2025

niklaslauffer / rational-policy-gradient

Codebase for the rational policy gradient algorithm and paper.

Python 10 Updated Nov 13, 2025

allenai / reward-bench

RewardBench: the first evaluation tool for reward models.

Python 670 92 Updated Jun 12, 2025

FreedomIntelligence / TwinMarket

[NeurIPS 2025 & ICLR 2025 Financial AI Best Paper Award] A multi-agent framework that leverages LLMs to simulate socio-economic systems

Python 102 15 Updated Dec 2, 2025

sign / WeLT

Language modeling that treats text as images, leveraging visual structure for enhanced understanding.

Python 12 3 Updated Dec 17, 2025

apple / ml-l3m

Large multi-modal models (L3M) pre-training.

Python 223 13 Updated Sep 22, 2025

meta-pytorch / LeanRL

LeanRL is a fork of CleanRL in which selected PyTorch scripts are optimized for performance using compile and cudagraphs.

Python 665 28 Updated Aug 22, 2025

alex-petrenko / sample-factory

High throughput synchronous and asynchronous reinforcement learning

Python 962 143 Updated Nov 14, 2025

PrimeIntellect-ai / prime-rl

Async RL Training at Scale

Python 956 166 Updated Dec 24, 2025

elevenlabs / elevenlabs-mcp

The official ElevenLabs MCP server

Python 1,111 188 Updated Nov 17, 2025

changjonathanc / flex-nano-vllm

FlexAttention based, minimal vllm-style inference engine for fast Gemma 2 inference.

Python 326 19 Updated Nov 2, 2025

EdanToledo / Stoix

🏛️ A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL

Python 376 44 Updated Oct 29, 2025

Quentin-Anthony / torch-profiling-tutorial

Python 534 32 Updated Aug 6, 2025

goodfire-ai / r1-interpretability

Open source interpretability artefacts for R1.

Jupyter Notebook 165 10 Updated Apr 21, 2025

x1xhlol / system-prompts-and-models-of-ai-tools

FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae, Traycer AI…

102,151 27,180 Updated Dec 19, 2025

NousResearch / atropos

Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse environments

Python 780 182 Updated Dec 25, 2025

Metta-AI / metta

A reinforcement learning codebase focusing on the emergence of cooperation and alignment in multi-agent AI systems.

Python 134 47 Updated Dec 25, 2025

kyegomez / awesome-multi-agent-papers

A compilation of the best multi-agent papers

TeX 1,123 93 Updated Dec 12, 2025

SakanaAI / asal

Automating the Search for Artificial Life with Foundation Models!

Jupyter Notebook 448 52 Updated Oct 23, 2025

Wung8 / Reinforcement-Learning

Python 2 Updated Oct 21, 2025

natolambert / rlhf-book

Textbook on reinforcement learning from human feedback

TeX 1,365 119 Updated Dec 24, 2025

facebookresearch / Pitfalls-of-Memorization

Understanding the interplay between memorization and generalization in neural networks, featuring MAT, a learning algorithm to enhance robustness by mitigating spurious correlations.

Python 40 1 Updated Dec 19, 2024

ollama / ollama

Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.

Go 158,223 14,006 Updated Dec 24, 2025

skills / copilot-codespaces-vscode

1,838 7,103 Updated May 14, 2025

luchris429 / purejaxrl

Really Fast End-to-End Jax RL Implementations

Python 1,006 82 Updated Sep 9, 2024

llm-merging / LLM-Merging

LLM-Merging: Building LLMs Efficiently through Merging

Jupyter Notebook 208 44 Updated Sep 24, 2024

facebookresearch / BenchMARL

BenchMARL is a library for benchmarking Multi-Agent Reinforcement Learning (MARL). BenchMARL lets you quickly compare different MARL algorithms, tasks, and models while being systematically ground…

Python 543 108 Updated Nov 10, 2025

google-deepmind / concordia

A library for generative social simulation

Python 1,124 246 Updated Dec 17, 2025

huggingface / smol-course

A course on aligning smol models.

Jupyter Notebook 6,553 2,298 Updated Nov 10, 2025

Starred topics

Hacktoberfest

Ubuntu

Shell

Software-defined networking

Security

Raspberry Pi

Machine learning

IPFS

iOS

Bootstrap
