Skip to content
View aldopareja's full-sized avatar

Sponsoring

@patrick-kidger

Organizations

@instructlab

Block or report aldopareja

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Learning Deep Representations of Data Distributions

TeX 723 59 Updated Dec 23, 2025
Python 308 44 Updated Dec 12, 2025

Declarative visualization library for Python

Python 10,179 831 Updated Dec 22, 2025

Extremely fast Query Engine for DataFrames, written in Rust

Rust 36,661 2,524 Updated Dec 23, 2025

A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with git. Stored as pure Python. All in a modern, AI-native editor.

Python 17,994 841 Updated Dec 23, 2025

Our library for RL environments + evals

Python 3,656 454 Updated Dec 23, 2025

🙌 OpenHands: AI-Driven Development

Python 65,876 8,105 Updated Dec 23, 2025

🌊 The leading agent orchestration platform for Claude. Deploy intelligent multi-agent swarms, coordinate autonomous workflows, and build conversational AI systems. Features enterprise-grade archite…

JavaScript 10,829 1,413 Updated Dec 9, 2025

Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"

Python 270 27 Updated Oct 16, 2025

MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining

Python 1,852 75 Updated Jun 5, 2025
Python 1 Updated Jun 8, 2025

Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents

Python 1,772 382 Updated Aug 13, 2025

Implementing DeepSeek R1's GRPO algorithm from scratch

Python 1,718 81 Updated Apr 18, 2025

FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae, Traycer AI…

101,764 27,124 Updated Dec 19, 2025

PyTorch building blocks for the OLMo ecosystem

Python 617 110 Updated Dec 23, 2025

Benchmarking Agentic LLM and VLM Reasoning On Games

Python 218 41 Updated Dec 3, 2025

A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning

Python 330 77 Updated Oct 29, 2025

Rust based package manager for macOS

Rust 1,852 20 Updated Sep 2, 2025

fast trainer for educational purposes

Python 22 12 Updated Nov 26, 2025

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

Python 74,066 8,869 Updated Dec 23, 2025

A bunch of kernels that might make stuff slower 😉

Python 69 9 Updated Dec 23, 2025

Amazon Nova Act is an AWS service for building and deploying highly reliable AI agents that automate UI-based workflows at scale.

Python 869 138 Updated Dec 16, 2025

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

Python 3,261 256 Updated Dec 23, 2025

Synthetic Data Generation Toolkit for LLMs

Python 79 42 Updated Dec 18, 2025

Pocket Flow: 100-line LLM framework. Let Agents build Agents!

Python 9,293 1,034 Updated Aug 13, 2025

No fortress, purely open ground. OpenManus is Coming.

Python 51,427 8,973 Updated Nov 17, 2025

Democratizing Reinforcement Learning for LLMs

Python 4,897 469 Updated Dec 23, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,735 2,878 Updated Dec 23, 2025

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Python 2,993 222 Updated Dec 22, 2025

LM engine is a library for pretraining/finetuning LLMs

Python 102 25 Updated Dec 23, 2025
Next