View saa1605's full-sized avatar
🏠
Working from home

Starred repositories

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 2,331 185 Updated Oct 8, 2025

Environments for LLM Reinforcement Learning

Python 3,265 374 Updated Oct 8, 2025

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. It is also the official code for the paper "Group-in-Group Policy Optimization for LLM Agent Training".

Python 981 81 Updated Oct 6, 2025

Official code for NeurIPS 2025 paper "GRIT: Teaching MLLMs to Think with Images"

Python 144 6 Updated Aug 4, 2025

Official Repository of Absolute Zero Reasoner

Python 1,702 281 Updated Aug 24, 2025

🔔🧠 Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks!

Python 54 10 Updated Jul 9, 2025

🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agents", ACL'24 Best Resource Paper.

Python 249 28 Updated Aug 10, 2025

Agent S: an open agentic framework that uses computers like a human

Python 6,946 757 Updated Oct 5, 2025

Open Agent Computer Interface

Python 85 22 Updated Nov 26, 2024

[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Python 2,207 305 Updated Oct 6, 2025

RL agents that play Hanabi, the well-known AI benchmark game. Contains DQN and PG variants, as well as a graphical UI for trying out the agents.

Python 4 Updated Dec 27, 2022

[ICLR 2024] Source code for the paper "Building Cooperative Embodied Agents Modularly with Large Language Models"

Python 278 45 Updated Mar 30, 2025
Python 88 14 Updated Apr 15, 2022

Colorful Prompt Tuning for Pre-trained Vision-Language Models

Python 49 3 Updated Nov 1, 2022

Cornell Touchdown natural language navigation and spatial reasoning dataset.

Python 103 13 Updated Sep 5, 2020

Code for ORAR Agent for Vision and Language Navigation on Touchdown and map2seq

Python 17 1 Updated Nov 3, 2023

A Probabilistic Programming Language in 70 lines of Python. Code for the blog post https://mrandri19.github.io/2022/01/12/a-PPL-in-70-lines-of-python.html

Python 18 Updated Feb 10, 2022

Localization via embodied dialog on the navigation graph

Python 13 2 Updated Apr 18, 2022

A music programming language for musicians. 🎶

Go 5,795 300 Updated Sep 20, 2025

Train transformer language models with reinforcement learning.

Python 15,776 2,225 Updated Oct 9, 2025

Keyboard using predictive words generated by an NLP model

Kotlin 41 10 Updated Jan 12, 2019

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 22,677 2,956 Updated Aug 15, 2024