Skip to content
View icwhite's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report icwhite

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

CyberGym is a large-scale, high-quality cybersecurity evaluation framework designed to rigorously assess the capabilities of AI agents on real-world vulnerability analysis tasks.

Python 280 42 Updated Apr 16, 2026
Python 28 3 Updated Feb 24, 2026

Open Source Model Context Protocol server for PowerPoint automation on Windows via pywin32

Python 68 10 Updated Apr 17, 2026

Building modular LMs with parameter-efficient fine-tuning.

Python 115 22 Updated Apr 3, 2026

A scalable asynchronous reinforcement learning implementation with in-flight weight updates.

Python 404 41 Updated Apr 28, 2026

[ICLR/AAAI 2026] Open-Source LLM-Based Data Analysis Agents

Python 84 7 Updated Apr 28, 2026

Post-training with Tinker

Python 3,189 404 Updated Apr 30, 2026

Democratizing Reinforcement Learning for LLMs

Python 5,463 547 Updated Apr 30, 2026
Python 52 7 Updated Oct 10, 2024

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

Python 9,435 930 Updated Apr 30, 2026

Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks

Python 266 12 Updated May 5, 2025

Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)

Python 161 15 Updated Oct 30, 2024

Project Malmo is a platform for Artificial Intelligence experimentation and research built on top of Minecraft. We aim to inspire a new generation of research into challenging new problems presente…

Java 4,252 610 Updated Sep 3, 2025

Minecraft AI with LLMs+Mineflayer

JavaScript 5,158 779 Updated Apr 24, 2026

Llama factory adaptation for llm minecraft agents

Python 1 Updated Jun 1, 2025

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,798 311 Updated Apr 30, 2026
JavaScript 3 1 Updated Nov 3, 2025

A Text-Based Environment for Interactive Debugging

Python 298 39 Updated Apr 14, 2026

Text Adventure Learning Environment Suite - Benchmark to evaluate language models on interactive text environments.

Jupyter Notebook 27 6 Updated Apr 1, 2026
Python 58 18 Updated Mar 25, 2026

Anonymous Github is a proxy server to support anonymous browsing of Github repositories for open-science code and data.

HTML 2,048 81 Updated Apr 24, 2026

Scripts for serving vllm on multi node

Python 1 Updated Feb 24, 2025

DialOp: Decision-oriented dialogue environments for collaborative language agents

Python 112 8 Updated Nov 15, 2024

Framework and toolkits for building and evaluating collaborative agents that can work together with humans.

Python 128 19 Updated Apr 30, 2026
Python 116 5 Updated Jan 21, 2025
HTML 1 Updated Jan 22, 2026

🐟 A simple theme for Jekyll. Live at https://eliottvincent.github.io/bay/

HTML 185 403 Updated Apr 19, 2026

An Open-Ended Agentic Simulator

Python 60 8 Updated Aug 11, 2024

Simulating Large-Scale Multi-Agent Interactions with Limited Multimodal Senses and Physical Needs

Python 110 24 Updated Sep 30, 2025
Next