Skip to content
View icwhite's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report icwhite

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

CyberGym is a large-scale, high-quality cybersecurity evaluation framework designed to rigorously assess the capabilities of AI agents on real-world vulnerability analysis tasks.

Python 216 36 Updated Feb 23, 2026
Python 24 3 Updated Feb 24, 2026

Open Source Model Context Protocol server for PowerPoint automation on Windows via pywin32

Python 53 9 Updated Apr 9, 2026

Building modular LMs with parameter-efficient fine-tuning.

Python 114 22 Updated Apr 3, 2026

A scalable asynchronous reinforcement learning implementation with in-flight weight updates.

Python 394 42 Updated Apr 9, 2026

[ICLR/AAAI 2026] Open-Source LLM-Based Data Analysis Agents

Python 79 5 Updated Jan 26, 2026

Post-training with Tinker

Python 3,054 376 Updated Apr 10, 2026

Democratizing Reinforcement Learning for LLMs

Python 5,406 539 Updated Apr 10, 2026
Python 52 7 Updated Oct 10, 2024

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)

Python 9,336 915 Updated Apr 10, 2026

Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks

Python 266 11 Updated May 5, 2025

Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)

Python 161 15 Updated Oct 30, 2024

Project Malmo is a platform for Artificial Intelligence experimentation and research built on top of Minecraft. We aim to inspire a new generation of research into challenging new problems presente…

Java 4,250 607 Updated Sep 3, 2025

Minecraft AI with LLMs+Mineflayer

JavaScript 5,078 749 Updated Apr 4, 2026

Llama factory adaptation for llm minecraft agents

Python 1 Updated Jun 1, 2025

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,751 295 Updated Apr 10, 2026
JavaScript 3 1 Updated Nov 3, 2025

A Text-Based Environment for Interactive Debugging

Python 296 39 Updated Mar 23, 2026

Text Adventure Learning Environment Suite - Benchmark to evaluate language models on interactive text environments.

Jupyter Notebook 26 6 Updated Apr 1, 2026
Python 58 18 Updated Mar 25, 2026

Anonymous Github is a proxy server to support anonymous browsing of Github repositories for open-science code and data.

TypeScript 2,035 80 Updated Jan 21, 2026

Scripts for serving vllm on multi node

Python 1 Updated Feb 24, 2025

DialOp: Decision-oriented dialogue environments for collaborative language agents

Python 111 8 Updated Nov 15, 2024

Framework and toolkits for building and evaluating collaborative agents that can work together with humans.

Python 124 19 Updated Dec 4, 2025
Python 116 5 Updated Jan 21, 2025
HTML 1 Updated Jan 22, 2026

🐟 A simple theme for Jekyll. Live at https://eliottvincent.github.io/bay/

HTML 184 403 Updated Apr 8, 2026

An Open-Ended Agentic Simulator

Python 60 8 Updated Aug 11, 2024

Simulating Large-Scale Multi-Agent Interactions with Limited Multimodal Senses and Physical Needs

Python 110 24 Updated Sep 30, 2025
Next