icwhite

Follow

🎯

Focusing

Isadora White icwhite

🎯

Focusing

Follow

6 followers · 1 following

Achievements

Achievements

Stars

sunblaze-ucb / cybergym

CyberGym is a large-scale, high-quality cybersecurity evaluation framework designed to rigorously assess the capabilities of AI agents on real-world vulnerability analysis tasks.

Python 280 42 Updated Apr 16, 2026

danieldritter / OAPL

Python 28 3 Updated Feb 24, 2026

Ayushmaniar / powerpoint-mcp

Open Source Model Context Protocol server for PowerPoint automation on Windows via pywin32

Python 68 10 Updated Apr 17, 2026

microsoft / mttl

Building modular LMs with parameter-efficient fine-tuning.

Python 115 22 Updated Apr 3, 2026

ServiceNow / PipelineRL

A scalable asynchronous reinforcement learning implementation with in-flight weight updates.

Python 404 41 Updated Apr 28, 2026

zjunlp / DataMind

[ICLR/AAAI 2026] Open-Source LLM-Based Data Analysis Agents

Python 84 7 Updated Apr 28, 2026

thinking-machines-lab / tinker-cookbook

Post-training with Tinker

Python 3,189 404 Updated Apr 30, 2026

rllm-org / rllm

Democratizing Reinforcement Learning for LLMs

Python 5,463 547 Updated Apr 30, 2026

swt-user / DMPO

Python 52 7 Updated Oct 10, 2024

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

Python 9,435 930 Updated Apr 30, 2026

facebookresearch / sweet_rl

Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks

Python 266 12 Updated May 5, 2025

Yifan-Song793 / ETO

Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)

Python 161 15 Updated Oct 30, 2024

microsoft / malmo

Project Malmo is a platform for Artificial Intelligence experimentation and research built on top of Minecraft. We aim to inspire a new generation of research into challenging new problems presente…

Java 4,252 610 Updated Sep 3, 2025

mindcraft-bots / mindcraft

Minecraft AI with LLMs+Mineflayer

JavaScript 5,158 779 Updated Apr 24, 2026

hlillemark / LLaMA-Factory-mc

Forked from hiyouga/LlamaFactory

Llama factory adaptation for llm minecraft agents

Python 1 Updated Jun 1, 2025

NovaSky-AI / SkyRL

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,798 311 Updated Apr 30, 2026

icwhite / mindcraft

Forked from mindcraft-bots/mindcraft

JavaScript 3 1 Updated Nov 3, 2025

microsoft / debug-gym

A Text-Based Environment for Interactive Debugging

Python 298 39 Updated Apr 14, 2026

microsoft / tale-suite

Text Adventure Learning Environment Suite - Benchmark to evaluate language models on interactive text environments.

Jupyter Notebook 27 6 Updated Apr 1, 2026

databricks / compose-rl

Python 58 18 Updated Mar 25, 2026

tdurieux / anonymous_github

Anonymous Github is a proxy server to support anonymous browsing of Github repositories for open-science code and data.

HTML 2,048 81 Updated Apr 24, 2026

bosung / vllm-multi-node

Scripts for serving vllm on multi node

Python 1 Updated Feb 24, 2025

Ayushmaniar / mindcraft_multiagent_task_generation

JavaScript 1 Updated Mar 1, 2025

jlin816 / dialop

DialOp: Decision-oriented dialogue environments for collaborative language agents

Python 112 8 Updated Nov 15, 2024

SALT-NLP / collaborative-gym

Framework and toolkits for building and evaluating collaborative agents that can work together with humans.

Python 128 19 Updated Apr 30, 2026

jwhj / OREO

Python 116 5 Updated Jan 21, 2025

ucsd-nlp / ucsd-nlp.github.io

HTML 1 Updated Jan 22, 2026

eliottvincent / bay

🐟 A simple theme for Jekyll. Live at https://eliottvincent.github.io/bay/

HTML 185 403 Updated Apr 19, 2026

luchris429 / JaxLife

An Open-Ended Agentic Simulator

Python 60 8 Updated Aug 11, 2024

cocacola-lab / MineLand

Simulating Large-Scale Multi-Agent Interactions with Limited Multimodal Senses and Physical Needs

Python 110 24 Updated Sep 30, 2025