Stars
Search Self-Play: Pushing the Frontier of Agent Capability without Supervision
The dataset and benchmark of IVMR suite that has been accepted by KDD 2025 Dataset and Benchmark Track
[NeurIPS 2025 D&B] Open-source Multi-agent Poster Generation from Papers
verl: Volcano Engine Reinforcement Learning for LLMs
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
DSAC-v2; DSAC-T; DASC; Distributional Soft Actor-Critic
Our VMAgent is a platform for exploiting Reinforcement Learning (RL) on Virtual Machine (VM) scheduling tasks.
pytorch implementation of "Get To The Point: Summarization with Pointer-Generator Networks"
General Python implementation of Monte Carlo Tree Search for the use with Open AI Gym environments.
A Multi-threaded Implementation of AlphaZero (C++)
[DEPRECATED] Simulation Framework for Virtual Machine Placement in Cloud Computing Environments. [CURRENT]: https://github.com/SDDCVMP/VMP-framework
A simulator for Virtual Machine Placement Algorithms
CloudSim: A Framework For Modeling And Simulation Of Cloud Computing Infrastructures And Services
This repo contains the implementation of deep reinforcement learning (DRL) algorithms for virtual machine rescheduling in data centers.
An Autonomous LLM Agent for Complex Task Solving
Modeling language for Mathematical Optimization (linear, mixed-integer, conic, semidefinite, nonlinear)
Extensible Julia/JuMP optimization package for Security-Constrained Unit Commitment (SCUC)
Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"
Predict and search framework for MilP
Distributed Training for DeepGCNs: https://www.deepgcns.org