Skip to content
View MDrW's full-sized avatar

Block or report MDrW

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 17 1 Updated Feb 14, 2026

Search Self-Play: Pushing the Frontier of Agent Capability without Supervision

Python 98 8 Updated Mar 4, 2026

The dataset and benchmark of IVMR suite that has been accepted by KDD 2025 Dataset and Benchmark Track

Python 1 2 Updated Jul 28, 2025

[NeurIPS 2025 D&B] Open-source Multi-agent Poster Generation from Papers

Python 3,548 246 Updated Dec 21, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,229 3,506 Updated Mar 26, 2026

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 69,117 8,430 Updated Mar 26, 2026

DSAC-v2; DSAC-T; DASC; Distributional Soft Actor-Critic

Python 438 50 Updated Dec 1, 2025

Our VMAgent is a platform for exploiting Reinforcement Learning (RL) on Virtual Machine (VM) scheduling tasks.

Python 96 13 Updated Apr 27, 2023
Python 1 Updated May 5, 2023

Pytorch implementation of Pointer Network

Python 339 71 Updated Apr 17, 2019

pytorch implementation of "Get To The Point: Summarization with Pointer-Generator Networks"

Python 915 236 Updated Jan 23, 2023

General Python implementation of Monte Carlo Tree Search for the use with Open AI Gym environments.

Python 42 17 Updated Oct 8, 2020

A Multi-threaded Implementation of AlphaZero (C++)

Python 387 49 Updated Jan 7, 2023

中国大模型

6,427 554 Updated Nov 30, 2024

基于强化学习的云计算虚拟机放置

Java 18 6 Updated Apr 11, 2019

[DEPRECATED] Simulation Framework for Virtual Machine Placement in Cloud Computing Environments. [CURRENT]: https://github.com/SDDCVMP/VMP-framework

Java 11 8 Updated May 20, 2021

A simulator for Virtual Machine Placement Algorithms

Java 3 1 Updated Jul 23, 2018

CloudSim: A Framework For Modeling And Simulation Of Cloud Computing Infrastructures And Services

Java 987 563 Updated Jan 10, 2026

This repo contains the implementation of deep reinforcement learning (DRL) algorithms for virtual machine rescheduling in data centers.

Python 12 2 Updated Dec 2, 2022

Papers about graph transformers.

922 76 Updated Mar 19, 2025

An Autonomous LLM Agent for Complex Task Solving

Python 8,520 896 Updated Aug 12, 2024

Modeling language for Mathematical Optimization (linear, mixed-integer, conic, semidefinite, nonlinear)

Julia 2,419 415 Updated Mar 26, 2026

Extensible Julia/JuMP optimization package for Security-Constrained Unit Commitment (SCUC)

Julia 138 35 Updated Mar 11, 2026

Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"

Python 775 180 Updated May 21, 2025

Predict and search framework for MilP

Python 67 15 Updated Nov 20, 2022

Distributed Training for DeepGCNs: https://www.deepgcns.org

Python 8 2 Updated Sep 26, 2022
Next