-
Nanjing University
- Nanjing, China
- http://www.lamda.nju.edu.cn/yuy
-
ZOOpt Public
A python package of Zeroth-Order Optimization (ZOOpt)
-
ZOOclient.jl Public
Client of distributed ZOOpt in Julia
-
-
-
gym Public
Forked from openai/gymA toolkit for developing and comparing reinforcement learning algorithms.
Python Other UpdatedJul 26, 2021 -
ASG Public
Open category classification by adversarial sample generation
-
RACOS Public
A theoretically-grounded derivative-free optimization method, born from a statistical view of evolutionary algorithms
-
MLSim Public
A machine learning based fine-grained disease transmission simulator
-
Deep-Reasoning-Papers Public
Forked from floodsung/Deep-Reasoning-PapersRecent Papers including Neural Symbolic Reasoning, Logical Reasoning, Visual Reasoning, planning and any other topics connecting deep learning and reasoning
-
VirtualTaobao Public
Virtual-Taobao simulators with OpenAI Gym interface
-
ABL-HED Public
Forked from AbductiveLearning/ABL-HEDHandwritten Equations Decipherment with Abductive Learning
Python UpdatedAug 15, 2019 -
PRR Public
Meta-Reinforcement Learning with Policy Residual Representation
-
mind-SC2 Public
Forked from StarBeta/Thought-SC2Efficient Reinforcement Learning with a Mind-Game for Full-Length StarCraft II
-
-
RetroCodes Public
Codes of our team for the OpenAI Retro Contest of reinforcement learning
-
POSEC Public
Learning Environmental Calibration Actions for Policy Self-Evolution
-
-
GANMM Public
Mixture of Generative Adversarial Networks for Clustering
-
-
-
SRE Public
Sequential Random Embedding for High-Dimensional Derivative-Free Optimization