Skip to content
View waterhorse1's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report waterhorse1

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. Natural-language-RL Natural-language-RL Public

    Natural Language Reinforcement Learning

    Python 100 7

  2. LLM_Tree_Search LLM_Tree_Search Public

    (ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and training

    Python 281 30

  3. ChessGPT ChessGPT Public

    (NeurIPS 2023) ChessGPT - Bridging Policy Learning and Language Modeling

    Python 129 12

  4. metaopt/torchopt metaopt/torchopt Public

    TorchOpt is an efficient library for differentiable optimization built upon PyTorch.

    Python 622 41

  5. NAC NAC Public

    (NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.

    Jupyter Notebook 28 3

  6. CMML_pytorch CMML_pytorch Public

    (CIKM 2021) CMML: Contextual Modulation Meta Learningfor Cold-Start Recommendation

    Python 4