🌏
Hello, I am Yongqiang.
  • dyq21@mails.tsinghua.edu.cn

Organizations

@vimalabs

176 stars written in Python

Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos

Python 1,574 156 Updated Sep 3, 2025

ChatArena (or Chat Arena) provides multi-agent language game environments for LLMs. The goal is to develop the communication and collaboration capabilities of AIs.

Python 1,515 145 Updated Aug 11, 2025

Using invisible (blind) watermarks to protect creators' intellectual property

Python 1,514 189 Updated Aug 30, 2024

A simple VAE and CVAE from Keras

Python 1,362 377 Updated May 18, 2021

The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".

Python 1,304 73 Updated Jan 17, 2024

Imitation learning benchmark focusing on complex locomotion tasks using MuJoCo.

Python 1,248 130 Updated May 30, 2025

The official implementation of Self-Play Fine-Tuning (SPIN)

Python 1,213 102 Updated May 8, 2024

SMAC3: A Versatile Bayesian Optimization Package for Hyperparameter Optimization

Python 1,202 240 Updated Nov 7, 2025

One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)

Python 1,202 186 Updated Nov 28, 2024

This is a vault template for researchers using Obsidian.

Python 1,180 188 Updated Jul 18, 2023

BEHAVIOR-1K: a platform for accelerating Embodied AI research. Join our Discord for support: https://discord.gg/bccR5vGFEx

Python 1,093 124 Updated Nov 8, 2025

A Python wrapper for Kaldi

Python 1,030 249 Updated Jan 23, 2025

JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.

Python 1,007 142 Updated Mar 17, 2025

Benchmarking Generalized Out-of-Distribution Detection

Python 1,004 160 Updated Aug 10, 2025

🚀 An efficient forwarding service designed for LLMs. · OpenAI API Reverse Proxy

Python 966 312 Updated Mar 15, 2025

A Light CNN for Deep Face Representation with Noisy Labels, TIFS 2018

Python 961 167 Updated Feb 9, 2022

High throughput synchronous and asynchronous reinforcement learning

Python 949 140 Updated Nov 5, 2025

Modular Reinforcement Learning (RL) library (implemented in PyTorch, JAX, and NVIDIA Warp) with support for Gymnasium/Gym, NVIDIA Isaac Lab, Brax and other environments

Python 897 113 Updated Oct 22, 2025

This is a library that provides dual dexterous hand manipulation tasks through Isaac Gym

Python 896 114 Updated Feb 18, 2025

SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.

Python 834 148 Updated Nov 29, 2022

Official Algorithm Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"

Python 831 96 Updated Apr 18, 2024

Debug PyTorch code using PySnooper

Python 801 43 Updated Apr 28, 2021

[CoRL 2024] HumanPlus: Humanoid Shadowing and Imitation from Humans

Python 786 120 Updated Jul 1, 2024

A suite of test scenarios for multi-agent reinforcement learning.

Python 751 143 Updated Nov 1, 2025

How to use wandb?

Python 682 55 Updated Sep 5, 2023

Long-RL: Scaling RL to Long Sequences (NeurIPS 2025)

Python 649 23 Updated Sep 24, 2025

Class-Balanced Loss Based on Effective Number of Samples. CVPR 2019

Python 613 70 Updated Aug 29, 2021

Optimizing AlphaFold Training and Inference on GPU Clusters

Python 611 89 Updated Jul 16, 2024

Create, manipulate and convert representations of position and orientation in 2D or 3D using Python

Python 600 96 Updated Nov 6, 2025

A tool for enriching the output of nvidia-smi.

Python 569 62 Updated Mar 23, 2024