DeeDive

🌏

Hello, I am Yongqiang.

Turning ideas into action.

22 followers · 372 following

dyq21@mails.tsinghua.edu.cn

Highlights

Developer Program Member
Pro

Lists (15)

my_collaboration_proj

nameing_convension

1 repository

paper_with_code

16 repositories

philosophy

1 repository

robocup

1 repository

todo

78 repositories

tools

66 repositories

Stars

176 stars written in Python

openai / Video-Pre-Training

Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos

Python 1,574 156 Updated Sep 3, 2025

Farama-Foundation / chatarena

ChatArena (or Chat Arena) is a set of multi-agent language game environments for LLMs. The goal is to develop the communication and collaboration capabilities of AIs.

Python 1,515 145 Updated Aug 11, 2025

fire-keeper / BlindWatermark

Using an invisible (blind) watermark to protect creators' intellectual property

Python 1,514 189 Updated Aug 30, 2024

bojone / vae

A simple VAE and CVAE in Keras

Python 1,362 377 Updated May 18, 2021

NVlabs / prismer

The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".

Python 1,304 73 Updated Jan 17, 2024

robfiras / loco-mujoco

Imitation learning benchmark focusing on complex locomotion tasks using MuJoCo.

Python 1,248 130 Updated May 30, 2025

uclaml / SPIN

The official implementation of Self-Play Fine-Tuning (SPIN)

Python 1,213 102 Updated May 8, 2024

automl / SMAC3

SMAC3: A Versatile Bayesian Optimization Package for Hyperparameter Optimization

Python 1,202 240 Updated Nov 7, 2025

Replicable-MARL / MARLlib

One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)

Python 1,202 186 Updated Nov 28, 2024

sheldonxxd / obsidian_vault_template_for_researcher

This is a vault template for researchers using Obsidian.

Python 1,180 188 Updated Jul 18, 2023

StanfordVL / BEHAVIOR-1K

BEHAVIOR-1K: a platform for accelerating Embodied AI research. Join our Discord for support: https://discord.gg/bccR5vGFEx

Python 1,093 124 Updated Nov 8, 2025

pykaldi / pykaldi

A Python wrapper for Kaldi

Python 1,030 249 Updated Jan 23, 2025

PKU-Alignment / omnisafe

JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.

Python 1,007 142 Updated Mar 17, 2025

Jingkang50 / OpenOOD

Benchmarking Generalized Out-of-Distribution Detection

Python 1,004 160 Updated Aug 10, 2025

KenyonY / openai-forward

🚀 An efficient forwarding service designed for LLMs. · OpenAI API Reverse Proxy

Python 966 312 Updated Mar 15, 2025

AlfredXiangWu / LightCNN

A Light CNN for Deep Face Representation with Noisy Labels, TIFS 2018

Python 961 167 Updated Feb 9, 2022

alex-petrenko / sample-factory

High throughput synchronous and asynchronous reinforcement learning

Python 949 140 Updated Nov 5, 2025

Toni-SM / skrl

Modular Reinforcement Learning (RL) library (implemented in PyTorch, JAX, and NVIDIA Warp) with support for Gymnasium/Gym, NVIDIA Isaac Lab, Brax and other environments

Python 897 113 Updated Oct 22, 2025