Starred repositories (Nima-Wang)

PPTAgent: Generating and Evaluating Presentations Beyond Text-to-Slides [EMNLP 2025]

Python 2,908 348 Updated Dec 18, 2025
Python 15 2 Updated Oct 10, 2025
Python 881 163 Updated Dec 22, 2022
Python 8 Updated May 12, 2025

Learning Auxiliary Monocular Contexts Helps Monocular 3D Object Detection (AAAI'22)

Python 161 23 Updated Jan 3, 2024

Official code for BEVDepth.

Python 834 113 Updated Jan 18, 2023

OpenMMLab's next-generation platform for general 3D object detection.

Python 6,184 1,715 Updated Jul 10, 2024

[ECCV2022] PETR: Position Embedding Transformation for Multi-View 3D Object Detection & [ICCV2023] PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images

Python 1,006 152 Updated Oct 11, 2023

My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"

Python 270 15 Updated Oct 27, 2025

The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework. Website: https://swarms.ai

Python 5,515 700 Updated Dec 16, 2025

Contexts Optical Compression

Python 21,507 1,924 Updated Oct 25, 2025

The official implementation for [NeurIPS2025 Oral] Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free

Jupyter Notebook 661 41 Updated Dec 20, 2025

EvoVLA: Self-Evolving Vision-Language-Action Model

Python 210 115 Updated Dec 5, 2025

[CVPR 2025] Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Video Diffusion Transformer

Python 1,350 174 Updated Mar 13, 2025

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Python 8,629 1,121 Updated Sep 14, 2024

Bring portraits to life!

Python 17,484 1,816 Updated Nov 16, 2025

[AAAI 2026] EchoMimicV3: 1.3B Parameters are All You Need for Unified Multi-Modal and Multi-Task Human Animation

Python 674 70 Updated Nov 24, 2025

[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation

Python 4,423 520 Updated Aug 11, 2025

A course in reinforcement learning in the wild

Jupyter Notebook 6,381 1,773 Updated Sep 21, 2025

Scalable toolkit for efficient model reinforcement

Python 1,148 199 Updated Dec 21, 2025

FinRL®: Financial Reinforcement Learning. 🔥

Jupyter Notebook 13,551 3,081 Updated Dec 8, 2025

The Robotics Library (RL) is a self-contained C++ library for rigid body kinematics and dynamics, motion planning, and control.

C++ 1,141 238 Updated Apr 15, 2025

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

Python 3,219 425 Updated Dec 18, 2025

Re-implementation of pi0 vision-language-action (VLA) model from Physical Intelligence

Python 1,313 89 Updated Jan 31, 2025

SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Python 1,126 62 Updated Oct 13, 2025

VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model

Python 1,814 161 Updated Nov 18, 2025

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Python 4,782 575 Updated Mar 23, 2025

[ICLR'25] MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions

Python 10 Updated Jun 6, 2025