Starred repositories
PPTAgent: Generating and Evaluating Presentations Beyond Text-to-Slides [EMNLP 2025]
Learning Auxiliary Monocular Contexts Helps Monocular 3D Object Detection (AAAI'22)
OpenMMLab's next-generation platform for general 3D object detection.
[ECCV2022] PETR: Position Embedding Transformation for Multi-View 3D Object Detection & [ICCV2023] PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images
My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"
The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework. Website: https://swarms.ai
The official implementation of [NeurIPS 2025 Oral] Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free
EvoVLA: Self-Evolving Vision-Language-Action Model
[CVPR 2025] Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Video Diffusion Transformer
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
[AAAI 2026] EchoMimicV3: 1.3B Parameters are All You Need for Unified Multi-Modal and Multi-Task Human Animation
[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
A course in reinforcement learning in the wild
Scalable toolkit for efficient model reinforcement
FinRL®: Financial Reinforcement Learning. 🔥
The Robotics Library (RL) is a self-contained C++ library for rigid body kinematics and dynamics, motion planning, and control.
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
Re-implementation of the pi0 vision-language-action (VLA) model from Physical Intelligence
SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning
VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model
OpenVLA: An open-source vision-language-action model for robotic manipulation. (Forked from TRI-ML/prismatic-vlms)
[ICLR'25] MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions