Starred repositories

[R]einforcement [L]earning from [M]odel-rewarded [T]hinking - code for the paper "Language Models That Think, Chat Better"

Python 118 6 Updated Oct 27, 2025
Python 548 66 Updated Jan 2, 2025

[ICCV 2025 Oral] DPoser-X: Diffusion Model as Robust 3D Whole-body Human Pose Prior

Python 190 12 Updated Oct 8, 2025

Building General-Purpose Robots Based on Embodied Foundation Model

Python 587 37 Updated Nov 11, 2025

Code for exploring surface electromyography (sEMG) data and training models associated with Reality Labs' paper

Jupyter Notebook 189 25 Updated Aug 13, 2025

Rex-Thinker: Grounded Object Referring via Chain-of-Thought Reasoning

Python 127 6 Updated Jun 30, 2025

[ICCV 2025] Referring to any person or object given a natural language description. Code base for RexSeek and the HumanRef Benchmark

Python 172 11 Updated Oct 15, 2025

Code repository for the CVPR 2025 paper "From Sparse Signal to Smooth Motion: Real-Time Motion Generation with Rolling Prediction Models" and the GORP dataset

Python 26 3 Updated Jun 18, 2025

[NeurIPS 2025] PartCrafter: Structured 3D Mesh Generation via Compositional Latent Diffusion Transformers

Python 2,260 144 Updated Sep 19, 2025

A PyTorch implementation of a method based on "Lightweight Multi-View 3D Pose Estimation through Camera-Disentangled Representation", applied to human pose estimation from stereo images.

Python 12 2 Updated Jan 25, 2024

Official implementation of "E3D-Bench: A Benchmark for End-to-End 3D Geometric Foundation Models"

102 1 Updated Jun 4, 2025

Sequence-to-sequence network implementation in PyTorch

Python 5 Updated Mar 27, 2019

[CVPR 2025 Highlight] Official implementation of the solvers and estimators proposed in the paper "Relative Pose Estimation through Affine Corrections of Monocular Depth Priors"

C++ 222 13 Updated Apr 8, 2025

This package contains the original 2012 AlexNet code.

Cuda 2,767 358 Updated Mar 12, 2025

[Lumina Embodied AI] Embodied AI Technical Guide (Embodied-AI-Guide)

8,850 592 Updated Sep 22, 2025

Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'

Jupyter Notebook 2,248 100 Updated Oct 29, 2025

Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.

Python 828 54 Updated May 14, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,364 810 Updated Nov 9, 2025

[CVPR 2025] EgoLife: Towards Egocentric Life Assistant

Python 344 19 Updated Mar 19, 2025

A Python package that provides evaluation and visualization tools for the HO-Cap dataset

Python 43 5 Updated Mar 22, 2025

Official Code for "MITracker: Multi-View Integration for Visual Object Tracking"

Python 112 4 Updated Jun 18, 2025

[ICLR 2024] M/EEG-based image decoding with contrastive learning. (i) Proposes a contrastive learning framework to align images and EEG. (ii) Resolves brain activity for biological plausibility.

Python 176 24 Updated Jul 18, 2025

Solve Visual Understanding with Reinforced VLMs

Python 5,685 367 Updated Oct 21, 2025

MoBA: Mixture of Block Attention for Long-Context LLMs

Python 1,972 123 Updated Apr 3, 2025
Python 63 4 Updated Oct 7, 2025

[ICLR 2025] Official implementation of "DiffSplat: Repurposing Image Diffusion Models for Scalable 3D Gaussian Splat Generation".

Python 436 25 Updated Aug 27, 2025

🚀 Efficient implementations of state-of-the-art linear attention models

Python 3,824 299 Updated Nov 12, 2025

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,733 271 Updated Jul 18, 2025

Official Code for ECCV 2024 paper "EgoPoser: Robust Real-Time Egocentric Pose Estimation from Sparse and Intermittent Observations Everywhere"

Python 33 Updated Aug 28, 2025

An AI Hedge Fund Team

Python 42,286 7,478 Updated Oct 11, 2025