Starred repositories

[R]einforcement [L]earning from [M]odel-rewarded [T]hinking - code for the paper "Language Models That Think, Chat Better"

Python 117 6 Updated Oct 27, 2025
Python 548 66 Updated Jan 2, 2025

[ICCV 2025 Oral] DPoser-X: Diffusion Model as Robust 3D Whole-body Human Pose Prior

Python 187 12 Updated Oct 8, 2025

Building General-Purpose Robots Based on Embodied Foundation Model

Python 581 38 Updated Nov 4, 2025

Code for exploring surface electromyography (sEMG) data and training models associated with Reality Labs' paper

Jupyter Notebook 188 24 Updated Aug 13, 2025

Rex-Thinker: Grounded Object Referring via Chain-of-Thought Reasoning

Python 127 6 Updated Jun 30, 2025

[ICCV 2025] Referring to any person or object given a natural language description. Code base for RexSeek and HumanRef Benchmark

Python 172 11 Updated Oct 15, 2025

Code repository for the CVPR 2025 paper "From Sparse Signal to Smooth Motion: Real-Time Motion Generation with Rolling Prediction Models" and the GORP dataset

Python 26 3 Updated Jun 18, 2025

[NeurIPS 2025] PartCrafter: Structured 3D Mesh Generation via Compositional Latent Diffusion Transformers

Python 2,254 142 Updated Sep 19, 2025

A PyTorch implementation of a method based on "Lightweight Multi-View 3D Pose Estimation through Camera-Disentangled Representation", applied to human pose estimation from stereo images.

Python 12 2 Updated Jan 25, 2024

Official implementation of "E3D-Bench: A Benchmark for End-to-End 3D Geometric Foundation Models"

102 1 Updated Jun 4, 2025

Sequence-to-sequence network implementation in PyTorch

Python 5 Updated Mar 27, 2019

[CVPR 2025 Highlight] Official implementation of the solvers and estimators proposed in the paper "Relative Pose Estimation through Affine Corrections of Monocular Depth Priors"

C++ 222 13 Updated Apr 8, 2025

This package contains the original 2012 AlexNet code.

Cuda 2,765 358 Updated Mar 12, 2025

[Lumina Embodied AI] Embodied AI Technical Guide (Embodied-AI-Guide)

8,681 578 Updated Sep 22, 2025

Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'

Jupyter Notebook 2,244 100 Updated Oct 29, 2025

Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.

Python 828 54 Updated May 14, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,328 808 Updated Oct 31, 2025

[CVPR 2025] EgoLife: Towards Egocentric Life Assistant

Python 342 19 Updated Mar 19, 2025

A Python package that provides evaluation and visualization tools for the HO-Cap dataset

Python 43 5 Updated Mar 22, 2025

Official Code for "MITracker: Multi-View Integration for Visual Object Tracking"

Python 110 4 Updated Jun 18, 2025

[ICLR 2024] M/EEG-based image decoding with contrastive learning. (i) Proposes a contrastive learning framework to align images and EEG. (ii) Resolves brain activity for biological plausibility.

Python 174 23 Updated Jul 18, 2025

Solve Visual Understanding with Reinforced VLMs

Python 5,674 366 Updated Oct 21, 2025

MoBA: Mixture of Block Attention for Long-Context LLMs

Python 1,951 116 Updated Apr 3, 2025
Python 63 4 Updated Oct 7, 2025

[ICLR 2025] Official implementation of "DiffSplat: Repurposing Image Diffusion Models for Scalable 3D Gaussian Splat Generation".

Python 434 25 Updated Aug 27, 2025

🚀 Efficient implementations of state-of-the-art linear attention models

Python 3,764 295 Updated Nov 6, 2025

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,732 273 Updated Jul 18, 2025

Official Code for ECCV 2024 paper "EgoPoser: Robust Real-Time Egocentric Pose Estimation from Sparse and Intermittent Observations Everywhere"

Python 31 Updated Aug 28, 2025

An AI Hedge Fund Team

Python 42,225 7,472 Updated Oct 11, 2025