Skip to content
View lucas-ventura's full-sized avatar

Highlights

  • Pro

Block or report lucas-ventura

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Code for MIRO: MultI-Reward cOnditioned pretraining improves T2I quality and efficiency

17 Updated Oct 31, 2025

[SIGGRAPH Asia 2025 - TOG] Official implementation of MILo: Mesh-In-the-Loop Gaussian Splatting for Detailed and Efficient Surface Reconstruction

Python 314 26 Updated Dec 2, 2025

StarVector is a foundation model for SVG generation that transforms vectorization into a code generation task. Using a vision-language modeling architecture, StarVector processes both visual and te…

Python 4,169 229 Updated Nov 7, 2025

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 4,440 434 Updated Oct 27, 2025

Multi-Camera Hand-Eye Calibration Framework for calibrating a camera network with respect to a robot arm

C++ 28 3 Updated Jun 6, 2025

LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale (CVPR 2025)

Python 357 47 Updated Oct 29, 2025

(EarthVision 2025 - CVPR Workshop) Official repository of DAFA-LS, a dataset of satellite image time series for the task of archaeological looting detection.

Python 38 Updated Nov 21, 2024

HORT: Monocular Hand-held Objects Reconstruction with Transformers, ICCV 2025

Python 50 2 Updated Apr 10, 2025
Python 7 3 Updated Dec 16, 2024

[CVPR 2025 - Spotlight] Official PyTorch implementation of MAtCha Gaussians: Atlas of Charts for High-Quality Geometry and Photorealism From Sparse Views

Python 251 13 Updated Apr 8, 2025

Official PyTorch implementation of the paper "Chapter-Llama: Efficient Chaptering in Hour-Long Videos with LLMs"

Python 86 13 Updated Jun 6, 2025
Python 1 Updated May 25, 2025
Python 178 12 Updated Oct 16, 2025

official implementation of the Polynomial Mixer

Python 22 1 Updated Sep 15, 2025

Official code for Paper "Mantis: Multi-Image Instruction Tuning" [TMLR 2024]

Python 237 22 Updated Mar 23, 2025

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

Svelte 118,828 16,728 Updated Dec 23, 2025

Code for "Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation"

Jupyter Notebook 258 21 Updated Sep 29, 2025

A Python project for generating Spot It! (a.k.a Dobble) cards.

Python 9 7 Updated Dec 26, 2019

Official implementation of "Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy."

Jupyter Notebook 120 15 Updated Oct 23, 2025

Handwritten Text Recognition and Character Detection

Python 163 19 Updated Sep 28, 2025

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,435 1,689 Updated Sep 24, 2025

Expressive Neural Network: A Neural Network Model with DCT Adaptive Activation Functions

Jupyter Notebook 9 Updated Sep 20, 2024

Research code for ACL2024 paper: "Synchronized Video Storytelling: Generating Video Narrations with Structured Storyline"

Python 40 6 Updated Dec 27, 2024

[ECCV 2024] EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval

Python 41 Updated Apr 11, 2025

✨✨[CVPR 2025] Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

699 25 Updated Dec 8, 2025
Python 88 7 Updated Oct 24, 2024

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

Python 1,260 83 Updated Jan 23, 2025

Reliability in Semantic Segmentation: Can We Use Synthetic Data? (ECCV 2024)

Jupyter Notebook 40 1 Updated Jul 17, 2024

Implementation of the multi-temporal UTAE for the task of satellite image time series semantic change detection (SITS-SCD)

Python 60 4 Updated Jul 11, 2024
Python 73 5 Updated Oct 25, 2024
Next