lucas-ventura

Lucas Ventura lucas-ventura

33 followers · 33 following

Achievements

Highlights

Lists (1)

Sort

Launching

1 repository

Stars

nicolas-dufour / miro

Code for MIRO: MultI-Reward cOnditioned pretraining improves T2I quality and efficiency

17 Updated Oct 31, 2025

Anttwo / MILo

[SIGGRAPH Asia 2025 - TOG] Official implementation of MILo: Mesh-In-the-Loop Gaussian Splatting for Detailed and Efficient Surface Reconstruction

Python 314 26 Updated Dec 2, 2025

joanrod / star-vector

StarVector is a foundation model for SVG generation that transforms vectorization into a code generation task. Using a vision-language modeling architecture, StarVector processes both visual and te…

Python 4,169 229 Updated Nov 7, 2025

huggingface / nanoVLM

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 4,440 434 Updated Oct 27, 2025

davidea97 / Multi-Camera-Hand-Eye-Calibration

Multi-Camera Hand-Eye Calibration Framework for calibrating a camera network with respect to a robot arm

C++ 28 3 Updated Jun 6, 2025

showlab / livecc

LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale (CVPR 2025)

Python 357 47 Updated Oct 29, 2025

ElliotVincent / DAFA-LS

(EarthVision 2025 - CVPR Workshop) Official repository of DAFA-LS, a dataset of satellite image time series for the task of archaeological looting detection.

Python 38 Updated Nov 21, 2024

zerchen / hort

HORT: Monocular Hand-held Objects Reconstruction with Transformers, ICCV 2025

Python 50 2 Updated Apr 10, 2025

Qualcomm-AI-research / FitCoach

Python 7 3 Updated Dec 16, 2024

Anttwo / MAtCha

[CVPR 2025 - Spotlight] Official PyTorch implementation of MAtCha Gaussians: Atlas of Charts for High-Quality Geometry and Photorealism From Sparse Views

Python 251 13 Updated Apr 8, 2025

lucas-ventura / chapter-llama

Official PyTorch implementation of the paper "Chapter-Llama: Efficient Chaptering in Hour-Long Videos with LLMs"

Python 86 13 Updated Jun 6, 2025

EGO4D / ego-exo4d-proficiency

Python 1 Updated May 25, 2025

gastruc / AnySat

Python 178 12 Updated Oct 16, 2025

davidpicard / pom

official implementation of the Polynomial Mixer

Python 22 1 Updated Sep 15, 2025

TIGER-AI-Lab / Mantis

Official code for Paper "Mantis: Multi-Image Instruction Tuning" [TMLR 2024]

Python 237 22 Updated Mar 23, 2025

open-webui / open-webui

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

Svelte 118,828 16,728 Updated Dec 23, 2025

nicolas-dufour / plonk

Code for "Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation"

Jupyter Notebook 258 21 Updated Sep 29, 2025

LukasMaly / spot-it-generator

A Python project for generating Spot It! (a.k.a Dobble) cards.

Python 9 7 Updated Dec 26, 2019

vlc-robot / robot-3dlotus

Official implementation of "Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy."

Jupyter Notebook 120 15 Updated Oct 23, 2025

raphael-baena / DTLR

Handwritten Text Recognition and Character Detection

Python 163 19 Updated Sep 28, 2025

OpenBMB / MiniCPM-V

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,435 1,689 Updated Sep 24, 2025

marcmartinezgost / enn

Expressive Neural Network: A Neural Network Model with DCT Adaptive Activation Functions

Jupyter Notebook 9 Updated Sep 20, 2024

alibaba / alimama-video-narrator

Research code for ACL2024 paper: "Synchronized Video Storytelling: Generating Video Narrations with Structured Storyline"

Python 40 6 Updated Dec 27, 2024

ExplainableML / EgoCVR

[ECCV 2024] EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval

Python 41 Updated Apr 11, 2025

MME-Benchmarks / Video-MME

✨✨[CVPR 2025] Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

699 25 Updated Dec 8, 2025

gastruc / OmniSat

Python 88 7 Updated Oct 24, 2024

DAMO-NLP-SG / VideoLLaMA2

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

Python 1,260 83 Updated Jan 23, 2025

valeoai / GenVal

Reliability in Semantic Segmentation: Can We Use Synthetic Data? (ECCV 2024)

Jupyter Notebook 40 1 Updated Jul 17, 2024

ElliotVincent / SitsSCD

Implementation of the multi-temporal UTAE for the task of satellite image time series semantic change detection (SITS-SCD)

Python 60 4 Updated Jul 11, 2024

robincourant / DIRECTOR

Python 73 5 Updated Oct 25, 2024