Organizations

@UNICT-DMI @fpv-iplab @triglie @I-Golem @farsightlab

Starred repositories

A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.

Python 2,971 404 Updated Jun 22, 2026

Code, data and weights for the paper **What drives success in physical planning with Joint-Embedding Predictive World Models?**

Python 411 45 Updated Apr 11, 2026

Official implementation of GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Python 479 32 Updated May 20, 2026

Code implementation for the paper "Large-scale Pre-training for Grounded Video Caption Generation" (ICCV 2025)

Python 31 1 Updated Jan 18, 2026

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 22,602 2,313 Updated Jun 3, 2026

A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related webs…

Python 1,822 60 Updated Jun 22, 2026

[IJCV] EgoPlan-Bench: Benchmarking Multimodal Large Language Models for Human-Level Planning

Python 85 5 Updated Dec 6, 2024

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,976 357 Updated Jan 4, 2024

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 23,394 2,646 Updated Mar 3, 2026

The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…

Python 10,641 1,607 Updated Jun 15, 2026

Using advances in generative modeling to learn reward functions from unlabeled videos.

Jupyter Notebook 143 15 Updated Feb 12, 2024

A Large-scale Video Action Dataset

Python 476 14 Updated Jan 16, 2026

Python 28 1 Updated Jul 18, 2025

Official implementation of "HowToCaption: Prompting LLMs to Transform Video Annotations at Scale." ECCV 2024

Python 58 Updated Aug 19, 2025

Python 42 3 Updated Jun 14, 2025

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 105,489 14,153 Updated Jun 22, 2026

An extension of the PyTorch library containing various tools for performing deep learning in hyperbolic space.

Python 177 13 Updated Jan 7, 2025

OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340

Jupyter Notebook 4,325 362 Updated Dec 4, 2025

UCI chess engine

C++ 37 6 Updated Nov 18, 2025

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 13,472 1,498 Updated May 19, 2026

Synchronization is All You Need: Exocentric-to-Egocentric Transfer for Temporal Action Segmentation with Unlabeled Synchronized Video Pairs [ECCV, 2024]

Python 8 1 Updated Jul 19, 2024

Official PyTorch Implementation of Masked Temporal Interpolation Diffusion for Procedure Planning in Instructional Videos

Python 11 1 Updated Apr 26, 2026

Code for the Molmo Vision-Language Model

Python 913 96 Updated Dec 12, 2024

Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.

HTML 8,706 538 Updated Jun 19, 2026

Differentiable Dynamic Programming

Python 72 19 Updated Sep 15, 2020

Implementation of Autoregressive Diffusion in Pytorch

Python 437 13 Updated Dec 4, 2025

[BMVC2022, IJCV2023, Best Student Paper, Spotlight] Official codes for the paper "In the Eye of Transformer: Global-Local Correlation for Egocentric Gaze Estimation".

Python 35 10 Updated Feb 22, 2025

Official Pytorch Implementation of GraphiT

Python 111 13 Updated Jul 6, 2021