Skip to content
View leotac's full-sized avatar

Highlights

  • Pro

Block or report leotac

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 250 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An efficient video loader for deep learning with smart shuffling that's super easy to digest

C++ 16 3 Updated Sep 16, 2025

VTGNet: A Vision-based Trajectory Generation Network for Autonomous Vehicles in Urban Environments

Python 40 11 Updated Nov 27, 2020

See the Future: A Semantic Segmentation Network Predicting Ego-vehicle Trajectory with a Single Monocular Camera

8 1 Updated May 27, 2021

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 11,189 1,125 Updated Sep 1, 2025

RT-GENE: Real-Time Eye Gaze and Blink Estimation in Natural Environments

Python 422 73 Updated Oct 10, 2024

The official PyTorch implementation of L2CS-Net for gaze estimation and tracking

Python 425 101 Updated Feb 2, 2024

This repository contains demos I made with the Transformers library by HuggingFace.

Jupyter Notebook 11,269 1,679 Updated Jul 2, 2025

Dual Swin Transformer for video-time-series fusion

Python 20 4 Updated Aug 28, 2024

Access large language models from the command-line

Python 9,870 642 Updated Sep 30, 2025

Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜

Jupyter Notebook 1,627 130 Updated Sep 12, 2025

Refine high-quality datasets and visual AI models

Python 9,927 673 Updated Oct 9, 2025

Official repo for our paper: "What Matters in Autonomous Driving Anomaly Detection: A Weakly Supervised Horizon"

2 Updated Oct 21, 2024
Python 631 71 Updated Jun 7, 2025

Annotation for reproducing the result of the paper "Cross-model temporal cooperation via saliency maps for efficient recognition and classification of relevant traffic lights" .

Python 6 Updated Sep 28, 2024

Software Development Kit for the Zenseact Open Dataset (ZOD)

Python 128 17 Updated Oct 6, 2025

A playbook for systematically maximizing the performance of deep learning models.

29,230 2,390 Updated Jun 18, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 44,907 7,652 Updated Dec 9, 2024

Code for robust monocular depth estimation described in "Ranftl et. al., Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer, TPAMI 2022"

Python 5,152 702 Updated Aug 23, 2024

A latent text-to-image diffusion model

Jupyter Notebook 71,560 10,506 Updated Jun 18, 2024

Render JSON into collapsible HTML

JavaScript 424 92 Updated Mar 7, 2023

[CVPR2023] The official repo for OC-SORT: Observation-Centric SORT on video Multi-Object Tracking. OC-SORT is simple, online and robust to occlusion/non-linear motion.

Python 950 136 Updated Jul 18, 2023

[ECCV 2022] ByteTrack: Multi-Object Tracking by Associating Every Detection Box

Python 5,671 1,046 Updated Jun 19, 2024

Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).

1,217 59 Updated Jun 28, 2024

[ICCV 2021] Deep Reinforced Accident Anticipation with Visual Explanation

Python 26 8 Updated Sep 4, 2023

GLIDE: a diffusion-based text-conditional image synthesis model

Python 3,661 504 Updated Mar 8, 2024

[ACM MM 2020] CCD dataset for traffic accident anticipation.

129 11 Updated Sep 2, 2023

Optimization Models used in my e-book with the same title

AMPL 14 2 Updated Feb 26, 2022

This is the repo for our Detection of Traffic Anomaly (DoTA) dataset.

Python 241 39 Updated Dec 28, 2023

[ECCV 2020] Learning stereo from single images using monocular depth estimation networks

Python 406 56 Updated Jul 2, 2021
Next