Stars
BoT-SORT: Robust Associations Multi-Pedestrian Tracking
Supporting code from my related video
Learn RL Techniques in 3 Easy Projects
Context & Guide For Reinforcement Learning with Verifiable Rewards with Large Language Models
Creating 'deep agents' to encourage LLMs to complete long-horizon tasks.
Outlining and demonstrating how language models are able to understand image, video, and text content.
An intuitive approach towards understanding how Retrieval Augmented Generation (RAG) systems work, for the curious yet daunted reader
Official implementation of "Training-free Online Video Step Grounding", NeurIPS 2025
ECIR2025 Tutorial: Advanced Methods for Visual Information Retrieval and Exploration in Large Multimedia Collections
All Algorithms implemented in Python
Diagnostic Reasoning Knowledge Graph for Large Language Model Diagnosis Prediction
Maze navigation with MLM-U
This repository contains demos I made with the Transformers library by HuggingFace.
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Multivariate Time Series Forecasting with efficient Transformers. Code for the paper "Long-Range Transformers for Dynamic Spatiotemporal Forecasting."
[ACL 2023]: Training Trajectories of Language Models Across Scales https://arxiv.org/pdf/2212.09803.pdf
Whole building non-residential hourly energy meter data from the Great Energy Predictor III competition
The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"
A graphical processor simulator and assembly editor for the RISC-V ISA