Stars
A single CLAUDE.md file to improve Claude Code behavior, derived from Andrej Karpathy's observations on LLM coding pitfalls.
Claude Code skill for correctness audits of mathematical proofs in LaTeX papers
Official implementation of Information-Theoretic Decomposition for Multimodal Interaction Learning (DMIL) (CVPR 2026).
Unofficial Python API and agentic skill for Google NotebookLM. Full programmatic access to NotebookLM's features—including capabilities the web UI doesn't expose—via Python, CLI, and AI agents like…
The official repository for CVPR'26 Paper "APPO: Attention-guided Perception Policy Optimization for Video Reasoning"
Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.
Elevate your AI research writing, no more tedious polishing ✨
Lumina-DiMOO - An Open-Sourced Multi-Modal Large Diffusion Language Model
✨✨Latest Advances on Multimodal Large Language Models
The official repo for "Efficient Quantification of Multimodal Interaction at Sample Level", ICML 2025
[SCIS] MULTI-Benchmark: Multimodal Understanding Leaderboard with Text and Images
Differentiable ODE solvers with full GPU support and O(1)-memory backpropagation.
Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI
Automatically update arXiv papers about SOT & VLT, Multi-modal Learning, LLM and Video Understanding using Github Actions.
[ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization"
The repo for "Enhancing Multi-modal Cooperation via Sample-level Modality Valuation", CVPR 2024
Automated Client for generals.io
Reading list for research topics in multimodal machine learning
A pytorch implementation of MINE(Mutual Information Neural Estimation)
A python implement for Certifiable Robust Multi-modal Training
A Simple pytorch implementation of GradCAM and GradCAM++
⛏⚽ Scrape soccer data from Club Elo, ESPN, FBref, Football-Data.co.uk, Sofascore, SoFIFA, Understat and WhoScored.