Lists (11)
Sort Name ascending (A-Z)
Stars
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
The world's simplest facial recognition api for Python and the command line
CLI platform to experiment with codegen. Precursor to: https://lovable.dev
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
A community-maintained Python framework for creating mathematical animations.
Official Code for DragGAN (SIGGRAPH 2023)
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Open standard for machine learning interoperability
Janus-Series: Unified Multimodal Understanding and Generation Models
Code samples for my book "Neural Networks and Deep Learning"
Automate Creation of YouTube Shorts using MoviePy.
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
[AAAI 2025] Official implementation of "OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on"
🐍Python 3 wrapper of Microsoft UIAutomation. Support UIAutomation for MFC, WindowsForm, WPF, Modern UI(Metro UI), Qt, IE, Firefox, Chrome ...
PyTorch implementation of Super SloMo by Jiang et al.
An unofficial PyTorch implementation of the audio LM VALL-E
A PyTorch library and evaluation platform for end-to-end compression research
The first competitive instance segmentation approach that runs on small edge devices at real-time speeds.
Step-by-step instructions to build a smartphone that is open-source, upgradeable, repairable, and Big Tech free.
ControlLoRA: A Lightweight Neural Network To Control Stable Diffusion Spatial Information
A novel implementation of fusing ViT with Mamba into a fast, agile, and high performance Multi-Modal Model. Powered by Zeta, the simplest AI framework ever.
Bit-Swap: Recursive Bits-Back Coding for Lossless Compression with Hierarchical Latent Variables