-
ACV
- United States
- https://akhil.ai
- @akhilez_
Stars
Warp faces similar to some Snapchat filters or face swaps. Eyes, eyebrows, nose and mouth can all be moved and scaled interactively. The script can process the changes in real-time and works off imβ¦
(Crafter + NetHack) in JAX. ICML 2024 Spotlight.
This repo implements Denoising Diffusion Probabilistic Models (DDPM) in Pytorch
pix2tex: Using a ViT to convert images of equations into LaTeX code.
[NeurIPS 2024 Best Paper Award][GPT beats diffusionπ₯] [scaling laws in visual generationπ] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". Aβ¦
[CVPR 2025 Oral]Infinity β : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
Image Stitching algorithm in Python from scratch with gain compensation and blending
The Mulan Framework with Multi-Label Resampling Algorithms
Set of algorithms used to resample (oversample and undersample) multilabel datasets.
Official inference repo for FLUX.1 models
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Syllabus for IMA Fall 2024 Course
Sample code and notebooks for Generative AI on Google Cloud, with Gemini on Vertex AI
Video+code lecture on building nanoGPT from scratch
Recipes for shrinking, optimizing, customizing cutting edge vision models. π
ALIEN is a CUDA-powered artificial life simulation program.
π Awesome photogrammetry projects
Pretrain Vision and Large Language Models in Python, Published by Packt
Published by Packt
Code for "Embodied Intelligence via Learning and Evolution", Gupta et al, Nature Communications
Repository for the 2023 WACV paper: "Hear The Flow: Optical Flow-Based Self-Supervised Visual Sound Source Localization"
Conditional diffusion model to generate MNIST. Minimal script. Based on 'Classifier-Free Diffusion Guidance'.
Mastering Diverse Domains through World Models
The 2024 edition of The Nature of Code with p5.js. Includes Notion workflow and build system.
PyTorch implementation of TabNet paper : https://arxiv.org/pdf/1908.07442.pdf
Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
A small Python class to measure the time taken by indented lines
Get image width and height given a file path using minimal dependencies (no need for PIL, libjpeg, libpng, etc)
Implementing some of Karl Sims' work with virtual creatures in Python
PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. β‘π₯β‘