Stars
This project extends the idea of the innovative architecture of Kolmogorov-Arnold Networks (KAN) to the Convolutional Layers, changing the classic linear transformation of the convolution to learna…
Official Repository for "DrEureka: Language Model Guided Sim-To-Real Transfer" (RSS 2024)
Schedule-Free Optimization in PyTorch
Make your JSON data collaborative and version-controlled with CRDTs
Single C file, Realtime CPU/GPU Profiler with Remote Web Viewer
A native, user-mode, multi-process, graphical debugger.
🔊 Text-Prompted Generative Audio Model
🤘 TT-NN operator library, and TT-Metalium low level kernel programming model.
CoTracker is a model for tracking any point (pixel) on a video.
Simple embedding -> text model trained on a small subset of Wikipedia sentences.
Muzic: Music Understanding and Generation with Artificial Intelligence
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
PyTorch code and models for the DINOv2 self-supervised learning method.
StableLM: Stability AI Language Models
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
An all in one solution for adding Temporal Stability to a Stable Diffusion Render via an automatic1111 extension
Combining Segment Anything (SAM) with Grounded DINO for zero-shot object detection and CLIPSeg for zero-shot segmentation
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything