Stars
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
Instruct-tune LLaMA on consumer hardware
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
StableLM: Stability AI Language Models
This repository contains the source code for the paper First Order Motion Model for Image Animation
Foundational Models for State-of-the-Art Speech and Text Translation
QLoRA: Efficient Finetuning of Quantized LLMs
Official inference library for Mistral models
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
A better notebook for Scala (and more)
serp-ai / bark-with-voice-clone
Forked from suno-ai/bark
🔊 Text-prompted Generative Audio Model - With the ability to clone voices
A crash course in six episodes for software developers who want to become machine learning practitioners.
Vehicle detection using machine learning and computer vision techniques for Udacity's Self-Driving Car Engineer Nanodegree.
Real-time portrait segmentation for mobile devices
[CVPR 2026] A PyTorch implementation of the paper "EDGS: Eliminating Densification for Efficient Convergence of 3DGS"
Diffusion Illusions: Hiding Images in Plain Sight
Hodgepodge of chessboard detection algorithms on images from actual matches.
Run Node.js code in Python notebooks
GPT-2 French demo | Démo française de GPT-2
Open source LLM arena created by the French Government
Pipeline to convert real-life chess boards into a 2D digital format (FEN) from images and live camera feeds. The system has 2 versions: one for real-time processing using the OAK-D Lite camera and …
An attempt at different methods for the problem outlined in this article: https://hardmath123.github.io/crown-typewriter.html