- Hugging Face
- San Francisco
- https://jadechoghari.github.io/
- https://orcid.org/0009-0006-2986-9641
- @jadechoghari
- in/jadechoghari
Stars
A universal summary of current robotics simulators
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Open-source autonomous cleaning & housekeeping robot
Inference and fine-tuning examples for vision models from 🤗 Transformers
RepVGG: Making VGG-style ConvNets Great Again
[ACM MM24] MotionMaster: Training-free Camera Motion Transfer For Video Generation
VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration
Official PyTorch Implementation for "VidToMe: Video Token Merging for Zero-Shot Video Editing" (CVPR 2024)
Official implementation of 3Doodle: Compact Abstraction of Objects with 3D Strokes (SIGGRAPH '24, Journal track)
OpenMusic: SOTA Text-to-music (TTM) Generation
Daily tracking of awesome audio papers, including music generation, zero-shot TTS, ASR, and audio generation
The official implementation of our paper "Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tuning".
Official code for SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound
[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.
RobustSAM: Segment Anything Robustly on Degraded Images (CVPR 2024 Highlight)
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
High-resolution models for human tasks.
Official implementation of "Slicedit: Zero-Shot Video Editing With Text-to-Image Diffusion Models Using Spatio-Temporal Slices" (ICML 2024).
Library for running a Monte Carlo tree search, either traditionally or with expert policies
serp-ai / bark-with-voice-clone
Forked from suno-ai/bark
🔊 Text-prompted Generative Audio Model - With the ability to clone voices
Instant voice cloning by MIT and MyShell. Audio foundation model.
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)
A text-conditional diffusion probabilistic model capable of generating high-fidelity audio.
This repository facilitates the creation of Python wheel files (.whl) from the tiny-cuda-nn project to streamline the installation process on Google Colab.