- All languages
- Assembly
- Bikeshed
- C
- C#
- C++
- CMake
- CSS
- Clojure
- Common Lisp
- Fantom
- Fortran
- GLSL
- Go
- HTML
- Java
- JavaScript
- Jupyter Notebook
- Kotlin
- LLVM
- Less
- Lua
- MATLAB
- MDX
- Makefile
- Markdown
- Nim
- Nix
- Objective-C
- Objective-C++
- PEG.js
- PHP
- Perl
- PowerShell
- Python
- ReScript
- Ruby
- Rust
- ShaderLab
- Shell
- Solidity
- Svelte
- Swift
- TeX
- TypeScript
- Vue
- Wren
- XSLT
- Zig
Starred repositories
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
HeartMuLa Official Repo: The Most Powerful Open-Source Music Generation Model of 2026
[ICCV 2023] PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
Minimalist, dependency-free virtual machine sandbox for microcontrollers and other resource-constrained devices. Single C file, no dynamic memory allocations, asynchronous design, pure C99
Model-rocketry aerodynamics and trajectory simulation software
GMTalker 由光明实验室媒体智能团队打造的3d数字人。系统集成了语音识别、语音合成、自然语言理解、嘴型动画驱动。支持windows、Linux、安卓快速部署。
The basic distribution probability Tutorial for Deep Learning Researchers
An unofficial PyTorch implementation of the audio LM VALL-E
Reading list for research topics in multimodal machine learning
✨✨Latest Advances on Multimodal Large Language Models
A paper and project list about the cutting edge Speech Synthesis, Text-to-Speech (TTS), Singing Voice Synthesis (SVS), Voice Conversion (VC), Singing Voice Conversion (SVC), and related interesting…
A fundamental toolkit designed for music, song, and audio generation
PyTorch implementation of normalizing flow models
High-performance, real-time optimized, and statically typed embedded language implemented in C.
Real time monaural source separation base on fully convolutional neural network operates on Time-frequency domain.
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
AAAI 2025: Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
Text-audio foundation model from Boson AI
The official code repository for LeVo: High-Quality Song Generation with Multi-Preference Alignment
Official PyTorch implementation for "Large Language Diffusion Models"
LLMs-from-scratch项目中文翻译