Stars
The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…
State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!
Production ready toolkit to run AI locally
Clash Premium rule sets (RULE-SET), compatible with ClashX Pro, Clash for Windows, and other clients based on the Clash Premium core.
Roo Code gives you a whole dev team of AI agents in your code editor.
Mustango: Toward Controllable Text-to-Music Generation
A Cloudflare Worker that integrates with a Telegram Bot to filter spam and manage silence consensus polls.
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥
Omnilingual ASR: Open-Source Multilingual Speech Recognition for 1600+ Languages
This repository contains a series of works on diffusion-based speech tokenizers, including the official implementation of the paper: "TaDiCodec: Text-aware Diffusion Speech Tokenizer for Speech Lan…
FlexiCodec: A Dynamic Neural Audio Codec for Low Frame Rates
Ming-UniAudio: Speech LLM for Joint Understanding, Generation and Editing with Unified Representation
A Conversational Speech Generation Model
Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers"
Official implementation of "Continuous Autoregressive Language Models"
Metrics for evaluating music and audio generative models, with a focus on long-form, full-band, and stereo generations.
This is the official repo for the paper "LongCat-Flash-Omni Technical Report"
A list of tools, papers and code related to Fake Audio Detection.
"AI-Trader: Can AI Beat the Market?" Live Trading Bench: https://ai4trade.ai Tech Report Link: https://arxiv.org/abs/2512.10971
Official code of ConfTuner: Training Large Language Models to Express Their Confidence Verbally
Training, inference, and testing of the SAC speech codec model.
An All-in-One Speech, Sound, Music Codec with Single Nested Codebook