-
LTC AI Lab
- Hong Kong
-
00:24
(UTC +08:00) - @lazercuber
- @lazer_tc
Highlights
- Pro
Lists (1)
Sort Name ascending (A-Z)
Stars
The official gpt4free repository | various collection of powerful language models | o4, o3 and deepseek r1, gpt-4.1, gemini 2.5
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
Making large AI models cheaper, faster and more accessible
A generative speech model for daily dialogue.
Instant voice cloning by MIT and MyShell. Audio foundation model.
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
A TTS model capable of generating ultra-realistic dialogue in one pass.
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
A Conversational Speech Generation Model
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
Tools for merging pretrained large language models.
Modeling, training, eval, and inference code for OLMo
Talk to any LLM with hands-free voice interaction, voice interruption, and Live2D taking face running locally across platforms
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/
Understand Human Behavior to Align True Needs
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching