Stars
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models across text, vision, audio, and multimodal domains, for both inference and training.
Robust Speech Recognition via Large-Scale Weak Supervision
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
1 minute of voice data can be used to train a good TTS model! (few-shot voice cloning)
No fortress, purely open ground. OpenManus is Coming.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
ChatGLM-6B: An Open Bilingual Dialogue Language Model
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
🔊 Text-Prompted Generative Audio Model
A generative speech model for daily dialogue.
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
Open-Sora: Democratizing Efficient Video Production for All
"A Beginner's Guide to Open-Source LLMs": a tutorial tailor-made for Chinese users on quickly fine-tuning (full-parameter/LoRA) and deploying open-source large language models (LLMs) and multimodal large models (MLLMs), both domestic and international, in a Linux environment
Fully open reproduction of DeepSeek-R1
A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
verl: Volcano Engine Reinforcement Learning for LLMs
✨✨Latest Advances on Multimodal Large Language Models
Train transformer language models with reinforcement learning.
<⚡️> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.
[🔥updating ...] AI-powered automated quantitative trading bot (fully local deployment). AI-powered Quantitative Investment Research Platform. 📃 online docs: https://ufund-me.github.io/Qbot ✨ news: qbot-mini: https://github.com/Charmve/iQuant
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
A multi-voice TTS system trained with an emphasis on quality
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …
FlashMLA: Efficient Multi-head Latent Attention Kernels
Primarily documents knowledge and interview questions relevant to large language model (LLM) algorithm (application) engineers
The official GitHub page for the survey paper "A Survey of Large Language Models".
Foundational Models for State-of-the-Art Speech and Text Translation