🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 156,234 31,994 Updated Feb 8, 2026

Robust Speech Recognition via Large-Scale Weak Supervision

Python 94,342 11,731 Updated Dec 15, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 84,854 12,841 Updated Jan 29, 2026

Just 1 minute of voice data is enough to train a good TTS model! (few-shot voice cloning)

Python 54,835 5,997 Updated Feb 8, 2026

No fortress, purely open ground. OpenManus is Coming.

Python 54,348 9,521 Updated Jan 5, 2026

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 44,490 5,957 Updated Aug 16, 2024

C++ Things

C++ 42,846 8,834 Updated Jun 14, 2024

ChatGLM-6B: An Open Bilingual Dialogue Language Model

Python 41,224 5,208 Updated Jun 27, 2024

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 39,398 4,778 Updated Jun 2, 2025

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 38,970 4,684 Updated Aug 19, 2024

A generative speech model for daily dialogue.

Python 38,679 4,207 Updated Jan 18, 2026

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 32,143 6,660 Updated Sep 30, 2025

Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.

Python 30,815 3,669 Updated Feb 4, 2026

Open-Sora: Democratizing Efficient Video Production for All

Python 28,504 2,885 Updated Apr 30, 2025

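"A Guide to Using Open-Source Large Models": a tutorial tailored for Chinese beginners on quickly fine-tuning (full-parameter/LoRA) and deploying Chinese and international open-source large language models (LLMs) and multimodal large models (MLLMs) in a Linux environment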

Jupyter Notebook 28,132 2,806 Updated Feb 8, 2026

Fully open reproduction of DeepSeek-R1

Python 25,868 2,408 Updated Nov 24, 2025

A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone

Python 23,384 1,786 Updated Feb 8, 2026

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 22,010 2,695 Updated Jan 23, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 19,068 3,208 Updated Feb 6, 2026

✨✨Latest Advances on Multimodal Large Language Models

17,324 1,110 Updated Feb 7, 2026

Train transformer language models with reinforcement learning.

Python 17,308 2,481 Updated Feb 8, 2026

<⚡️> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.

Python 17,164 2,165 Updated Jan 22, 2025

[🔥updating ...] AI automated quantitative trading bot (fully local deployment). AI-powered Quantitative Investment Research Platform. 📃 online docs: https://ufund-me.github.io/Qbot ✨ qbot-mini: https://github.com/Charmve/iQuant

Jupyter Notebook 16,153 2,297 Updated Jul 6, 2025

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 14,843 1,571 Updated Feb 4, 2026

A multi-voice TTS system trained with an emphasis on quality

Jupyter Notebook 14,804 2,053 Updated Nov 19, 2024

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python 12,591 1,196 Updated Feb 7, 2026

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 12,458 981 Updated Feb 6, 2026

Notes on the knowledge and interview questions relevant to large language model (LLM) algorithm (application) engineers

HTML 12,307 1,239 Updated Apr 30, 2025

The official GitHub page for the survey paper "A Survey of Large Language Models".

Python 12,082 939 Updated Mar 11, 2025

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 11,745 1,169 Updated Nov 14, 2024