- Shanghai Jiao Tong University & Shanghai Innovation Institute
- Shanghai
- (UTC +08:00)
- https://zhikangniu.github.io/
Lists (28)
ASR
Awesome List
Bench
Chinese LLM
Codec
CV
Dataset/Tools/Course
Diffusion
emotion
Framework
front
LLM
Music Generation
nano
nlp
other
pipeline
Podcast
PyTorch
RLHF
s2st
speaker diarization
T2V
TTS
tutorial
unify
V2A
Vocoder
Stars
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
A feature-rich command-line audio/video downloader
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and open-source models.
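Dive into Deep Learning: written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.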
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Get your documents ready for gen AI
Generate high-definition short videos with one click using large AI models.
🚀🚀 "Large model": train a 26M-parameter GPT completely from scratch in just 2h! 🌏
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
OpenMMLab Detection Toolbox and Benchmark
PyTorch Tutorial for Deep Learning Researchers
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
You like pytorch? You like micrograd? You love tinygrad! ❤️
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
GUI for a Vocal Remover that uses Deep Neural Networks.
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
Fast and memory-efficient exact attention
Faster Whisper transcription with CTranslate2
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
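Use ChatGPT to summarize arXiv papers. Speeds up the whole research workflow with ChatGPT: full-paper summarization, professional translation, polishing, reviewing, and review responses.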
A TTS model capable of generating ultra-realistic dialogue in one pass.
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System