Shanghai Jiao Tong University & Shanghai Innovation Institute
- Shanghai
- https://zhikangniu.github.io/
Lists (28)
ASR
Awesome List
Bench
Chinese LLM
Codec
CV
Dataset/Tools/Course
Diffusion
emotion
Framework
front
LLM
Music Generation
nano
nlp
other
pipeline
Podcast
PyTorch
RLHF
s2st
speaker diarization
T2V
TTS
tutorial
unify
V2A
Vocoder
Stars
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
Welcome to the Llama Cookbook! This is your go-to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end-to-end problems using Llama mode…
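This project covers the knowledge points and code implementations commonly tested in Machine Learning, Deep Learning, and NLP interviews, which are also the theoretical foundations every algorithm engineer is expected to know.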
Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course.
✔ (Completed) Extremely comprehensive deep learning notes [Tudui: PyTorch] [Mu Li: Dive into Deep Learning] [Andrew Ng: Deep Learning] [Dafei: LLM Agents]
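AISystem mainly refers to AI systems, covering full-stack low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks.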
A multi-voice TTS system trained with an emphasis on quality
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
High-Resolution Image Synthesis with Latent Diffusion Models
Chinese-language reinforcement learning tutorial (the Mushroom Book 🍄). Read online at https://datawhalechina.github.io/easy-rl/
This repository contains demos I made with the Transformers library by HuggingFace.
Official inference library for Mistral models
YSDA course in Natural Language Processing
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Taming Transformers for High-Resolution Image Synthesis
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
PyTorch🍊🍉 is delicious, just eat it! 😋😋
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
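[🔞🔞🔞 Contains images not suitable for minors] AI explorations and notes built around my strengths in programming, painting, and writing: StableDiffusion is a powerful image generation model that can produce new images by evolving an existing image. ChatGPT is a Transformer-based language generation model that can automatically generate a fitting article for a given topic. Github Copilot is an intelligent programming assistant that speeds up everyday programming work.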
Think DSP: Digital Signal Processing in Python, by Allen B. Downey.
Materials for the Hugging Face Diffusion Models Course
Chinese NLP solutions (large models, data, models, training, inference)