Starred repositories
🔊 Text-Prompted Generative Audio Model
Google Research
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
Qwen3-VL is the multimodal large language model series developed by the Qwen team, Alibaba Cloud.
One has no future if one cannot teach oneself.
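AISystem mainly refers to AI systems, covering full-stack low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks.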
Foundational Models for State-of-the-Art Speech and Text Translation
A collection of pre-trained, state-of-the-art models in the ONNX format
Homepage for STAT 157 at UC Berkeley
Tutorials for creating and using ONNX models
Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.
Code for the book "Learn Deep Learning with PyTorch"
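A computer vision closed-loop learning platform where code can be run interactively online. A closed learning loop for "Computer Vision in Action: Algorithms and Applications": Chinese e-book, source code, and reader community (continuously updated ...) 📘 Online e-book https://charmve.github.io/computer-vision-in-acti…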
Parquet-based ML data format optimized for working with unstructured data