- Shanghai Jiao Tong University
- Shanghai, China
- gszfwsb.github.io
- @ShaoboWang6
Starred repositories
📚 Freely available programming books
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Command-line program to download videos from YouTube.com and other video sites
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Robust Speech Recognition via Large-Scale Weak Supervision
Magnificent app which corrects your previous console command.
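Dive into Deep Learning: written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.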
A high-throughput and memory-efficient inference and serving engine for LLMs
The Python micro framework for building web applications.
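A practical interactive interface for GPT/GLM and other large language models (LLMs), specially optimized for reading, polishing, and writing papers. Modular design; supports custom shortcut buttons and function plugins; project analysis and self-explanation for Python, C++, and other projects; PDF/LaTeX paper translation and summarization; parallel queries to multiple LLMs; and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…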
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
AI agents running research on single-GPU nanochat training automatically
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Collection of Summer 2026 tech internships!
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
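The latest 2025 roundup of technical interview questions from Alibaba, Tencent, Baidu, Meituan, Toutiao, and others, with answers and a consolidated analysis by expert question setters.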
🚀 Clone a voice in 5 seconds to generate arbitrary speech in real-time
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
A modular graph-based Retrieval-Augmented Generation (RAG) system
Open-Sora: Democratizing Efficient Video Production for All
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
SGLang is a high-performance serving framework for large language models and multimodal models.