Skip to content
View lishuai-97's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Beijing
  • 06:52 (UTC +08:00)

Block or report lishuai-97

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
451 stars written in Python
Clear filter

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 152,181 31,063 Updated Nov 6, 2025

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 64,153 6,506 Updated Sep 19, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 62,342 11,082 Updated Nov 6, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 61,969 7,494 Updated Nov 6, 2025

Inference code for Llama models

Python 58,905 9,812 Updated Jan 26, 2025

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 49,050 8,214 Updated Dec 9, 2024

High-Resolution Image Synthesis with Latent Diffusion Models

Python 41,949 5,336 Updated Jun 25, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 40,624 4,613 Updated Nov 6, 2025

2025年最新总结,阿里,腾讯,百度,美团,头条等技术面试题目,以及答案,专家出题人分析汇总。

Python 37,045 9,447 Updated Oct 22, 2025

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 32,622 3,778 Updated Nov 6, 2025

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 31,507 6,476 Updated Nov 6, 2025

The official Meta Llama 3 GitHub site

Python 29,073 3,476 Updated Jan 26, 2025

Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.

Python 27,276 4,823 Updated Aug 18, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 23,908 2,659 Updated Aug 12, 2024

Graph Neural Network Library for PyTorch

Python 23,100 3,910 Updated Nov 3, 2025

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 22,899 2,999 Updated Aug 15, 2024

Fast and memory-efficient exact attention

Python 20,371 2,118 Updated Nov 5, 2025

Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"

Python 19,324 2,724 Updated Oct 17, 2025

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,941 1,877 Updated Jul 15, 2025

PyTorch implementations of Generative Adversarial Networks.

Python 17,317 4,096 Updated Jun 18, 2024

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 16,050 3,179 Updated Nov 6, 2025

Machine Learning Engineering Open Book

Python 15,625 957 Updated Oct 27, 2025

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Python 15,384 2,191 Updated Jul 24, 2024

Ongoing research training transformer models at scale

Python 14,110 3,247 Updated Nov 6, 2025

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…

Python 14,108 969 Updated Nov 3, 2025

[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Python 13,332 2,547 Updated Jun 26, 2024

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 12,899 856 Updated Dec 17, 2024

An open source implementation of CLIP.

Python 12,895 1,193 Updated Nov 4, 2025

Easy-to-use and powerful LLM and SLM library with awesome model zoo.

Python 12,834 3,078 Updated Nov 5, 2025

Generate 3D objects conditioned on text or images

Python 12,125 1,048 Updated Jun 22, 2024
Next