Skip to content
View CanyonWind's full-sized avatar
😃
😃

Block or report CanyonWind

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 438 30 Updated Mar 27, 2024

【TMM 2025🔥】 Mixture-of-Experts for Large Vision-Language Models

Python 2,266 140 Updated Jul 15, 2025

High-performance In-browser LLM Inference Engine

TypeScript 16,753 1,130 Updated Nov 2, 2025

Machine Learning Engineering Open Book

Python 15,606 956 Updated Oct 27, 2025

⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)

Python 995 62 Updated Dec 6, 2024

Official inference library for Mistral models

Jupyter Notebook 10,531 981 Updated Mar 20, 2025

Official repository for the paper "SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention"

Python 100 7 Updated Sep 30, 2024

OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.

C++ 9,367 1,010 Updated Aug 20, 2025

(Unofficial) Implementation of dilated attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens" (https://arxiv.org/abs/2307.02486)

Python 53 8 Updated Aug 7, 2023

Implementation of the dilated self attention as described in "LongNet: Scaling Transformers to 1,000,000,000 Tokens"

Python 13 Updated Jul 23, 2023

Integral Neural Networks in PyTorch

Python 128 11 Updated Dec 2, 2024

An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) fo…

Jupyter Notebook 3,067 353 Updated Apr 25, 2024

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 179,503 46,096 Updated Nov 5, 2025

Speed up Stable Diffusion with this one simple trick!

Python 1,384 83 Updated Nov 29, 2023

Bringing stable diffusion models to web browsers. Everything runs inside the browser with no server support.

Jupyter Notebook 3,692 235 Updated Mar 12, 2024

A playbook for systematically maximizing the performance of deep learning models.

29,334 2,399 Updated Jun 18, 2024

An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.

Python 14,290 1,836 Updated Jul 3, 2024

Transformer related optimization, including BERT, GPT

C++ 6,342 920 Updated Mar 27, 2024

MegCC是一个运行时超轻量,高效,移植简单的深度学习模型编译器

C++ 488 58 Updated Oct 23, 2024

Summaries and resources for Designing Machine Learning Systems book (Chip Huyen, O'Reilly 2022)

3,714 691 Updated Oct 31, 2025

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory…

Python 2,884 540 Updated Nov 5, 2025

Generic Neural Architecture Search via Regression (NeurIPS'21 Spotlight)

Python 36 8 Updated Aug 29, 2022

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 10,062 731 Updated Oct 31, 2025

FLASHQuad_pytorch

Python 68 9 Updated Apr 1, 2022

🚀 PyTorch Implementation of "Progressive Distillation for Fast Sampling of Diffusion Models(v-diffusion)"

Python 254 33 Updated May 31, 2022

Implementation of the Transformer variant proposed in "Transformer Quality in Linear Time"

Python 369 25 Updated Sep 26, 2023

PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT

Python 2,878 369 Updated Nov 5, 2025

Towards Unified Keyframe Propagation Models

Python 242 29 Updated Aug 19, 2022

Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)

Python 9,260 385 Updated Aug 12, 2025

Fast and memory-efficient exact attention

Python 20,353 2,114 Updated Nov 5, 2025
Next