-
onnxruntime Public
Forked from microsoft/onnxruntimeONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
C++ MIT License UpdatedNov 5, 2025 -
flash-linear-attention Public
Forked from fla-org/flash-linear-attention🚀 Efficient implementations of state-of-the-art linear attention models
Python MIT License UpdatedNov 1, 2025 -
-
onnxruntime-genai Public
Forked from microsoft/onnxruntime-genaiGenerative AI extensions for onnxruntime
C++ MIT License UpdatedOct 8, 2025 -
libbacktrace Public
Forked from ianlancetaylor/libbacktraceA C library that may be linked into a C/C++ program to produce symbolic backtraces
C Other UpdatedOct 5, 2025 -
-
opencompass Public
Forked from open-compass/opencompassOpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
Python Apache License 2.0 UpdatedMar 25, 2025 -
optimum Public
Forked from huggingface/optimum🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy to use hardware optimization tools
Python Apache License 2.0 UpdatedNov 14, 2024 -
TensorRT-Model-Optimizer Public
Forked from NVIDIA/TensorRT-Model-OptimizerTensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization, pruning, distillation, etc. It compresses deep learning models for downstream d…
Python Other UpdatedOct 23, 2024 -
libflash_attn Public
Forked from tlc-pack/libflash_attnStandalone Flash Attention v2 kernel without libtorch dependency
C++ BSD 3-Clause "New" or "Revised" License UpdatedMay 21, 2024 -
Stable-Diffusion-WebUI-OnnxRuntime Public
Forked from microsoft/Stable-Diffusion-WebUI-DirectMLExtension for Automatic1111's Stable Diffusion WebUI, using OnnxRuntime CUDA execution provider to deliver high performance result on Nvidia GPU.
-
-
unsloth Public
Forked from unslothai/unsloth2-5X faster 70% less memory QLoRA & LoRA finetuning
Python Apache License 2.0 UpdatedApr 2, 2024 -
ByteTransformer Public
Forked from bytedance/ByteTransformeroptimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052
C++ Apache License 2.0 UpdatedMar 15, 2024 -
gdrivedl Public
Forked from matthuisman/gdrivedlGoogle Drive Download Python Script
Python GNU General Public License v3.0 UpdatedMar 2, 2024 -
Amuse Public
.NET application for stable diffusion, Leveraging OnnxStack, Amuse seamlessly integrates many StableDiffusion capabilities all within the .NET eco-system
-
onnx-modifier Public
Forked from ZhangGe6/onnx-modifierA tool to modify ONNX models in a visualization fashion, based on Netron and Flask.
JavaScript MIT License UpdatedDec 17, 2023 -
Faster-Diffusion Public
Forked from hutaiHang/Faster-DiffusionPython Apache License 2.0 UpdatedDec 16, 2023 -
diffusers Public
Forked from huggingface/diffusers🤗 Diffusers: experiment of diffusion ONNX models
Python Apache License 2.0 UpdatedDec 15, 2023 -
DemoFusion Public
Forked from PRIS-CV/DemoFusionLet us democratise high-resolution generation! (arXiv 2023)
Jupyter Notebook UpdatedDec 15, 2023 -
segment-anything Public
Forked from OroChippw/segment-anythingONNX Runtime support for SAM
Jupyter Notebook Apache License 2.0 UpdatedJun 30, 2023 -
TensorRT Public
Forked from NVIDIA/TensorRTNVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that delivers low latency and high throughput for inference applicat…
C++ Apache License 2.0 UpdatedMay 3, 2023 -
transformers Public
Forked from huggingface/transformers🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
Python Apache License 2.0 UpdatedJul 11, 2022 -
Open Neural Network Exchange
PureBasic MIT License UpdatedMay 3, 2022 -
inference Public
Forked from mlcommons/inferenceReference implementations of inference benchmarks
Python Apache License 2.0 UpdatedSep 24, 2020 -
tutorials Public
Forked from onnx/tutorialsTutorials for creating and using ONNX models
Jupyter Notebook MIT License UpdatedMay 15, 2020 -
bert Public
Forked from google-research/bertTensorFlow code and pre-trained models for BERT
Python Apache License 2.0 UpdatedJun 11, 2019 -
CNTK Public
Forked from microsoft/CNTKMicrosoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit
C++ Other UpdatedFeb 15, 2018