Skip to content
View BodhiHu's full-sized avatar
🌴
bodhicitta
🌴
bodhicitta
  • AMD, MooreThreads
  • Shanghai

Block or report BodhiHu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
777 results for source starred repositories
Clear filter

Efficient Triton Kernels for LLM Training

Python 5,802 426 Updated Nov 5, 2025

FlagGems is an operator library for large language models implemented in the Triton Language.

Python 747 147 Updated Nov 5, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 5,861 736 Updated Oct 15, 2025

ChatGPT CLI is a versatile tool for interacting with LLMs through OpenAI, Azure, and other popular providers like Perplexity AI and Llama. It supports prompt files, history tracking, and live data …

Go 830 51 Updated Oct 9, 2025

A client implementation for ChatGPT and Bing AI. Available as a Node.js module, REST API server, and CLI app.

JavaScript 4,202 724 Updated Jan 27, 2024

User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...)

TypeScript 37,311 3,774 Updated Nov 4, 2025

Convert ONNX models to PyTorch.

Python 707 85 Updated Oct 14, 2025

A unified library of state-of-the-art model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment…

Python 1,511 190 Updated Nov 5, 2025

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 35,668 5,059 Updated Nov 5, 2025
Python 2 Updated Jan 28, 2025

Development repository for the Triton language and compiler

MLIR 17,469 2,360 Updated Nov 5, 2025

Open-Sora: Democratizing Efficient Video Production for All

Python 27,755 2,754 Updated Apr 30, 2025

Mirage Persistent Kernel: Compiling LLMs into a MegaKernel

C++ 1,934 148 Updated Nov 5, 2025

C implementation of the L-Mul f32/f16 multiplications from paper: https://arxiv.org/html/2410.00907

C 28 Updated Oct 12, 2024

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

C++ 18,288 3,534 Updated Nov 5, 2025

An open-source computer vision framework to build and deploy apps in minutes

Rust 767 41 Updated May 8, 2024

A GStreamer Deep Learning Inference Framework

C 131 30 Updated Nov 7, 2023

Data manipulation and transformation for audio signal processing, powered by PyTorch

Python 2,763 738 Updated Nov 5, 2025

Datasets, Transforms and Models specific to Computer Vision

Python 17,279 7,170 Updated Nov 5, 2025

Ultralytics YOLO 🚀

Python 48,300 9,311 Updated Nov 5, 2025

PyTorch native quantization and sparsity for training and inference

Python 2,490 360 Updated Nov 5, 2025

NVIDIA curated collection of educational resources related to general purpose GPU programming.

Jupyter Notebook 803 135 Updated Nov 5, 2025

Samples for CUDA Developers which demonstrates features in CUDA Toolkit

C 8,388 2,170 Updated Sep 5, 2025

Tensor library for machine learning

C++ 13,375 1,375 Updated Nov 4, 2025

Diffusion model(SD,Flux,Wan,Qwen Image,...) inference in pure C/C++

C++ 4,517 439 Updated Nov 3, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,631 2,111 Updated Jul 17, 2025

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 15,375 1,112 Updated Nov 5, 2025
Jupyter Notebook 573 25 Updated Aug 23, 2024

LLM model quantization (compression) toolkit with hw acceleration support for Nvidia CUDA, AMD ROCm, Intel XPU and Intel/AMD/Apple CPU via HF, vLLM, and SGLang.

Python 857 124 Updated Nov 5, 2025
Next