BodhiHu

🌴

bodhicitta

中土 Bodhi BodhiHu

🌴

bodhicitta

namo amituofo❤ > bodhicitta > 🎾🧘‍♂️ > 阿彌陀佛 ❤️❤️❤️

23 followers · 10 following

AMD, MooreThreads
Shanghai

Achievements

Liger-Kernel Public
Forked from linkedin/Liger-Kernel

Efficient Triton Kernels for LLM Training

Python BSD 2-Clause "Simplified" License Updated Oct 27, 2025
ultralytics Public
Forked from ultralytics/ultralytics

Ultralytics YOLO11 🚀

Python GNU Affero General Public License v3.0 Updated Oct 16, 2025
TensorRT-Model-Optimizer Public
Forked from NVIDIA/TensorRT-Model-Optimizer

A unified library of state-of-the-art model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment…

Python Apache License 2.0 Updated Sep 9, 2025
llama.cpp Public
Forked from ggml-org/llama.cpp

LLM inference in C/C++

C++ MIT License Updated Jul 17, 2025
mirage-llm-megakernel Public
Forked from mirage-project/mirage

Mirage: Automatically Generating Fast GPU Kernels without Programming in Triton/CUDA

C++ Apache License 2.0 Updated Jun 22, 2025
onnxruntime Public
Forked from microsoft/onnxruntime

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

C++ MIT License Updated Jun 19, 2025
torch_audio Public
Forked from pytorch/audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

Python BSD 2-Clause "Simplified" License Updated Apr 16, 2025
torch_vision Public
Forked from pytorch/vision

Datasets, Transforms and Models specific to Computer Vision

Python BSD 3-Clause "New" or "Revised" License Updated Apr 16, 2025
accelerated-computing-hub Public
Forked from NVIDIA/accelerated-computing-hub

NVIDIA curated collection of educational resources related to general purpose GPU programming.

Jupyter Notebook Other Updated Mar 15, 2025
distributed-llama Public
Forked from b4rtaz/distributed-llama

Connect home devices into a powerful cluster to accelerate LLM inference. More devices means faster inference.

C++ MIT License Updated Mar 10, 2025
sglang Public
Forked from sgl-project/sglang

SGLang is a fast serving framework for large language models and vision language models.

Python Apache License 2.0 Updated Mar 10, 2025
ktransformers Public
Forked from kvcache-ai/ktransformers

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python Apache License 2.0 Updated Mar 5, 2025
Wan2.1 Public
Forked from Wan-Video/Wan2.1

Wan: Open and Advanced Large-Scale Video Generative Models

Python Apache License 2.0 Updated Mar 4, 2025
stable-diffusion.cpp Public
Forked from leejet/stable-diffusion.cpp

Stable Diffusion and Flux in pure C/C++

C++ MIT License Updated Mar 1, 2025
ollama Public
Forked from ollama/ollama

Get up and running with Llama 2, Mistral, and other large language models locally.

Go MIT License Updated Feb 27, 2025
executorch Public
Forked from pytorch/executorch

On-device AI across mobile, embedded and edge for PyTorch

C++ Other Updated Feb 5, 2025
AutoGPTQ Public
Forked from AutoGPTQ/AutoGPTQ

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Python MIT License Updated Dec 15, 2024
LLaMA-MoE-v2 Public
Forked from OpenSparseLLMs/LLaMA-MoE-v2

🚀LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training

Python Apache License 2.0 Updated Dec 12, 2024
LocalAI Public
Forked from mudler/LocalAI

🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transf…

C++ MIT License Updated Oct 24, 2024
lobe-cli-toolbox Public

TypeScript MIT License Updated Oct 24, 2024
MInference Public
Forked from microsoft/MInference

[NeurIPS'24 Spotlight] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which reduces inference latency by up to 10x for pre-filling on an A100 whil…

Python MIT License Updated Oct 16, 2024
L-Mul Public

C implementation of the L-Mul f32/f16 multiplications from paper: https://arxiv.org/html/2410.00907

C 28 Updated Oct 12, 2024
llama-cpp-openai-server Public
Forked from abetlen/llama-cpp-python

Python bindings for llama.cpp

Python MIT License Updated Oct 3, 2024
muAlg Public
Forked from MooreThreads/muAlg

Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl

Cuda Other Updated Sep 13, 2024
transformer-explainer Public
Forked from poloclub/transformer-explainer

Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization

JavaScript MIT License Updated Sep 5, 2024
node-screenshots Public
Forked from nashaofu/node-screenshots

Zero-dependent. A native nodejs screenshots library for Mac、Windows、Linux.

Rust Apache License 2.0 Updated Aug 11, 2024
PowerInfer-forked Public
Forked from SJTU-IPADS/PowerInfer

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

C++ MIT License Updated Jul 15, 2024
ai-search-memfree Public
Forked from memfreeme/memfree

MemFree - Hybrid AI Search Engine

TypeScript MIT License Updated Jul 15, 2024
searxng Public
Forked from searxng/searxng

SearXNG is a free internet metasearch engine which aggregates results from various search services and databases. Users are neither tracked nor profiled.

Python GNU Affero General Public License v3.0 Updated Jul 15, 2024
huggingface-text-generation-inference Public
Forked from huggingface/text-generation-inference

Large Language Model Text Generation Inference

Python Apache License 2.0 Updated Jul 10, 2024

中土 Bodhi BodhiHu

Achievements

Achievements

Liger-Kernel Public

Uh oh!

ultralytics Public

Uh oh!

TensorRT-Model-Optimizer Public

Uh oh!

llama.cpp Public

Uh oh!

mirage-llm-megakernel Public

Uh oh!

onnxruntime Public

Uh oh!

torch_audio Public

Uh oh!

torch_vision Public

Uh oh!

accelerated-computing-hub Public

Uh oh!

distributed-llama Public

Uh oh!

sglang Public

Uh oh!

ktransformers Public

Uh oh!

Wan2.1 Public

Uh oh!

stable-diffusion.cpp Public

Uh oh!

ollama Public

Uh oh!

executorch Public

Uh oh!

AutoGPTQ Public

Uh oh!

LLaMA-MoE-v2 Public

Uh oh!

LocalAI Public

Uh oh!

lobe-cli-toolbox Public

Uh oh!

MInference Public

Uh oh!

L-Mul Public

Uh oh!

llama-cpp-openai-server Public

Uh oh!

muAlg Public

Uh oh!

transformer-explainer Public

Uh oh!

node-screenshots Public

Uh oh!

PowerInfer-forked Public

Uh oh!

ai-search-memfree Public

Uh oh!

searxng Public

Uh oh!

huggingface-text-generation-inference Public

Uh oh!