Created and enhanced a local LLM training system on Apple Silicon with MLX and Metal API, overcoming the absence of CUDA support. Fine-tuned the Llama3 model on 16 GPUs for streamlined solution of …

Python 20 5 Updated May 29, 2024

UbiquitousLearning / mllm

Fast Multimodal LLM on Mobile Devices

C++ 1,162 141 Updated Nov 5, 2025

justADeni / intel-npu-llm

A simple Python script for running LLMs on Intel's Neural Processing Units (NPUs)

Python 24 1 Updated Oct 17, 2025

Alibaba-NLP / DeepResearch

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 16,927 1,292 Updated Nov 3, 2025

zipnn / zipnn

A Lossless Compression Library for AI pipelines

Python 285 31 Updated Jul 3, 2025

dingwentao / GPU-lossless-compression

GPU-Accelerated Lossless Data Compressors Survey

Cuda 121 11 Updated Sep 10, 2020

hsharma35 / dnnweaver2

Open Source Specialized Computing Stack for Accelerating Deep Neural Networks.

Jupyter Notebook 224 74 Updated Apr 22, 2019

geochri / AlphaZero_Chess

PyTorch implementation of AlphaZero Chess from scratch

Python 176 34 Updated Aug 7, 2024

microsoft / vattention

Dynamic Memory Management for Serving LLMs without PagedAttention

C 434 33 Updated May 30, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 62,154 11,046 Updated Nov 5, 2025

PKUZHOU / NeoMem-MICRO-2024

The Artifact of NeoMem: Hardware/Software Co-Design for CXL-Native Memory Tiering

59 5 Updated Aug 11, 2024

OpenXiangShan / XiangShan

Open-source high-performance RISC-V processor

Scala 6,718 830 Updated Nov 5, 2025

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 19,765 3,273 Updated Nov 5, 2025

sgl-project / SpecForge

Train speculative decoding models effortlessly and port them smoothly to SGLang serving.

Python 457 105 Updated Nov 5, 2025

SafeAILab / EAGLE

Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3 (NeurIPS'25).

Python 1,971 220 Updated Nov 5, 2025

riscv / integrated-matrix-extension

Forked from riscv/riscv-isa-manual

RISC-V Integrated Matrix Development Repository

TeX 18 Updated Oct 13, 2025

XUANTIE-RV / riscv-matrix-extension-spec

A matrix extension proposal for AI applications under RISC-V architecture

Makefile 154 29 Updated Feb 11, 2025

amd / RyzenAI-SW

AMD Ryzen™ AI Software includes the tools and runtime libraries for optimizing and deploying AI inference on AMD Ryzen™ AI powered PCs.

Python 683 107 Updated Nov 5, 2025

iamshcha / HBM2-PIMSimulator-lab

HBM2-PIM Simulator for lecture at the KAIST AI-PIM Center

C++ 6 1 Updated Jul 3, 2024

accel-sim / gpu-app-collection

A repository where GPU applications are aggregated using a common build flow that supports multiple CUDA versions.

Cuda 81 54 Updated Oct 27, 2025

LaTeX

Google

Go

Docker

Pixel Art

Linux

IPFS

Electron

C#

C++

See all starred topics