Skip to content
View muxamilian's full-sized avatar

Block or report muxamilian

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
245 results for source starred repositories
Clear filter

Open Source Continuous Inference Benchmarking - GB200 NVL72 vs MI355X vs B200 vs H200 vs MI325X & soon™ TPUv6e/v7/Trainium2/3/GB300 NVL72 - DeepSeek 670B MoE, GPTOSS

Python 330 44 Updated Nov 11, 2025

wiringPi for Orange Pi

C 511 211 Updated Oct 11, 2025

OpenSHMEM Application Programming Interface

TeX 60 42 Updated Nov 11, 2024

cocotb: Python-based chip (RTL) verification

Python 2,140 592 Updated Nov 11, 2025

PCI express simulation framework for Cocotb

Python 182 56 Updated Sep 8, 2025

Gem5 with PCI Express integrated.

C++ 22 7 Updated Sep 29, 2018

OMNeT++ Discrete Event Simulator

C 704 166 Updated Nov 10, 2025

Primary Git Repository for the Zephyr Project. Zephyr is a new generation, scalable, optimized, secure RTOS for multiple hardware architectures.

C 13,645 8,206 Updated Nov 10, 2025

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 5,990 567 Updated Feb 26, 2025

YOLOE: Real-Time Seeing Anything [ICCV 2025]

Python 1,889 180 Updated Jun 26, 2025

Nano vLLM

Python 8,641 1,046 Updated Nov 3, 2025

The CORE-V CVA6 is a highly configurable, 6-stage RISC-V core for both application and embedded applications. Application class configurations are capable of booting Linux.

Assembly 2,678 847 Updated Nov 10, 2025

Generative AI extensions for onnxruntime

C++ 874 225 Updated Nov 11, 2025

ncnn is a high-performance neural network inference framework optimized for the mobile platform

C++ 22,256 4,349 Updated Nov 11, 2025

Fast and memory-efficient exact attention

Python 20,439 2,126 Updated Nov 9, 2025

MSCCL++: A GPU-driven communication stack for scalable AI applications

C++ 433 72 Updated Nov 11, 2025

Optimized primitives for collective multi-GPU communication

C++ 4,216 1,063 Updated Nov 10, 2025

RDMA and SHARP plugins for nccl library

C 212 40 Updated Oct 21, 2025

Unified Collective Communication Library

C 278 118 Updated Nov 9, 2025

Analyze computation-communication overlap in V3/R1.

1,116 143 Updated Mar 21, 2025

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 7,239 617 Updated Nov 11, 2025

Universal LLM Deployment Engine with ML Compilation

Python 21,587 1,852 Updated Nov 4, 2025

Tests and sample code for https://github.com/floooh/chips

C 460 49 Updated Sep 15, 2025

A Datacenter Scale Distributed Inference Serving Framework

Rust 5,444 677 Updated Nov 11, 2025

Run Linux on RISC-V Spike Simulator

Makefile 63 18 Updated Oct 18, 2025

QEMU based emulation library for micro-architectural simulation (ARM64 and x86)

C 43 17 Updated Jun 30, 2019

SST Structural Simulation Toolkit Parallel Discrete Event Core and Services

C++ 179 102 Updated Nov 10, 2025

To ensure developers can get the most out of our performance-leading hardware, we built the Voyager™ SDK which facilitates the development of high-performance applications.

Python 76 13 Updated Oct 21, 2025
Next