muxamilian

Maximilian Bachl muxamilian

31 followers · 17 following

Axelera AI
Vienna
https://scholar.google.at/citations?user=llysaaMAAAAJ

Achievements

Lists (3)

Sort

Stars

245 results for source starred repositories

Clear filter

InferenceMAX / InferenceMAX

Open Source Continuous Inference Benchmarking - GB200 NVL72 vs MI355X vs B200 vs H200 vs MI325X & soon™ TPUv6e/v7/Trainium2/3/GB300 NVL72 - DeepSeek 670B MoE, GPTOSS

Python 330 44 Updated Nov 11, 2025

orangepi-xunlong / wiringOP

wiringPi for Orange Pi

C 511 211 Updated Oct 11, 2025

openshmem-org / specification

OpenSHMEM Application Programming Interface

TeX 60 42 Updated Nov 11, 2024

airockchip / rknn-toolkit2

C 2,289 254 Updated Jul 29, 2025

airockchip / rknn_model_zoo

C 1,947 340 Updated Apr 9, 2025

cocotb / cocotb

cocotb: Python-based chip (RTL) verification

Python 2,140 592 Updated Nov 11, 2025

alexforencich / cocotbext-pcie

PCI express simulation framework for Cocotb

Python 182 56 Updated Sep 8, 2025

Krishnaps / Gem5-PCI-Express

Gem5 with PCI Express integrated.

C++ 22 7 Updated Sep 29, 2018

omnetpp / omnetpp

OMNeT++ Discrete Event Simulator

C 704 166 Updated Nov 10, 2025

zephyrproject-rtos / zephyr

Primary Git Repository for the Zephyr Project. Zephyr is a new generation, scalable, optimized, secure RTOS for multiple hardware architectures.

C 13,645 8,206 Updated Nov 10, 2025

AILab-CVC / YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 5,990 567 Updated Feb 26, 2025

THU-MIG / yoloe

YOLOE: Real-Time Seeing Anything [ICCV 2025]

Python 1,889 180 Updated Jun 26, 2025

GeeeekExplorer / nano-vllm

Nano vLLM

Python 8,641 1,046 Updated Nov 3, 2025

openhwgroup / cva6

The CORE-V CVA6 is a highly configurable, 6-stage RISC-V core for both application and embedded applications. Application class configurations are capable of booting Linux.

Assembly 2,678 847 Updated Nov 10, 2025

microsoft / onnxruntime-genai

Generative AI extensions for onnxruntime

C++ 874 225 Updated Nov 11, 2025

Tencent / ncnn

ncnn is a high-performance neural network inference framework optimized for the mobile platform

C++ 22,256 4,349 Updated Nov 11, 2025

Dao-AILab / flash-attention

Fast and memory-efficient exact attention

Python 20,439 2,126 Updated Nov 9, 2025

microsoft / mscclpp

MSCCL++: A GPU-driven communication stack for scalable AI applications

C++ 433 72 Updated Nov 11, 2025

NVIDIA / nccl

Optimized primitives for collective multi-GPU communication

C++ 4,216 1,063 Updated Nov 10, 2025

Mellanox / nccl-rdma-sharp-plugins

RDMA and SHARP plugins for nccl library

C 212 40 Updated Oct 21, 2025

openucx / ucc

Unified Collective Communication Library

C 278 118 Updated Nov 9, 2025

deepseek-ai / profile-data

Analyze computation-communication overlap in V3/R1.

1,116 143 Updated Mar 21, 2025

InternLM / lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 7,239 617 Updated Nov 11, 2025

mlc-ai / mlc-llm

Universal LLM Deployment Engine with ML Compilation

Python 21,587 1,852 Updated Nov 4, 2025

floooh / chips-test

Tests and sample code for https://github.com/floooh/chips

C 460 49 Updated Sep 15, 2025

ai-dynamo / dynamo

A Datacenter Scale Distributed Inference Serving Framework

Rust 5,444 677 Updated Nov 11, 2025

sycuricon / riscv-spike-sdk

Run Linux on RISC-V Spike Simulator

Makefile 63 18 Updated Oct 18, 2025

gtcasl / qsim

QEMU based emulation library for micro-architectural simulation (ARM64 and x86)

C 43 17 Updated Jun 30, 2019

sstsimulator / sst-core

SST Structural Simulation Toolkit Parallel Discrete Event Core and Services

C++ 179 102 Updated Nov 10, 2025

axelera-ai-hub / voyager-sdk

To ensure developers can get the most out of our performance-leading hardware, we built the Voyager™ SDK which facilitates the development of high-performance applications.

Python 76 13 Updated Oct 21, 2025

Maximilian Bachl muxamilian

Lists (3)

collective_communication

EBS and QEMU

LLM serving

Stars