Organizations

@sysu @CSE512-14W @horseshoe477 @VoteWithYourFeet @atalaya-io @bentoml

Starred repositories

The Modular Platform (includes MAX & Mojo)

Mojo 25,589 2,773 Updated Feb 17, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 203,002 36,640 Updated Feb 17, 2026

Kimi Code CLI is your next CLI agent.

Python 6,444 612 Updated Feb 11, 2026

TurboDiffusion: 100–200× Acceleration for Video Diffusion Models

Python 3,335 232 Updated Feb 16, 2026

A fast communication-overlapping library for tensor/expert parallelism on GPUs.

C++ 1,250 93 Updated Aug 28, 2025

Python 164 12 Updated Jul 22, 2024

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

Python 5,205 452 Updated Feb 16, 2026

Run LLMs with MLX

Python 3,663 426 Updated Feb 17, 2026

dstack is an open-source control plane for running development, training, and inference jobs on GPUs—across hyperscalers, neoclouds, or on-prem.

Python 2,035 214 Updated Feb 17, 2026

Open ABI and FFI for Machine Learning Systems

C++ 346 60 Updated Feb 17, 2026

Deep learning at the speed of light.

Rust 2,766 194 Updated Feb 17, 2026

A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.

Python 754 103 Updated Feb 17, 2026

Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training

C++ 1,860 248 Updated Feb 15, 2026

The best ChatGPT that $100 can buy.

Python 43,516 5,670 Updated Feb 16, 2026

PyTorch Single Controller

Rust 969 137 Updated Feb 17, 2026

🧬 The Huxley-Gödel Machine

Python 325 56 Updated Feb 7, 2026

Development repository for the Triton language and compiler

MLIR 18,437 2,584 Updated Feb 17, 2026

Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk

C++ 14,151 1,217 Updated Oct 29, 2025

💫 Toolkit to help you get started with Spec-Driven Development

Python 70,087 6,044 Updated Feb 12, 2026

Benchmark and optimize LLM inference across frameworks with ease

Python 167 17 Updated Sep 12, 2025

Python 2 Updated Aug 19, 2025

Original Apollo 11 Guidance Computer (AGC) source code for the command and lunar modules.

Assembly 64,597 7,373 Updated Jan 22, 2026

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 4,660 464 Updated Oct 27, 2025

[DEPRECATED] Moved to ROCm/rocm-libraries repo. NOTE: develop branch is maintained as a read-only mirror

C++ 520 271 Updated Feb 17, 2026

super repo for rocm systems projects

C++ 252 138 Updated Feb 17, 2026

Open Source framework for voice and multimodal conversational AI

Python 10,308 1,731 Updated Feb 16, 2026

[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models

Python 3,685 223 Updated Feb 14, 2026

PyTorch native quantization and sparsity for training and inference

Python 2,691 429 Updated Feb 17, 2026

A better build tool for Java, Scala and Kotlin: Simpler than Maven, easier than Gradle, with 3-7x faster dev workflows than other JVM build tools

Scala 2,693 431 Updated Feb 16, 2026

A nano Claude Code–like agent, built from 0 to 1

Python 17,149 3,629 Updated Feb 16, 2026