Skip to content
View reminisce's full-sized avatar

Block or report reminisce

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

NKIPy: Rapid Prototyping on Trainium

Python 28 9 Updated Jun 19, 2026

NanoGPT (124M) in 90 seconds

Python 5,427 814 Updated Jun 20, 2026

UCCL is an efficient communication library for GPUs, covering collectives, P2P (e.g., KV cache transfer, RL weight transfer), and EP (e.g., GPU-driven)

C++ 1,420 158 Updated Jun 20, 2026

DSPy: The framework for programming—not prompting—language models

Python 35,244 2,993 Updated Jun 18, 2026

USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference

Python 673 80 Updated May 21, 2026

Transformer related optimization, including BERT, GPT

C++ 6,427 935 Updated Mar 27, 2024

Makes ARM NEON documentation accessible (with examples)

413 65 Updated Apr 13, 2024

Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow

Python 25,563 11,640 Updated Jun 7, 2024

TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile、desktop and server. TNN is distinguished by several outstanding features, including its…

C++ 4,641 774 Updated May 9, 2025

MACE is a deep learning inference framework optimized for mobile heterogeneous computing platforms.

C++ 5,041 823 Updated Jun 17, 2024

MNN: A blazing-fast, lightweight inference engine battle-tested by Alibaba, powering high-performance on-device LLMs and Edge AI.

C++ 15,524 2,369 Updated Jun 18, 2026

ncnn is a high-performance neural network inference framework optimized for the mobile platform

C++ 23,394 4,440 Updated Jun 19, 2026

A PyTorch Implementation of Single Shot MultiBox Detector

Python 5,225 1,724 Updated Dec 29, 2021

ResNeSt: Split-Attention Networks

Python 3,261 495 Updated Dec 9, 2022

🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSy…

4,122 398 Updated Jul 25, 2025

scikit-learn: machine learning in Python

Python 66,378 27,087 Updated Jun 20, 2026

Benchmarks for NumPy compatible frameworks.

Python 16 2 Updated Jan 6, 2026

Notebooks for a single-day DL crash course in Chinese

Jupyter Notebook 60 21 Updated Sep 5, 2019

Dive into Deep Learning Compiler

Python 647 95 Updated Jun 19, 2022

架构师技术图谱,助你早日成为架构师

9,662 1,643 Updated Jan 6, 2021

⚡️Optimizing einsum functions in NumPy, Tensorflow, Dask, and more with contraction order optimization.

Python 984 76 Updated Mar 19, 2026

A high performance and generic framework for distributed DNN training

Python 3,722 493 Updated Oct 3, 2023

Books with Jupyter notebooks

Python 264 110 Updated Dec 12, 2023

Documents for MXNet's deepnumpy API

JavaScript 23 4 Updated Dec 9, 2019

An open-access book on numpy vectorization techniques, Nicolas P. Rougier, 2017

Python 2,136 349 Updated May 6, 2025

Bringing Characters to Life with Computer Brains in Unity

C++ 8,771 1,139 Updated Apr 17, 2026

State-of-the-art 2D and 3D Face Analysis Project

Python 29,037 6,043 Updated May 23, 2026

The fundamental package for scientific computing with Python.

Python 32,223 12,463 Updated Jun 20, 2026
Next