Skip to content
View antsuge's full-sized avatar

Highlights

  • Pro

Block or report antsuge

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

ResearchClaw is a personal AI assistant built for research: fast to set up, easy to run locally or in the cloud, and ready to integrate with the chat apps you already use. With extensible skills, i…

Python 247 24 Updated Apr 4, 2026

Vue + SpringBoot + RocketMQ + Redis 智能AI视频解析平台。用户可通过 本地视频 或 在线链接 ,一键提取音频,文字,AI总结概括。涉及消息队列异步化、分布式锁防并发、分片上传与断点续传等等

Java 57 3 Updated Mar 21, 2026

CS33作业 2 的代码和飞书 qa, 这个作业太恶心了, 绝对是所有作业里面花的最久的

Python 22 Updated Jul 17, 2025

最新Docker容器技术,从真实案例中学习最佳实践!| Learn and understand Docker&Container technologies, with real DevOps practice!

Go 25,947 5,784 Updated Apr 5, 2026

slime is an LLM post-training framework for RL Scaling.

Python 5,135 692 Updated Apr 5, 2026

A tutorial for CUDA&PyTorch

Cuda 384 53 Updated Mar 23, 2026

分享AI Infra知识&代码练习:PyTorch/vLLM/SGLang框架入门⚡️、性能加速🚀、大模型基础🧠、AI软硬件🔧等

Jupyter Notebook 1,525 122 Updated Apr 6, 2026

AIInfra(AI 基础设施)指AI系统从底层芯片等硬件,到上层软件栈支持AI大模型训练和推理。

Jupyter Notebook 6,624 873 Updated Dec 22, 2025

Open deep learning compiler stack for Kendryte AI accelerators ✨

C# 870 206 Updated Mar 26, 2026

Open ABI and FFI for Machine Learning Systems

C++ 375 67 Updated Apr 2, 2026

PIM Runtime Library and Tools

C++ 28 10 Updated Dec 18, 2023

A list of awesome compiler projects and papers for tensor computation and deep learning.

2,734 324 Updated Oct 19, 2024

Backward compatible ML compute opset inspired by HLO/MHLO

MLIR 634 188 Updated Mar 30, 2026

Distributed Compiler based on Triton for Parallel Systems

Python 1,401 136 Updated Mar 11, 2026

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

Python 5,464 497 Updated Apr 5, 2026

compiler learning resources collect.

Python 2,702 368 Updated Mar 19, 2025

An experimental CPU backend for Triton

MLIR 185 39 Updated Apr 2, 2026

OneDiff: An out-of-the-box acceleration library for diffusion models.

Jupyter Notebook 1,972 127 Updated Dec 4, 2025

PyGim is the first runtime framework to efficiently execute Graph Neural Networks (GNNs) on real Processing-in-Memory systems. It provides a high-level Python interface, currently integrated with P…

C 37 2 Updated Apr 23, 2025

PiDRAM is the first flexible end-to-end framework that enables system integration studies and evaluation of real Processing-using-Memory techniques. Prototype on a RISC-V rocket chip system impleme…

73 13 Updated Dec 11, 2023

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

77,845 9,029 Updated Feb 5, 2026

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism

Python 2,588 315 Updated Apr 1, 2026

Ongoing research training transformer models at scale

Python 15,929 3,786 Updated Apr 6, 2026

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 10,154 1,034 Updated Apr 5, 2026
10 Updated Apr 2, 2025

BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.

Python 757 59 Updated Aug 6, 2025

Mirage Persistent Kernel: Compiling LLMs into a MegaKernel

C++ 2,179 188 Updated Apr 2, 2026

Open-source Framework for HPCA2024 paper: Gemini: Mapping and Architecture Co-exploration for Large-scale DNN Chiplet Accelerators

C++ 112 23 Updated Apr 28, 2025

深度学习经典、新论文逐段精读

32,829 2,781 Updated Mar 22, 2025
Next