Skip to content
View BruceYang-yeu's full-sized avatar
:octocat:
I may be slow to respond.
:octocat:
I may be slow to respond.

Block or report BruceYang-yeu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Android Live Demo inferenece of Yolov7 using ncnn

C++ 155 37 Updated Jul 23, 2022

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 87,992 10,076 Updated Dec 19, 2025

Me patching up the `stress` tool to build properly on school computers

C 12 8 Updated Dec 15, 2025

Machine Learning Journal for Intermediate to Advanced Topics.

Jupyter Notebook 2,251 245 Updated Sep 8, 2025

A natural language interface for computers

Python 61,141 5,242 Updated Dec 5, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 21,739 3,809 Updated Dec 19, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 5,977 778 Updated Dec 8, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 8,814 1,033 Updated Dec 5, 2025

12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all

Jupyter Notebook 82,386 19,325 Updated Dec 19, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 65,778 12,066 Updated Dec 19, 2025

llm deploy project based onnx.

C++ 47 9 Updated Oct 9, 2024

MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/READ…

C++ 13,735 2,141 Updated Dec 19, 2025

www.giantpandacv.com

Python 153 31 Updated Jun 20, 2024

LLM101n: Let's build a Storyteller

35,898 1,961 Updated Aug 1, 2024

Material for gpu-mode lectures

Jupyter Notebook 5,435 552 Updated Dec 8, 2025

高性能并行编程与优化 - 课件

C++ 4,126 560 Updated Oct 18, 2024

A simple project (without tests) to get started with (modern) CMake and CMakePresets.json

CMake 1 Updated Dec 30, 2023

A collection of resources on modern C++

HTML 12,783 1,214 Updated Aug 20, 2024

CMake Tools provides a robust, convenient workflow for CMake projects in VS Code. It simplifies configurations with CMake presets, supports IntelliSense and built-in debugging for CMake scripts, an…

TypeScript 1,627 514 Updated Dec 18, 2025

Vundle, the plug-in manager for Vim

Vim Script 24,014 2,558 Updated Jul 30, 2024

A General-purpose Task-parallel Programming System using Modern C++

C++ 11,487 1,343 Updated Dec 19, 2025

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 3,393 284 Updated Jul 17, 2025

High-Performance C++ Fundamental Library

C++ 625 93 Updated Nov 9, 2025

Image Signal Processor

Python 1,338 457 Updated Feb 1, 2023

Samples for CUDA Developers which demonstrates features in CUDA Toolkit

C 8,588 2,223 Updated Sep 5, 2025

Efficient inference of large language models.

C++ 150 7 Updated Sep 28, 2025

Collaborative Collection of C++ Best Practices. This online resource is part of Jason Turner's collection of C++ Best Practices resources. See README.md for more information.

8,644 910 Updated Aug 6, 2024

A library for efficient similarity search and clustering of dense vectors.

C++ 38,482 4,156 Updated Dec 19, 2025

Computer Architecture Written Node in Chinese | 计算机系统结构的学习笔记

169 16 Updated Oct 22, 2023
Next