Starred repositories

Android live demo of YOLOv7 inference using ncnn

C++ 155 37 Updated Jul 23, 2022

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 88,575 10,174 Updated Dec 24, 2025

Me patching up the `stress` tool to build properly on school computers

C 12 8 Updated Dec 15, 2025

Machine Learning Journal for Intermediate to Advanced Topics.

Jupyter Notebook 2,252 245 Updated Sep 8, 2025

A natural language interface for computers

Python 61,273 5,252 Updated Dec 5, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 21,957 3,862 Updated Dec 25, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 5,997 779 Updated Dec 23, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 8,830 1,037 Updated Dec 24, 2025

12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all

Jupyter Notebook 82,519 19,405 Updated Dec 25, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 66,141 12,177 Updated Dec 25, 2025

LLM deployment project based on ONNX.

C++ 48 8 Updated Oct 9, 2024

MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/READ…

C++ 13,774 2,147 Updated Dec 24, 2025

www.giantpandacv.com

Python 153 31 Updated Jun 20, 2024

LLM101n: Let's build a Storyteller

35,948 1,962 Updated Aug 1, 2024

Material for gpu-mode lectures

Jupyter Notebook 5,448 553 Updated Dec 8, 2025

High-Performance Parallel Programming and Optimization - Courseware

C++ 4,131 560 Updated Oct 18, 2024

A simple project (without tests) to get started with (modern) CMake and CMakePresets.json

CMake 1 Updated Dec 30, 2023

A collection of resources on modern C++

HTML 12,790 1,217 Updated Aug 20, 2024

CMake Tools provides a robust, convenient workflow for CMake projects in VS Code. It simplifies configurations with CMake presets, supports IntelliSense and built-in debugging for CMake scripts, an…

TypeScript 1,626 514 Updated Dec 18, 2025

Vundle, the plug-in manager for Vim

Vim Script 24,014 2,556 Updated Jul 30, 2024

A General-purpose Task-parallel Programming System using Modern C++

C++ 11,497 1,345 Updated Dec 24, 2025

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 3,399 286 Updated Jul 17, 2025

High-Performance C++ Fundamental Library

C++ 625 93 Updated Dec 22, 2025

Image Signal Processor

Python 1,340 457 Updated Feb 1, 2023

Samples for CUDA Developers which demonstrates features in CUDA Toolkit

C 8,605 2,227 Updated Sep 5, 2025

Efficient inference of large language models.

C++ 150 7 Updated Sep 28, 2025

Collaborative Collection of C++ Best Practices. This online resource is part of Jason Turner's collection of C++ Best Practices resources. See README.md for more information.

8,645 908 Updated Aug 6, 2024

A library for efficient similarity search and clustering of dense vectors.

C++ 38,524 4,163 Updated Dec 23, 2025

Computer Architecture Written Notes in Chinese | Study notes on computer system architecture

169 16 Updated Oct 22, 2023