Skip to content
View wanghqc's full-sized avatar
  • Qualcomm
  • San Diego, CA, USA
  • 15:36 (UTC -08:00)
  • LinkedIn in/hongqiang

Block or report wanghqc

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 188,899 31,995 Updated Feb 12, 2026

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 9,271 1,678 Updated Feb 11, 2026

Samples for CUDA Developers which demonstrates features in CUDA Toolkit

C 8,839 2,266 Updated Jan 6, 2026

MLX: An array framework for Apple silicon

C++ 23,908 1,508 Updated Feb 12, 2026

Distributed MoE in a Single Kernel [NeurIPS '25]

Cuda 193 21 Updated Feb 7, 2026

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,767 2,036 Updated Jan 13, 2026

LLM inference in C/C++

C++ 20 4 Updated Oct 22, 2025

Real-time webcam demo with SmolVLM and llama.cpp server

HTML 5,524 893 Updated May 12, 2025

Beignet is an open source implementation of the OpenCL specification - a generic compute oriented API. Here is Beignet Source Code Mirror in github- This is a publish-only repository and all pull r…

C++ 101 40 Updated Jan 7, 2023

Compute Benchmarks for oneAPI Level Zero and OpenCL™ Driver

C++ 41 43 Updated Feb 12, 2026

The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs and CPUs via OpenCL. Free for non-commercial use.

C++ 4,891 448 Updated Jan 19, 2026

Qualcomm® AI Hub Models is our collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) and ready to deploy on Qualcomm® devices.

Python 918 158 Updated Feb 12, 2026

pocl - Portable Computing Language

C 1,050 283 Updated Feb 12, 2026

LM Studio CLI

TypeScript 4,184 330 Updated Feb 12, 2026

Microsoft Automatic Mixed Precision Library

Python 636 49 Updated Dec 1, 2025

Print all known information about all available OpenCL platforms and devices in the system

C 371 84 Updated Dec 19, 2025

A comprehensive 10-page probability cheatsheet that covers a semester's worth of introduction to probability.

TeX 3,139 702 Updated Jun 15, 2022

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 18,204 2,701 Updated Nov 3, 2025

lightweight, standalone C++ inference engine for Google's Gemma models.

C++ 6,730 599 Updated Feb 12, 2026

Notes about "Attention is all you need" video (https://www.youtube.com/watch?v=bCz4OMemCcA)

337 76 Updated May 28, 2023

[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding

Python 1,317 79 Updated Mar 6, 2025

A C++ GPU Computing Library for OpenCL

C++ 1,645 340 Updated Feb 6, 2026

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,962 1,866 Updated Jul 15, 2025

Llama中文社区,实时汇总最新Llama学习资料,构建最好的中文Llama大模型开源生态,完全开源可商用

Python 14,748 1,306 Updated Apr 6, 2025

LlamaIndex is the leading framework for building LLM-powered agents over your data.

Python 46,965 6,811 Updated Feb 12, 2026

Inference Llama 2 in one file of pure C

C 19,174 2,448 Updated Aug 6, 2024

Inference code for Llama models

Python 59,144 9,823 Updated Jan 26, 2025

A curated list of awesome computer vision resources

23,052 4,431 Updated May 17, 2024

LLM inference in C/C++

C++ 94,939 14,890 Updated Feb 12, 2026
Next