Skip to content
View TKH666's full-sized avatar

Highlights

  • Pro

Block or report TKH666

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

[HPCA 2026] Official implementation of "Focus: A Streaming Concentration Architecture for Efficient Vision-Language Models"

Python 14 1 Updated Dec 23, 2025

✨✨Latest Advances on Multimodal Large Language Models

17,062 1,098 Updated Dec 26, 2025

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,462 465 Updated Dec 18, 2025

This is the offical repository of InfiniteVL

Python 62 3 Updated Dec 16, 2025

🚀 Efficient implementations of state-of-the-art linear attention models

Python 4,125 340 Updated Dec 25, 2025

a clone of POCL that includes RISC-V newlib devices support and Vortex

C 49 18 Updated Nov 24, 2025

基于NPS 0.26.10 版本二开而来,NPS接力项目。

Go 3,120 392 Updated Dec 12, 2025

A Survey of Efficient Attention Methods: Hardware-efficient, Sparse, Compact, and Linear Attention

267 5 Updated Dec 1, 2025

Deep Learning Primitives and Mini-Framework for OpenCL

C++ 205 21 Updated Sep 9, 2024

DLPrimitives/OpenCL out of tree backend for pytorch

C++ 382 29 Updated Nov 26, 2025

Mobile-Agent: The Powerful GUI Agent Family

Python 6,825 698 Updated Dec 2, 2025

Chat with AI large language models running natively in your browser. Enjoy private, server-free, seamless AI conversations.

TypeScript 915 174 Updated Sep 3, 2025

🎯 告别信息过载,AI 助你看懂新闻资讯热点,简单的舆情监控分析 - 多平台热点聚合+基于 MCP 的AI分析工具。监控35个平台(抖音、知乎、B站、华尔街见闻、财联社等),智能筛选+自动推送+AI对话分析(用自然语言深度挖掘新闻:趋势追踪、情感分析、相似检索等13种工具)。支持企业微信/个人微信/飞书/钉钉/Telegram/邮件/ntfy/bark/slack 推送,1分钟手机通知,无需…

Python 40,450 20,915 Updated Dec 23, 2025

A small OpenCL benchmark program to measure peak GPU/CPU performance.

C++ 270 35 Updated Nov 23, 2025
Python 89 13 Updated Nov 16, 2025
Python 298 19 Updated Apr 8, 2025

Scientific computing with Metal in C++: Matrix multiplication example

C++ 44 2 Updated Sep 18, 2022

Tilus is a tile-level kernel programming language with explicit control over shared memory and registers.

Python 434 15 Updated Dec 16, 2025

Derived from Nemes' gpuperftests

C++ 33 6 Updated Jul 11, 2024

A curated collection of fun and creative examples generated with Nano Banana & Nano Banana Pro🍌, Gemini-2.5-flash-image based model. We also release Nano-consistent-150K openly to support the commu…

19,174 2,000 Updated Dec 12, 2025

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 12,479 1,979 Updated Dec 26, 2025

Demonstration of running a native LLM on Android device.

Python 209 23 Updated Dec 21, 2025
C++ 41 6 Updated Dec 16, 2025

Self-implemented NN operators for Qualcomm's Hexagon NPU

C 34 6 Updated Sep 30, 2025

Run Stable Diffusion on Android Devices with Snapdragon NPU acceleration. Also supports CPU/GPU inference.

Kotlin 1,413 76 Updated Dec 17, 2025

Trying to figure various CPU things out

C 153 23 Updated Dec 16, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 17,407 1,457 Updated Nov 28, 2025

将SmolVLM2的视觉头与Qwen3-0.6B模型进行了拼接微调

Python 480 48 Updated Sep 8, 2025

Mirage Persistent Kernel: Compiling LLMs into a MegaKernel

C++ 2,008 161 Updated Dec 20, 2025

Cloud Mail - Simple Email Service on Cloudflare | 基于 Cloudflare 的简约响应式邮箱服务 | Cloudflare Email 邮箱 Mail

JavaScript 3,710 3,131 Updated Dec 10, 2025
Next