Skip to content
View YuY-SuN's full-sized avatar

Block or report YuY-SuN

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 36,204 4,033 Updated Apr 19, 2025

Official inference framework for 1-bit LLMs

Python 37,863 3,364 Updated Mar 10, 2026

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

Python 86,530 10,006 Updated Apr 7, 2026

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 18,917 1,721 Updated Jan 30, 2026

🌐 The Internet Computer! Free, Open-Source, and Self-Hostable.

JavaScript 40,191 3,601 Updated Apr 8, 2026

f.k.a. Awesome ChatGPT Prompts. Share, discover, and collect prompts from the community. Free and open source — self-host for your organization with complete privacy.

HTML 158,283 20,724 Updated Apr 8, 2026

Stable Diffusion web UI

Python 162,197 30,237 Updated Mar 2, 2026

[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"

Python 450 23 Updated Oct 16, 2024

Labeling extension for Automatic1111's Web UI

Python 684 84 Updated May 14, 2024

Let us control diffusion models!

Python 33,789 3,006 Updated Feb 25, 2024

Distributed LLM inference. Connect home devices into a powerful cluster to accelerate LLM inference. More devices means faster inference.

C++ 2,889 220 Updated Feb 10, 2026

This guide has been archived. Please see https://github.com/awsdocs/amazon-s3-userguide for an open source version of the Amazon S3 docs.

Java 222 217 Updated Jan 20, 2021

Code associated with Tuning Language Models by Proxy (Liu et al., 2024)

Python 129 18 Updated Mar 30, 2024

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discr…

Python 8,756 1,412 Updated Jan 28, 2026

mergekit-evolve for elyza task 100

Python 9 2 Updated Apr 28, 2024

🌸 A command-line fuzzy finder

Go 79,385 2,755 Updated Apr 8, 2026

thumbor is an open-source photo thumbnail service by globo.com

Python 10,477 863 Updated Apr 7, 2026

A fast MoE impl for PyTorch

Python 1,846 204 Updated Feb 10, 2025

🚀🀄️ A fast and strong AI for riichi mahjong, powered by Rust and deep reinforcement learning.

Rust 1,405 186 Updated Sep 28, 2025

FuseAI Project

Python 592 37 Updated Jan 25, 2025

Official implementation of Half-Quadratic Quantization (HQQ)

Python 925 90 Updated Feb 26, 2026

llama.cpp for Flutter

C++ 205 35 Updated Apr 4, 2026

Spezi LLM inference in C/C++

C++ 29 5 Updated May 12, 2024

A mobile Implementation of llama.cpp

Dart 327 36 Updated Feb 1, 2024

MLX: An array framework for Apple silicon

C++ 25,213 1,665 Updated Apr 7, 2026

llama and other large language models on iOS and MacOS offline using GGML library.

C 2,007 165 Updated Jan 30, 2026

MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks

Jupyter Notebook 8,782 559 Updated Feb 11, 2026

Distribute and run LLMs with a single file.

C++ 24,080 1,308 Updated Apr 8, 2026

Convert PDF to markdown + JSON quickly with high accuracy

Python 33,514 2,316 Updated Apr 4, 2026
Next