- Japan
- @sandriver1987
Starred repositories
Instant voice cloning by MIT and MyShell. Audio foundation model.
Official inference framework for 1-bit LLMs
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
🌐 The Internet Computer! Free, Open-Source, and Self-Hostable.
f.k.a. Awesome ChatGPT Prompts. Share, discover, and collect prompts from the community. Free and open source — self-host for your organization with complete privacy.
Stable Diffusion web UI
[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"
picobyte / stable-diffusion-webui-wd14-tagger
Forked from kawalain/stable-diffusion-webui-wd14-taggerLabeling extension for Automatic1111's Web UI
Distributed LLM inference. Connect home devices into a powerful cluster to accelerate LLM inference. More devices means faster inference.
This guide has been archived. Please see https://github.com/awsdocs/amazon-s3-userguide for an open source version of the Amazon S3 docs.
Code associated with Tuning Language Models by Proxy (Liu et al., 2024)
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discr…
mergekit-evolve for elyza task 100
thumbor is an open-source photo thumbnail service by globo.com
🚀🀄️ A fast and strong AI for riichi mahjong, powered by Rust and deep reinforcement learning.
Official implementation of Half-Quadratic Quantization (HQQ)
StanfordBDHG / llama.cpp
Forked from ggml-org/llama.cppSpezi LLM inference in C/C++
llama and other large language models on iOS and MacOS offline using GGML library.
MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks
Distribute and run LLMs with a single file.
Convert PDF to markdown + JSON quickly with high accuracy