Skip to content
View sgsdxzy's full-sized avatar

Organizations

@HimenoSoft

Block or report sgsdxzy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python 12,529 1,190 Updated Feb 4, 2026

An optimized quantization and inference library for running LLMs locally on modern consumer-class GPUs

Python 625 68 Updated Jan 26, 2026

VPTQ, A Flexible and Extreme low-bit quantization algorithm

Python 674 51 Updated Apr 25, 2025

aider is AI pair programming in your terminal

Python 40,347 3,870 Updated Jan 19, 2026

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 2,685 387 Updated Feb 4, 2026
Python 135 40 Updated Mar 11, 2025

YuE: Open Full-song Generation Foundation Model, something similar to Suno.ai but open

Python 7 Updated Feb 2, 2025

YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open

Python 6,004 709 Updated Jun 4, 2025

Smart proxy for LLM APIs that enables model-specific parameter control, automatic mode switching (like Qwen3's /think and /no_think), and <think> tag filtering. Perfect for using advanced models wi…

Python 50 5 Updated May 19, 2025

🚀 豆包大模型逆向API【特长:超强联网搜索】,零配置部署,多路token支持,仅供测试,如需商用请前往官方开放平台。

TypeScript 659 201 Updated Nov 27, 2025

A Python implementation of global optimization with gaussian processes.

Python 8,543 1,596 Updated Dec 27, 2025

🏝️ OASIS: Open Agent Social Interaction Simulations with One Million Agents.

Python 2,417 265 Updated Feb 3, 2026

Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).

Python 11,779 1,095 Updated Nov 5, 2025

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 11,689 1,186 Updated Nov 21, 2025

LLM-powered multiagent persona simulation for imagination enhancement and business insights.

Jupyter Notebook 7,212 640 Updated Feb 2, 2026

BS::thread_pool: a fast, lightweight, modern, and easy-to-use C++17 / C++20 / C++23 thread pool library

C++ 2,878 311 Updated Jan 4, 2026

Implements harmful/harmless refusal removal using pure HF Transformers

Python 1,478 242 Updated Nov 27, 2025

A lightweight multilingual LLM

Python 1,012 48 Updated Aug 8, 2025

An easy-to-use library for quantizing LLMs

Python 8 3 Updated Jun 20, 2024

Enforce the output format (JSON Schema, Regex etc) of a language model

Python 1,986 81 Updated Aug 24, 2025

A fast inference library for running LLMs locally on modern consumer-class GPUs

Python 4,436 329 Updated Dec 9, 2025
JavaScript 12 2 Updated Sep 22, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 132 14 Updated Jun 25, 2024

Auto convert transformers models to QuaRot.

Python 8 2 Updated Apr 12, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 69,490 13,188 Updated Feb 4, 2026

Structured Outputs

Python 13,369 662 Updated Feb 2, 2026

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Jupyter Notebook 8,459 801 Updated Mar 15, 2025

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

Python 13,192 1,241 Updated Feb 3, 2026

🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, dif…

Go 42,582 3,519 Updated Feb 4, 2026
Next