sgsdxzy

25 followers · 3 following

Stars

modelscope / ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python 12,529 1,190 Updated Feb 4, 2026

turboderp-org / exllamav3

An optimized quantization and inference library for running LLMs locally on modern consumer-class GPUs

Python 625 68 Updated Jan 26, 2026

microsoft / VPTQ

VPTQ, A Flexible and Extreme low-bit quantization algorithm

Python 674 51 Updated Apr 25, 2025

Aider-AI / aider

aider is AI pair programming in your terminal

Python 40,347 3,870 Updated Jan 19, 2026

vllm-project / llm-compressor

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 2,685 387 Updated Feb 4, 2026

sgsdxzy / YuE-exllamav2

Python 135 40 Updated Mar 11, 2025

sgsdxzy / YuE-exllamav2-fork

Forked from AlpinDale/Better-YuE

YuE: Open Full-song Generation Foundation Model, something similar to Suno.ai but open

Python 7 Updated Feb 2, 2025

multimodal-art-projection / YuE

YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open

Python 6,004 709 Updated Jun 4, 2025

bold84 / cot_proxy

Smart proxy for LLM APIs that enables model-specific parameter control, automatic mode switching (like Qwen3's /think and /no_think), and <think> tag filtering. Perfect for using advanced models wi…

Python 50 5 Updated May 19, 2025

LLM-Red-Team / doubao-free-api

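🚀 Reverse-engineered API for the Doubao large model [Specialty: powerful web search], zero-config deployment, multi-token support, for testing only; for commercial use, please go to the official open platform.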

TypeScript 659 201 Updated Nov 27, 2025

bayesian-optimization / BayesianOptimization

A Python implementation of global optimization with gaussian processes.

Python 8,543 1,596 Updated Dec 27, 2025

camel-ai / oasis

🏝️ OASIS: Open Agent Social Interaction Simulations with One Million Agents.

Python 2,417 265 Updated Feb 3, 2026

microsoft / TRELLIS

Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).

Python 11,779 1,095 Updated Nov 5, 2025

Tencent-Hunyuan / HunyuanVideo

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 11,689 1,186 Updated Nov 21, 2025

kijai / ComfyUI-HunyuanVideoWrapper

Python 2,573 204 Updated Aug 20, 2025

microsoft / TinyTroupe

LLM-powered multiagent persona simulation for imagination enhancement and business insights.

Jupyter Notebook 7,212 640 Updated Feb 2, 2026

bshoshany / thread-pool

BS::thread_pool: a fast, lightweight, modern, and easy-to-use C++17 / C++20 / C++23 thread pool library

C++ 2,878 311 Updated Jan 4, 2026

Sumandora / remove-refusals-with-transformers

Implements harmful/harmless refusal removal using pure HF Transformers

Python 1,478 242 Updated Nov 27, 2025

bilibili / Index-1.9B

A lightweight multilingual LLM

Python 1,012 48 Updated Aug 8, 2025

PygmalionAI / quantizers

An easy-to-use library for quantizing LLMs

Python 8 3 Updated Jun 20, 2024

noamgat / lm-format-enforcer

Enforce the output format (JSON Schema, Regex etc) of a language model

Python 1,986 81 Updated Aug 24, 2025

turboderp-org / exllamav2

A fast inference library for running LLMs locally on modern consumer-class GPUs

Python 4,436 329 Updated Dec 9, 2025

lucyknada / detective-needle-llm

JavaScript 12 2 Updated Sep 22, 2024

chu-tianxiang / vllm-gptq

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 132 14 Updated Jun 25, 2024

sgsdxzy / AutoQuarot

Auto convert transformers models to QuaRot.

Python 8 2 Updated Apr 12, 2024

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 69,490 13,188 Updated Feb 4, 2026

dottxt-ai / outlines

Structured Outputs

Python 13,369 662 Updated Feb 2, 2026

jasonppy / VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Jupyter Notebook 8,459 801 Updated Mar 15, 2025

QwenLM / Qwen-Agent

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

Python 13,192 1,241 Updated Feb 3, 2026

mudler / LocalAI

🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, dif…

Go 42,582 3,519 Updated Feb 4, 2026