Stars

LLM

Large models
53 repositories

[EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models including LLM, VLM, and video generation models.

Python 642 64 Updated Nov 19, 2025

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

Python 50,789 4,211 Updated Dec 16, 2025

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,870 371 Updated Dec 17, 2025

An Open Large Reasoning Model for Real-World Solutions

Python 1,537 80 Updated May 30, 2025

OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training

Python 554 46 Updated Jan 13, 2025

prime is a framework for efficient, globally distributed training of AI models over the internet.

Python 848 94 Updated Nov 16, 2025

AllenAI's post-training codebase

Python 3,464 476 Updated Dec 21, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 21,869 3,825 Updated Dec 22, 2025

Kheish: A multi-role LLM agent for tasks like code auditing, file searching, and more, seamlessly leveraging RAG and extensible modules.

Rust 142 13 Updated Dec 28, 2024

Code for BLT research paper

Python 2,018 188 Updated Nov 3, 2025

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Python 2,909 177 Updated May 26, 2025

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discr…

Python 8,551 1,390 Updated Oct 14, 2025

SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, and ONNX Runtime

Python 2,552 287 Updated Dec 21, 2025

Large language models (LLMs) made easy: EasyLM is a one-stop solution for pre-training, fine-tuning, evaluating, and serving LLMs in JAX/Flax.

Python 2,503 258 Updated Aug 13, 2024

noise_step: Training in 1.58b With No Gradient Memory

TeX 220 10 Updated Dec 25, 2024

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 7,408 635 Updated Dec 20, 2025

🍒 Cherry Studio is a desktop client that supports multiple LLM providers.

TypeScript 36,820 3,384 Updated Dec 21, 2025

Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.

Go 158,008 13,981 Updated Dec 21, 2025

Enhanced ChatGPT Clone: Features Agents, MCP, DeepSeek, Anthropic, AWS, OpenAI, Responses API, Azure, Groq, o1, GPT-5, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message…

TypeScript 32,568 6,466 Updated Dec 20, 2025

🤯 LobeHub - an open-source, modern design AI Agent Workspace. Supports multiple AI providers, Knowledge Base (file upload / RAG ), one click install MCP Marketplace and Artifacts / Thinking. One-cl…

TypeScript 69,294 14,274 Updated Dec 21, 2025

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

Svelte 118,380 16,675 Updated Dec 21, 2025

LLM4AD: A Platform for Algorithm Design with Large Language Model

Python 552 55 Updated Dec 17, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & TIS & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,634 838 Updated Dec 18, 2025

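"A Practical Guide to Open-Source Large Models" (《开源大模型食用指南》): a tutorial tailored for beginners in China on quickly fine-tuning (full-parameter/LoRA) and deploying Chinese and international open-source large language models (LLMs) and multimodal large models (MLLMs) in a Linux environment.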

Jupyter Notebook 26,764 2,679 Updated Dec 20, 2025

ModelScope: bring the notion of Model-as-a-Service to life.

Python 8,578 893 Updated Dec 18, 2025

A curated, but incomplete, list of data-centric AI resources.

1,138 80 Updated Jun 26, 2024

ICML2025: Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM Reasoning

Python 51 3 Updated May 1, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 25,830 1,813 Updated Oct 13, 2025

AnimationGPT: An AIGC tool for generating game combat motion assets

Python 450 43 Updated Dec 14, 2024