Poet-LiBai

🪐

Poet-LiBai

🪐

24 followers · 359 following

Stars

LLM

大模型

53 repositories

ModelTC / LightCompress

[EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models including LLM, VLM, and video generation models.

Python 642 64 Updated Nov 19, 2025

opendatalab / MinerU

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

Python 50,789 4,211 Updated Dec 16, 2025

hijkzzz / Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,870 371 Updated Dec 17, 2025

AIDC-AI / Marco-o1

An Open Large Reasoning Model for Real-World Solutions

Python 1,537 80 Updated May 30, 2025

PrimeIntellect-ai / OpenDiloco

OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training

Python 554 46 Updated Jan 13, 2025

PrimeIntellect-ai / prime-diloco

prime is a framework for efficient, globally distributed training of AI models over the internet.

Python 848 94 Updated Nov 16, 2025

allenai / open-instruct

AllenAI's post-training codebase

Python 3,464 476 Updated Dec 21, 2025

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 21,869 3,825 Updated Dec 22, 2025

graniet / kheish

Kheish: A multi-role LLM agent for tasks like code auditing, file searching, and more seamlessly leveraging RAG and extensible modules.

Rust 142 13 Updated Dec 28, 2024

facebookresearch / blt

Code for BLT research paper

Python 2,018 188 Updated Nov 3, 2025

InternLM / InternLM-XComposer

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Python 2,909 177 Updated May 26, 2025

intel / ipex-llm

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discr…

Python 8,551 1,390 Updated Oct 14, 2025

intel / neural-compressor

SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, and ONNX Runtime

Python 2,552 287 Updated Dec 21, 2025

young-geng / EasyLM

Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.

Python 2,503 258 Updated Aug 13, 2024

wbrickner / noise_step

noise_step: Training in 1.58b With No Gradient Memory

TeX 220 10 Updated Dec 25, 2024

InternLM / lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 7,408 635 Updated Dec 20, 2025

CherryHQ / cherry-studio

🍒 Cherry Studio is a desktop client that supports for multiple LLM providers.

TypeScript 36,820 3,384 Updated Dec 21, 2025

deepseek-ai / DeepSeek-V3

Python 100,799 16,424 Updated Aug 28, 2025

ollama / ollama

Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.

Go 158,008 13,981 Updated Dec 21, 2025

danny-avila / LibreChat

Enhanced ChatGPT Clone: Features Agents, MCP, DeepSeek, Anthropic, AWS, OpenAI, Responses API, Azure, Groq, o1, GPT-5, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message…

TypeScript 32,568 6,466 Updated Dec 20, 2025

lobehub / lobe-chat

🤯 LobeHub - an open-source, modern design AI Agent Workspace. Supports multiple AI providers, Knowledge Base (file upload / RAG ), one click install MCP Marketplace and Artifacts / Thinking. One-cl…

TypeScript 69,294 14,274 Updated Dec 21, 2025

open-webui / open-webui

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

Svelte 118,380 16,675 Updated Dec 21, 2025

Optima-CityU / LLM4AD

LLM4AD: A Platform for Algorithm Design with Large Language Model

Python 552 55 Updated Dec 17, 2025

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & TIS & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,634 838 Updated Dec 18, 2025

datawhalechina / self-llm

《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调（全参数/Lora）、部署国内外开源大模型（LLM）/多模态大模型（MLLM）教程

Jupyter Notebook 26,764 2,679 Updated Dec 20, 2025

modelscope / modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

Python 8,578 893 Updated Dec 18, 2025

daochenzha / data-centric-AI

A curated, but incomplete, list of data-centric AI resources.

1,138 80 Updated Jun 26, 2024

iamhankai / Forest-of-Thought

ICML2025: Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM Reasoning

Python 51 3 Updated May 1, 2025

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 25,830 1,813 Updated Oct 13, 2025

fyyakaxyy / AnimationGPT

AnimationGPT:An AIGC tool for generating game combat motion assets

Python 450 43 Updated Dec 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly