Stars
Master programming by recreating your favorite technologies from scratch.
This repo curates ChatGPT prompts to help you use ChatGPT and other LLM tools better.
Robust Speech Recognition via Large-Scale Weak Supervision
Source code for the X Recommendation Algorithm
🤯 LobeHub - an open-source, modern design AI Agent Workspace. Supports multiple AI providers (OpenAI / Claude 4 / Gemini / DeepSeek / Ollama / Qwen), Knowledge Base (file upload / RAG ), one click …
A high-throughput and memory-efficient inference and serving engine for LLMs
A Chinese-language guide to prompting ChatGPT. Usage guides for various scenarios. Learn how to make it do what you ask.
The simplest, fastest repository for training/finetuning medium-sized GPTs.
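A desktop floating-window application that displays the current network speed and CPU and memory usage, with support for taskbar display and changeable skins.
Chinese word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, constituency parsing, semantic dependency parsing, semantic role labeling, coreference resolution, style transfer, semantic similarity, new word discovery, keyphrase extraction, automatic summarization, text classification and clustering, pinyin and simplified/traditional Chinese conversion, natural language processing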
🍒 Cherry Studio is a desktop client that supports multiple LLM providers.
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
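OpenUI lets you describe UI using your imagination, then see it rendered live.
This project aims to share the technical principles behind large models and hands-on experience with them (large-model engineering and deploying large-model applications).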
🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools
Fast and memory-efficient exact attention
SGLang is a fast serving framework for large language models and vision language models.
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
FlashMLA: Efficient Multi-head Latent Attention Kernels
Mainly records knowledge and interview questions relevant to large language model (LLM) algorithm (application) engineers.
Hackable and optimized Transformers building blocks, supporting a composable construction.
High-speed Large Language Model Serving for Local Deployment