Stars
AI agents running research on single-GPU nanochat training automatically
Code for ALBEF: a new vision-language pre-training method
ModelScope: bring the notion of Model-as-a-Service to life.
主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...)…
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
An open-source tool-augmented conversational language model from Fudan University
Easy-to-use and powerful LLM and SLM library with awesome model zoo.