llm
Llama from scratch, or How to implement a paper without crying
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
Evaluate your LLM's response with Prometheus and GPT4 💯
Manage scalable open LLM inference endpoints in Slurm clusters
DevQualityEval: An evaluation benchmark 📈 and framework to compare and evolve the quality of code generation of LLMs.
Chinese edition of the Hugging Face Audio Course, helping learners get started quickly with the audio modality
Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models
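Large language model applications: RAG, NL2SQL, chatbots, pretraining, MoE (mixture-of-experts) models, fine-tuning, reinforcement learning, Tianchi data competitions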
The Universe of Evaluation. All about the evaluation for LLMs.
The first-ever Traditional Chinese Medicine large language model, "CMLM-ZhongJing" (仲景). Inspired by the profound wisdom of Zhang Zhongjing, the ancient master of Chinese medicine, it is a pretrained large language model built specifically for the field of traditional Chinese medicine.
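🧑🚀 Summary of the world's best LLM resources (multimodal generation, agents, coding assistance, AI paper review, data processing, model training, model inference, o1 models, MCP, small language models, vision-language models).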
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Quick Start for Large Language Models (Theoretical Learning and Practical Fine-tuning)
An Open Source Toolkit For LLM Distillation
A curated list of resources for using LLMs to develop more competitive grant applications.
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
🚀🚀 "Large model": train a small 64M-parameter GPT completely from scratch in just 2 hours! 🌏
A GPU cluster manager that configures and orchestrates inference engines like vLLM and SGLang for high-performance AI model deployment.
Composable building blocks to build LLM Apps
The all-in-one AI productivity accelerator. On device and privacy first with no annoying setup or configuration.
Simple, unified interface to multiple Generative AI providers
Model Context Protocol Servers
Hands-on Ollama: deploy large models on a CPU. Read online: https://datawhalechina.github.io/handy-ollama/
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI