Starred repositories
NVIDIA Linux open GPU with P2P support
This project hosts security advisories and their accompanying proof-of-concepts related to research conducted at Google which impact non-Google owned code.
A high-throughput and memory-efficient inference and serving engine for LLMs
Power management, monitoring and VirtualSMC plugin for AMD processors
RAPL power capping C interface with multiple implementations
Tools for experimenting with Running Average Power Limit (RAPL)
Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).
A curated list of open-source projects related to DeepSeek Coder
DeepSeek Coder: Let the Code Write Itself
[NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…
Llama中文社区,实时汇总最新Llama学习资料,构建最好的中文Llama大模型开源生态,完全开源可商用
Get up and running with Kimi-K2.6, GLM-5.1, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
The VMware Architecture Migration Tool (VAMT) is designed to provide an easy and automated process to cold migrate machines between clusters of different architecture types within the same vCenter …
A Python package for extending the official PyTorch that can easily obtain performance on Intel platform
[HPCA 2026] AI Accelerator Benchmark focuses on evaluating AI Accelerators from a practical production perspective, including the ease of use and versatility of software and hardware.
Reference implementations of MLPerf® training benchmarks
Awesome-LLM-Benchmark: List of benchmarks for Large-Language Models
A collection of benchmarks and datasets for evaluating LLM.
Rcmp: Reconstructing RDMA-based Memory Disaggregation via CXL
A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM
AMD Ryzen™ AI Software includes the tools and runtime libraries for optimizing and deploying AI inference on AMD Ryzen™ AI powered PCs.