Stars
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data…
Production-Grade Container Scheduling and Management
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
😘 Makes you "love" GitHub by fixing broken images and slow loading when you access it. (No installation required)
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
Development repository for the Triton language and compiler
Introduction to Machine Learning Systems
Making text a first-class citizen in TensorFlow.
scikit-learn: machine learning in Python
A profiling and performance analysis tool for machine learning
📝 Algorithms and data structures implemented in JavaScript with explanations and links to further readings
The new Windows Terminal and the original Windows console host, all in the same place!
Implement a reasoning LLM in PyTorch from scratch, step by step
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
The Generative AI Landscape - A Collection of Awesome Generative AI Applications
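PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (PaddlePaddle core framework: high-performance single-machine and distributed training and cross-platform deployment for deep learning & machine learning)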
A command line tool and library for transferring data with URL syntax, supporting DICT, FILE, FTP, FTPS, GOPHER, GOPHERS, HTTP, HTTPS, IMAP, IMAPS, LDAP, LDAPS, MQTT, POP3, POP3S, RTMP, RTMPS, RTSP…
Open-source high-performance RISC-V processor
🏋️ Python / Modern C++ Solutions of All 3735 LeetCode Problems (Weekly Update)
A collection of design patterns/idioms in Python
Making large AI models cheaper, faster and more accessible
Vald. A Highly Scalable Distributed Vector Search Engine
12 Lessons to Get Started Building AI Agents
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
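cube studio: an open-source, cloud-native, one-stop machine learning / deep learning / large-model AI platform. Full MLOps algorithm pipeline, compute rental platform, online notebook development, drag-and-drop task-flow pipeline orchestration, multi-node multi-GPU distributed training, hyperparameter search, inference serving with vGPU virtualization, edge computing, annotation platform with automated labeling, SFT fine-tuning / reward model / reinforcement learning training for large models such as deepseek, multi-node large-model inference with vllm/ollama/mindie, private knowledge base, AI model marketplace…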
SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
🧑🚀 Summary of the world's best LLM resources (speech and video generation, agents, coding assistance, data processing, model training, model inference, o1 models, MCP, small language models, vision-language models).