Stars
Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
Tensors and Dynamic neural networks in Python with strong GPU acceleration
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
A simple Java library for interacting with Ollama server.
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)
The swiss army knife of lossless video/audio editing
The official repository for ERNIE 4.5 and ERNIEKit – its industrial-grade development toolkit based on PaddlePaddle.
Anserini is a Lucene toolkit for reproducible information retrieval research
Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.
The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.
A library for efficient similarity search and clustering of dense vectors.
Models and examples built with TensorFlow
Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages
The interactive graphing library for Python ✨
🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools
Official Python client for Elasticsearch
DSPy: The framework for programming—not prompting—language models
A topic-centric list of HQ open datasets.
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
📚 Freely available programming books
Fast computation of Krippendorff's alpha agreement measure in Python.
Collection of extracted System Prompts from popular chatbots like ChatGPT, Claude & Gemini