Stars
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of …
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 17+ clouds, or on-prem).
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/READ…
💥💻💥 A data-parallel functional programming language
The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic interface.
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data…
A high-throughput and memory-efficient inference and serving engine for LLMs
ZenML 🙏: MLOps for Reliable AI: from Classical ML to Agents. https://zenml.io.
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…
Fast and Accurate ML in 3 Lines of Code
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Making large AI models cheaper, faster and more accessible
🚀 Efficient implementations of state-of-the-art linear attention models
A PyTorch native platform for training generative AI models
DSPy: The framework for programming—not prompting—language models
The definitive Web UI for local AI, with powerful features and easy setup.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
A curated list of insanely awesome libraries, packages and resources for Quants (Quantitative Finance)
Large Language Model Text Generation Inference
đź’« Industrial-strength Natural Language Processing (NLP) in Python
A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning
Langflow is a powerful tool for building and deploying AI-powered agents and workflows.
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…