Stars
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP, DNS, Elastic, Network, Virtualization. DevOps Interview Questions
A high-throughput and memory-efficient inference and serving engine for LLMs
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
You like pytorch? You like micrograd? You love tinygrad! ❤️
AIHawk aims to easy job hunt process by automating the job application process. Utilizing artificial intelligence, it enables users to apply for multiple jobs in a tailored way.
Official inference framework for 1-bit LLMs
Fully open reproduction of DeepSeek-R1
🤗 smolagents: a barebones library for agents that think in code.
Universal LLM Deployment Engine with ML Compilation
The official Python SDK for Model Context Protocol servers and clients
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
Open standard for machine learning interoperability
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…
Hackable and optimized Transformers building blocks, supporting a composable construction.
Open Source framework for voice and multimodal conversational AI
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
💬 Machine Learning Course with Python:
📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉
On-device AI across mobile, embedded and edge for PyTorch
Bi-directional Attention Flow (BiDAF) network is a multi-stage hierarchical process that represents context at different levels of granularity and uses a bi-directional attention flow mechanism to …
Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks