Starred repositories
Free and Open Source Machine Translation API. Self-hosted, offline capable and easy to set up.
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
Synapse: Matrix homeserver written in Python/Twisted.
Run any open-source LLM, such as DeepSeek and Llama, as an OpenAI-compatible API endpoint in the cloud.
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
A framework for few-shot evaluation of language models.
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
ChatRWKV is like ChatGPT but powered by the RWKV (100% RNN) language model, and open source.
LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath
Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 17+ clouds, or on-prem).
Pythonic AI generation of images and videos
🕷️ An undetectable, powerful, flexible, high-performance Python library to make Web Scraping Easy and Effortless as it should be!
Code for the paper "Jukebox: A Generative Model for Music"
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models
A node-based image processing GUI aimed at making chaining image processing tasks easy and customizable. Born as an AI upscaling application, chaiNNer has grown into an extremely flexible and power…
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language models for tool learning.
AITemplate is a Python framework which renders neural networks into high-performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
A fast inference library for running LLMs locally on modern consumer-class GPUs
Universal and Transferable Attacks on Aligned Language Models
Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration