Stars
verl: Volcano Engine Reinforcement Learning for LLMs
Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-re…
a local implementation of OpenAI Assistants API: myla stands for MY Local Assistant
Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization
A high-throughput and memory-efficient inference and serving engine for LLMs
Instruct-tune LLaMA on consumer hardware
🦜🔗 The platform for reliable agents.
Python actor framework for heterogeneous computing.
Scalable Python DS & ML, in an API compatible & lightning fast way.
xorbitsai / mars
Forked from mars-project/marsMars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.