- London
Stars
Disaggregated serving system for Large Language Models (LLMs).
A ChatGPT(GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems
A high-throughput and memory-efficient inference and serving engine for LLMs
Bringing stable diffusion models to web browsers. Everything runs inside the browser with no server support.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
A latent text-to-image diffusion model
CUDA integration for Python, plus shiny features
CUDA Python: Performance meets Productivity
CGBN: CUDA Accelerated Multiple Precision Arithmetic (Big Num) using Cooperative Groups
A testing framework for Cobol applications
LLM training code for Databricks foundation models
Share, discover, and collect prompts from the community. Free and open source — self-host for your organization with complete privacy.
Handy toolbelt to deal nicely with offline/online connectivity in a React Native app. Smooth redux integration
Source code for the X Recommendation Algorithm
oneAPI Threading Building Blocks (oneTBB)
A minimalistic and high-performance SAT solver
A collection of Jupyter notebooks developed by the community showing how to use Qiskit
JavaScript implementation of CRYSTALS-KYBER (version 3) post-quantum key exchange algorithm.
An open-source cross-platform alternative to AirDrop
C library for prototyping and experimenting with quantum-resistant cryptography