Highlights
- Pro
Starred repositories
An Open Source Machine Learning Framework for Everyone
FlatBuffers: Memory Efficient Serialization Library
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
CUDA Templates and Python DSLs for High-Performance Linear Algebra
This is an online course where you can learn and master the skill of low-level performance analysis and tuning.
Motion imitation with deep reinforcement learning.
Mirage Persistent Kernel: Compiling LLMs into a MegaKernel
Necessary material to build and use Zooids to create Swarm User Interfaces
The Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs
LLMs as Copilots for Theorem Proving in Lean
VUDA is a header-only library based on Vulkan that provides a CUDA Runtime API interface for writing GPU-accelerated applications.
cmix is a lossless data compression program aimed at optimizing compression ratio at the cost of high CPU/memory usage.
ROS packages for Jaco2 and Mico robotic arms
C++ RRT (Rapidly-exploring Random Tree) Implementation
Stonefish - an advanced C++ simulation library designed for (but not limited to) marine robotics.
High-Performance Linear Algebra-based Graph Primitives on GPUs
Leopard-RS : O(N Log N) MDS Reed-Solomon Block Erasure Code for Large Data
Library to support implementation of language specific ROS Client Libraries.