Lists (1)
Sort Name ascending (A-Z)
Stars
A Datacenter Scale Distributed Inference Serving Framework
PaperBanana: Automating Academic Illustration For AI Scientists
A comprehensive repository for Compute Express Link (CXL) resources: covering research papers, specifications, simulation/emulation tools, and benchmarks for Type 1, 2, and 3 devices.
A library for constructing allocators and memory pools. It also contains broadly useful abstractions and utilities for memory management. UMF allows users to manage multiple memory pools characteri…
OCEAN – Open-source CXL Emulation at Hyperscale Architecture and Networking.
Comprehensive open-source library of AI research and engineering skills for any AI model. Package the skills and your claude code/codex/gemini agent will be an AI research agent with full horsepowe…
AI-based stock analysis and trading system
bpftop provides a dynamic real-time view of running eBPF programs. It displays the average runtime, events per second, and estimated total CPU % for each program.
eBPF Hello World Program using ebpf-go framework
This repository contains the code and scripts for all experiments presented in our paper "LightDSA: Enabling Efficient DSA Through Hardware-Aware Transparent Optimization", submitted for the AE of …
The repo for Eurosys'26 paper -- LightDSA: Enabling Efficient DSA Through Hardware-Aware Transparent Optimization
Code / solutions for Mathematics for Machine Learning (MML Book)
casys-kaist / MTTM_ae_EuroSys26
Forked from Leechangjun1011/mttmMulti tenants tiered memory system
[WACV'26] ForestSplats: Deformable transient field for Gaussian Splatting in the Wild
Pgtune - tuning PostgreSQL config by your hardware
FlashInfer Bench @ MLSys 2026: Building AI agents to write high performance GPU kernels
CXLMemUring / Engram-cxl
Forked from deepseek-ai/EngramConditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models
📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models
NVIDIA Linux open GPU with P2P support
Generate eBPF programs and tracing with ChatGPT
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
Companion webpage to the book "Mathematics For Machine Learning"
"AI-Trader: Can AI Beat the Market?" Live Trading Bench: https://ai4trade.ai Tech Report Link: https://arxiv.org/abs/2512.10971