Stars
- All languages
- Assembly
- BitBake
- C
- C#
- C++
- CMake
- CSS
- Cuda
- Dockerfile
- Eagle
- F#
- Go
- HCL
- HTML
- Haskell
- Java
- JavaScript
- JetBrains MPS
- Jupyter Notebook
- Kotlin
- LLVM
- MATLAB
- MDX
- MLIR
- Makefile
- NewLisp
- Nix
- Objective-C
- PHP
- PLpgSQL
- Pascal
- Perl
- PlantUML
- PowerShell
- Prolog
- Python
- R
- RobotFramework
- Roff
- Ruby
- Rust
- SCSS
- Sail
- Scala
- Scheme
- Shell
- Standard ML
- Swift
- SystemVerilog
- Tcl
- TeX
- Thrift
- TypeScript
- Verilog
- Visual Basic .NET
- Vue
- XSLT
- nesC
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
A Lightweight LLM Inference Performance Simulator
LLMServingSim 2.0: A Unified Simulator for Heterogeneous and Disaggregated LLM Serving Infrastructure
Accurate, large-scale, and extensible simulator for LLM inference Systems
Open Source Continuous Inference Benchmark Research Platform — Kimi K2.7-Code, MiniMax M3, DeepSeekv4, GLM5 - GB200 NVL72 vs MI355X vs B200 vs GB300 NVL72 & soon™ TPUv6e/v7/Trainium2/3
Offline optimization of your disaggregated Dynamo graph
FlagTree IR is forked from microsoft/triton-shared, which is a Shared Middle-Layer for Triton Compilation. It is used for FlagTree.
💫 Toolkit to help you get started with Spec-Driven Development
Heuristic Learning Blog Post
AI agents running research on single-GPU nanochat training automatically
A stand-alone implementation of several NumPy dtype extensions used in machine learning.
Allo Accelerator Design and Programming Framework (PLDI'24)
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are commi…
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
LLM 驱动的多市场股票智能分析系统:多源行情、实时新闻、决策看板与自动推送,支持零成本定时运行。 LLM-powered multi-market stock analysis system with multi-source market data, real-time news, decision dashboard, automated notifications, and cost…
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models
Development repository for the Triton language and compiler
Artifact Evaluation for ASPLOS '25 Exo 2 paper
Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels
Extremely fast Query Engine for DataFrames, written in Rust
⭐AI-driven public opinion & trend monitor with multi-platform aggregation, RSS, and smart alerts.🎯 告别信息过载,你的 AI 舆情监控助手与热点筛选工具!聚合多平台热点 + RSS 订阅,支持关键词精准筛选。AI 智能筛选新闻 + AI 翻译 + AI 分析简报直推手机,也支持接入 MCP 架构…
A framework for building hardware verification platform using software method
A library for efficient similarity search and clustering of dense vectors.
Use pytest's runner to discover and execute C++ tests