- All languages
- Assembly
- Batchfile
- C
- C#
- C++
- CMake
- CSS
- Clojure
- CoffeeScript
- Common Workflow Language
- Cuda
- Cython
- Dockerfile
- Fortran
- GAP
- Go
- HCL
- HTML
- Haskell
- Java
- JavaScript
- Jsonnet
- Julia
- Jupyter Notebook
- Kotlin
- Limbo
- Lua
- MATLAB
- MDX
- MLIR
- Makefile
- Markdown
- Objective-C
- Objective-C++
- OpenEdge ABL
- PHP
- PLSQL
- Perl
- PostScript
- PureBasic
- Python
- QML
- R
- Roff
- Ruby
- Rust
- SCSS
- Sass
- Scala
- Shell
- SourcePawn
- Svelte
- Swift
- SystemVerilog
- TSQL
- TeX
- Thrift
- TypeScript
- Vim Script
- Vue
- WebAssembly
- Zig
Starred repositories
A modern Rust template for developing DuckDB extensions
Lock-free MPSC channel in Zig achieving 50+ billion messages/second via ring-decomposed architecture
Embedding Atlas is a tool that provides interactive visualizations for large embeddings. It allows you to visualize, cross-filter, and search embeddings and metadata.
In Pursuit of Pixel Supervision for Visual Pre-training
JAX in JavaScript – an ML library for the web, running on WebGPU & Wasm
An extensible, state of the art columnar file format. Formerly at @spiraldb, now an Incubation Stage project at LFAI&Data, part of the Linux Foundation.
Lightweight and extensible compatibility layer between dataframe libraries!
Apache DataFusion Ballista Distributed Query Engine
Pixeltable — Data Infrastructure providing a declarative, incremental approach for multimodal AI workloads.
Task and time-tracking management with calendar integration for Obsidian
DuckDB community extension for locality-sensitive hashing (LSH)
The official Python client for the Hugging Face Hub.
Database connectivity API standard and libraries for Apache Arrow
Evolution Pretraining Fully in Int Formats
Jax Codebase for Evolutionary Strategies at the Hyperscale
Visionary: The World Model Carrier Built on WebGPU-Powered Gaussian Splatting Platform
[NeurIPS 2025 Oral] Representation Entanglement for Generation: Training Diffusion Transformers Is Much Easier Than You Think
Glance: Accelerating Diffusion Models with 1 Sample
RealGen: Photorealistic Text-to-Image Generation via Detector-Guided Rewards.
High-Performance Engine for Multi-Vector Search
[NeurIPS 2025 Oral]Infinity⭐️: Unified Spacetime AutoRegressive Modeling for Visual Generation
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
TRivia: Self-supervised Fine-tuning of Vision-Language Models for Table Recognition