Stars
Make Any Website & Tool Your CLI. A universal CLI Hub and AI-native runtime. Transform any website, Electron app, or local binary into a standardized command-line interface. Built for AI Agents to …
verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"
An AI-powered search engine with a generative UI
SearXNG is a free internet metasearch engine which aggregates results from various search services and databases. Users are neither tracked nor profiled.
Development repository for the Triton language and compiler
Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.5, DeepSeek, gpt-oss locally.
A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations
BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper)
Extremely fast Query Engine for DataFrames, written in Rust
Apache ORC - the smallest, fastest columnar storage for Hadoop workloads
Apache Doris is an easy-to-use, high performance and unified analytics database.
DuckDB is an analytical in-process SQL database management system
Data Agent Ready Warehouse : One for Analytics, Search, AI, Python Sandbox. — rebuilt from scratch. Unified architecture on your S3.
YugabyteDB - the cloud native distributed SQL database for mission-critical applications.
Collection of experiments to carve out the differences between two types of relational query processing engines: Vectorizing (interpretation based) engines and compiling engines.
Empowering everyone to build reliable and efficient software.
ClickHouse® is a real-time analytics database management system
A library that provides an embeddable, persistent key-value store for fast storage.
A distributed, fast open-source graph database featuring horizontal scalability and high availability
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
Alluxio, data orchestration for analytics and machine learning in the cloud
Apache Spark - A unified analytics engine for large-scale data processing
The official home of the Presto distributed SQL query engine for big data
CloudStack OSS Cloud.com/community - IaaS (Infrastructure as a Service)