jiaoew1991 🤞

138 results for source starred repositories

The lance extensions for DuckDB enable reading and writing of lance tables.

C++ 70 6 Updated Feb 4, 2026
6 1 Updated Feb 4, 2026
Python 3 Updated Jan 13, 2026

ZeroFS - The Filesystem That Makes S3 your Primary Storage. ZeroFS is 9P/NFS/NBD on top of S3. Initially built for www.merklemap.com

Rust 1,649 60 Updated Jan 28, 2026

High-performance distributed multi-tier cache system. Built in Rust.

Rust 578 73 Updated Feb 5, 2026

A cloud native embedded storage engine built on object storage.

Rust 2,698 188 Updated Feb 5, 2026

LakeSail's computation framework with a mission to unify batch processing, stream processing, and compute-intensive AI workloads.

Rust 1,161 81 Updated Feb 5, 2026

Spark integrations for working with Lance datasets

Java 41 39 Updated Feb 4, 2026

Integration between Lance and Ray for distributed data processing

Python 20 21 Updated Feb 5, 2026

Declarative context engineering for agents

Python 436 28 Updated Feb 5, 2026

The observability platform for Iceberg lakehouses.

TypeScript 436 24 Updated Jan 12, 2026

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

Python 3,501 279 Updated Feb 5, 2026

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 9,698 1,003 Updated Feb 4, 2026

A lightweight data processing framework built on DuckDB and 3FS.

Python 4,921 443 Updated Mar 5, 2025

Perforator is a cluster-wide continuous profiling tool designed for large data centers

C++ 3,382 153 Updated Feb 5, 2026

The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention

Python 3,328 315 Updated Jul 7, 2025

Lakekeeper is an Apache-Licensed, secure, fast and easy to use Apache Iceberg REST Catalog written in Rust.

Rust 1,170 118 Updated Feb 4, 2026

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 5,828 322 Updated Feb 5, 2026

Apache DataFusion Ray

Python 229 25 Updated Oct 5, 2025

A collection of RBIR projects and posts for anyone interested in joining this journey.

Rust 306 11 Updated Feb 5, 2026

Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, a…

Rust 6,009 541 Updated Feb 5, 2026

Use your Neovim like using Cursor AI IDE!

Lua 17,307 794 Updated Feb 3, 2026

Eclipse Theia is a cloud & desktop IDE framework implemented in TypeScript.

TypeScript 21,341 2,786 Updated Feb 5, 2026

World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.

Java 2,758 719 Updated Feb 5, 2026

New file format for storage of large columnar datasets.

C++ 687 63 Updated Feb 4, 2026

High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale

Rust 5,192 393 Updated Feb 5, 2026

Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.

Scala 1,506 567 Updated Feb 5, 2026

Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics

C++ 16,482 4,003 Updated Feb 5, 2026

Apache OpenDAL: One Layer, All Storage.

Rust 4,879 710 Updated Feb 3, 2026

Alluxio, data orchestration for analytics and machine learning in the cloud

Java 7,151 2,954 Updated Apr 29, 2025