Skip to content
View Deegue's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report Deegue

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

cuDF - GPU DataFrame Library

C++ 9,678 1,070 Updated Jun 23, 2026

Apache DataFusion SQL Query Engine

Rust 8,908 2,176 Updated Jun 23, 2026

Empowering everyone to build reliable and efficient software.

Rust 114,042 14,989 Updated Jun 23, 2026

Axiom is a set of reusable and extensible components designed to be compatible with Velox. Its primary purpose is to simplify the process of building front-ends for query execution powered by Velox.

C++ 74 79 Updated Jun 23, 2026
C++ 163 80 Updated Jun 22, 2026

An extensible, state-of-the-art framework for columnar compression, and the fastest FOSS columnar file format. Formerly at @spiraldb, now an Incubation Stage project at LFAI&Data, part of the Linux…

Rust 3,031 173 Updated Jun 22, 2026

The Auron accelerator for distributed computing framework (e.g., Spark) leverages native vectorized execution to accelerate query processing

Rust 1,772 226 Updated Jun 22, 2026

Real-time analytics on Postgres tables

Rust 1,978 70 Updated Mar 31, 2026

open-source agentic AI data assistant for the next generation of AI + Data products.

Python 19,056 2,746 Updated Jun 19, 2026

Pretrain, finetune and serve LLMs on Intel platforms with Ray

Python 130 35 Updated Sep 23, 2025

RayDP provides simple APIs for running Spark on Ray and integrating Spark with AI libraries.

Scala 373 82 Updated Jun 10, 2026

Power CLI and Workflow manager for LLMs (core package)

Python 3,719 468 Updated Apr 30, 2026

LLM inference in C/C++

C++ 117,706 19,825 Updated Jun 22, 2026

Data Agent Ready Warehouse : One for Analytics, Search, AI, Python Sandbox. — rebuilt from scratch. Unified architecture on your S3.

Rust 9,348 885 Updated Jun 22, 2026

ClickBench: a Benchmark For Analytical Databases

HTML 1,022 288 Updated Jun 22, 2026

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 42,974 7,714 Updated Jun 23, 2026

A memory profiler for Linux.

C 4,791 202 Updated Jul 28, 2023

Graphs for Everyone

Java 16,770 2,634 Updated Jun 8, 2026

LingoDB: A new analytical database system that blurs the lines between databases and compilers.

C++ 309 63 Updated Jun 22, 2026

A modular acceleration toolkit for big data analytic engines

C++ 66 25 Updated May 6, 2024

Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.

Scala 255 73 Updated Feb 21, 2023

The Fastest Distributed Database for Transactional, Analytical, and AI Workloads.

C++ 10,159 1,903 Updated Jun 23, 2026

JDK main-line development https://openjdk.org/projects/jdk

Java 23,018 6,356 Updated Jun 22, 2026

JuiceFS is a distributed POSIX file system built on top of Redis and S3.

Go 14,075 1,238 Updated Jun 22, 2026

hera 分布式任务调度系统 大数据任务调度系统 任务调度 (数据部门专用)

Java 378 98 Updated Aug 14, 2023

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discr…

Python 8,843 1,428 Updated Jan 28, 2026

Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.

Python 3,128 1,491 Updated Jun 22, 2026

BigDL: Distributed TensorFlow, Keras and PyTorch on Apache Spark/Flink & Ray

Jupyter Notebook 2,699 730 Updated Jun 12, 2026

Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive

Scala 191 35 Updated Oct 15, 2025

Byzer (former MLSQL): A low-code open-source programming language for data pipeline, analytics and AI.

Scala 1,837 539 Updated May 29, 2024
Next