mengxr


A project to map out the relations between different equational theories of Magmas.

Lean 464 87 Updated Dec 16, 2025

A programming framework for agentic AI. Discord: https://discord.gg/pAbnFJrkgZ

Jupyter Notebook 137 26 Updated Feb 5, 2025

English SDK for Apache Spark

Python 879 136 Updated Jun 12, 2024

Numbers every LLM developer should know

4,277 140 Updated Jan 16, 2024

A curated list of practical guides and resources for LLMs (LLMs Tree, Examples, Papers)

10,128 781 Updated May 31, 2024
TypeScript 61 20 Updated Apr 15, 2024

Spark DL Inferencing using external frameworks

Shell 6 Updated May 9, 2023

Databricks Terraform Provider

Go 561 480 Updated Dec 22, 2025

Reference code base for ML Engineering, Manning Publications

Jupyter Notebook 132 42 Updated Jul 16, 2021
Python 28 18 Updated Nov 5, 2025

A high performance and generic framework for distributed DNN training

Python 3,715 494 Updated Oct 3, 2023

Joblib Apache Spark Backend

Python 249 25 Updated Apr 7, 2025

Koalas: pandas API on Apache Spark

Python 3,367 366 Updated Mar 20, 2024

Julia package to compute statistics on streams of data

Julia 3 1 Updated Oct 24, 2017

Spark Exercise

Scala 6 1 Updated May 26, 2014

IntelliJ Jsonnet Plugin

Java 89 17 Updated Mar 9, 2024

Spark data source for Salesforce

Scala 81 68 Updated May 23, 2024

Rules engine for cloud security, cost optimization, and governance, with a YAML DSL for policies that query, filter, and take actions on resources

Python 5,885 1,585 Updated Dec 23, 2025

(Legacy) Command Line Interface for Databricks

Python 397 234 Updated Oct 5, 2023

Spark package for checking data quality

Scala 222 67 Updated Feb 28, 2020

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Python 43,637 16,152 Updated Dec 25, 2025

Fast, flexible and powerful server providing access to R from many languages and systems

C 291 65 Updated Dec 15, 2025

Code for Quartz Scheduler

Java 6,657 1,984 Updated Dec 17, 2025

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discr…

Python 8,562 1,391 Updated Oct 14, 2025

Generic Implementation of Consensus ADMM over Spark

Python 84 20 Updated Jul 8, 2016

R interface for Apache Spark

R 970 308 Updated Dec 22, 2025

A scalable machine learning library on Apache Spark

Terra 796 176 Updated Aug 30, 2021

Apache Superset is a Data Visualization and Data Exploration Platform

TypeScript 69,539 16,376 Updated Dec 24, 2025

[DEPRECATED] TensorFlow wrapper for DataFrames on Apache Spark

Scala 745 160 Updated Jul 30, 2024