partitioning

Here are 44 public repositories matching this topic...

Ahm-rgb / Alpha-SQL

Official repository for the paper "Alpha-SQL: Zero-Shot Text-to-SQL using Monte Carlo Tree Search" [ICML'25]

javascript mysql php json time-series visio mariadb plsql pyspark data-structures partitioning stocks-api text-to-sql icml-2025

Updated Nov 12, 2025
Python

dhdaines / playa

Sponsor

Star

Parallel and LAzY Analyzer for PDFs 🏖️

pdf etl information-extraction partitioning

Updated Nov 12, 2025
Python

mtholahan / apache-spark-optimization-mini-project

Star

Optimized PySpark jobs by analyzing query execution plans and rewriting transformations for efficiency. Applied techniques such as reducing shuffles, tuning partitions, selecting efficient operators, and choosing optimal data formats. Demonstrates performance tuning for large-scale Spark ETL workloads using Python and PySpark.

python performance big-data spark etl optimization pyspark data-engineering bootcamp partitioning springboard data-pipeline

Updated Nov 11, 2025
Python

deepgraph / deepgraph

Star

Analyze Data with Pandas-based Networks. Documentation:

graphviz data-science data-mining network graphs parallel pandas data-visualization data-structures networkx graph-theory data-analysis graph-database partitioning network-visualization iterative-methods network-analysis interfacing multilayer-networks

Updated Nov 9, 2025
Python

mandibchaulagain / Football_MySQL_DB

Star

Built a MySQL DB from scratch for the purpose of serving football stat api

mysql views database football partitioning joins aggregation indexes triggers normalization materialized-views

Updated Nov 6, 2025
Python

SpiNNakerManchester / PACMAN

Star

Partition and Configuration Manager for SpiNNaker

python routing spinnaker partitioning placement

Updated Nov 12, 2025
Python

drprojects / superpoint_transformer

Star

Official PyTorch implementation of Superpoint Transformer introduced in [ICCV'23] "Efficient 3D Semantic Segmentation with Superpoint Transformer" and SuperCluster introduced in [3DV'24 Oral] "Scalable 3D Panoptic Segmentation As Superpoint Graph Clustering"

fast lightweight deep-learning efficient point-cloud pytorch transformer partition hierarchical partitioning semantic-segmentation 3d graph-clustering panoptic-segmentation superpoint iccv2023 3dv2024

Updated Oct 24, 2025
Python

zumaa-tep / Convex-Fair-Optimize

Star

โปรเจกต์เกี่ยวกับอัลกอริทึมที่ใช้ในการแบ่งพื้นที่ และเส้นรอบรูปให้เท่ากัน

optimization polygon partitioning convex

Updated Oct 19, 2025
Python

singhakshitraj / chatapp

Star

A scalable real-time chat backend built with FastAPI, Redis, and Celery. It supports one-to-one and group chats, offline message persistence, user tracking through Redis, and rate limiting with a sliding window algorithm. The system uses Celery tasks queue for throttled email notifications and maintains partitioned PostgreSQL message tables.

redis postgresql rate-limiting email-sender celery partitioning throttling fastapi

Updated Oct 14, 2025
Python

OzFlux / PyFluxPro

Star

PyFluxPro V3.4 is a significant upgrade from previous versions. It has several new features, improved stability and is introduced ahead of the 2021 OzFlux Data Workshop.

processing flux data gap partitioning filling

Updated Oct 9, 2025
Python

lapets / parts

Star

Minimal library that enables partitioning of iterable collections in a concise manner.

python lists containers split splitting data-structures partition partitioning common-library python-iterables python-containers

Updated Sep 16, 2025
Python

chhuang216 / realtime-data-pipeline

Star

Windows-first PySpark batch pipeline: ingest raw → bronze Parquet, run DQ checks, publish curated silver. PowerShell wrapper adds Spark hygiene, parallelism controls, and step logs.

github git windows airflow etl powershell logging pyspark parquet partitioning powershell-script pyarrow dq bronze-silver-gold

Updated Sep 4, 2025
Python

lucaslopes / hedonic-game

Star

Hedonic Games for Network Clustering

graph clustering network game-theory partitioning hedonic-games

Updated Sep 19, 2025
Python

nagaraju-12 / pyspark-optimization-topics

Star

This project demonstrates key PySpark performance optimization techniques using a synthetic banking transactions dataset (~5,000 records). Built using Databricks and Delta Lake.