Skip to content
View holdenk's full-sized avatar

Organizations

@sparklingpandas @high-performance-spark @scalingpythonml @PigsCanFlyLabs

Block or report holdenk

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

🪴 Nebari - your open source data science platform

Python 323 111 Updated Mar 19, 2026

State-of-the-art TTS model under 25MB 😻

Python 13,154 724 Updated Mar 19, 2026

MESH intercom for motorcycle or general communication

32 1 Updated Mar 19, 2026

Learn to use delta-connect without any headaches.

Jupyter Notebook 5 Updated Sep 19, 2025

Lets make a universal app for OpenStreetMap

TypeScript 355 32 Updated Mar 18, 2026

Fast and accurate automatic speech recognition (ASR) for edge devices

C 7,485 377 Updated Mar 19, 2026

Encrypt files uploaded to a Django application.

Python 7 1 Updated Jun 19, 2022

Let's RAG it RAW without fancy frameworks

Jupyter Notebook 27 2 Updated Sep 15, 2024

A collection of learning resources for curious software engineers

Python 50,686 3,971 Updated Mar 23, 2026

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,925 605 Updated May 3, 2024

pyspark methods to enhance developer productivity 📣 👯 🎉

Python 685 97 Updated Mar 6, 2025

Apache Spark Connect Client for Golang

Go 247 48 Updated Oct 13, 2025

A Python Library to support running data quality rules while the spark job is running⚡

Python 201 62 Updated Mar 20, 2026

A tool to validate data, built around Apache Spark.

Scala 101 33 Updated Feb 19, 2026

8-bit CUDA functions for PyTorch, modified to build on Jetson Xavier

C 15 9 Updated Apr 26, 2023

LLM finetuned for medical question answering

Python 556 72 Updated Sep 7, 2023

English SDK for Apache Spark

Python 876 135 Updated Jun 12, 2024

Python Stream Processing

Python 1,965 107 Updated Mar 27, 2025

A modular implementation of timely dataflow in Rust

Rust 3,592 293 Updated Mar 21, 2026

State of the Art Natural Language Processing

Scala 4,117 739 Updated Mar 23, 2026

Your self-hosted, globally interconnected microblogging community

Ruby 49,788 7,422 Updated Mar 24, 2026

A POC for multilingual UDFs in KSQL

Shell 3 Updated Mar 16, 2019

TiDB - the open-source, cloud-native, distributed SQL database designed for modern applications.

Go 39,918 6,157 Updated Mar 24, 2026

Prototype implementation of Service-Level Fault Injection Testing in Python.

Python 73 3 Updated Nov 5, 2022

Replaces the factory firmware on the SwitchBot Plug Mini via OTA, enabling the use of Tasmota without disassembling the unit.

C 143 20 Updated Jul 21, 2024

A Label Printer Application

C 324 38 Updated Feb 4, 2026

lakeFS - Data version control for your data lake | Git for data

Go 5,210 437 Updated Mar 24, 2026
Scala 12 Updated Sep 20, 2023

Java imap nio client that is designed to scale well for thousands of connections per machine and reduce contention when using large number of threads and cpus.

Java 63 47 Updated Feb 11, 2025
Next