Skip to content
View jcrobak's full-sized avatar

Block or report jcrobak

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Command-line JSON processor

C 33,188 1,683 Updated Dec 17, 2025

Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.

Python 18,607 2,446 Updated May 16, 2025

Build mobile apps with simple HTML, CSS, and JavaScript components.

CSS 14,651 1,433 Updated Mar 18, 2025

A modular SQL linter and auto-formatter with support for multiple dialects and templated code.

Python 9,373 915 Updated Dec 12, 2025

Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive lea…

C++ 8,632 1,931 Updated Oct 17, 2024

An animated number component for React, Vue, Svelte, and TS/JS.

TypeScript 6,865 132 Updated Nov 29, 2025

A tool for generating .pex (Python EXecutable) files, lock files and venvs.

Python 4,143 309 Updated Dec 16, 2025

Apache Avro is a data serialization system.

Java 3,191 1,719 Updated Dec 14, 2025

MongoDB Connector for Hadoop

Java 1,603 595 Updated Jan 28, 2022

Distributed Stream Processing

Pony 1,486 67 Updated Apr 6, 2021

Development repository for the docker cookbook

Ruby 1,383 789 Updated Dec 15, 2025

Mirror of Apache Pig

Java 688 446 Updated Sep 15, 2025

A fully asynchronous, non-blocking, thread-safe, high-performance HBase client.

Java 609 300 Updated May 19, 2023

Microbenchmarking and performance regression testing framework for the JVM platform.

Scala 512 70 Updated Aug 22, 2022

A Scala productivity framework for Hadoop.

Scala 482 94 Updated Jul 1, 2022

Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool f…

Python 479 41 Updated Mar 13, 2025

S3 transport for APT

Python 141 75 Updated May 6, 2025

A fast Graphite relay

Scala 91 12 Updated Aug 2, 2019

A pure Python protobuf parser

Python 60 30 Updated Nov 5, 2025

Snappy compression for Hadoop

Java 40 25 Updated Jun 18, 2015

Maven 2 Plugin for processing Apache Avro files. Avro is a subproject of Apache Hadoop.

Java 34 14 Updated Oct 1, 2010

Simple reST/wiki system

CSS 28 14 Updated Jun 5, 2014

TraitBasis applied to TauBench

Python 14 1 Updated Nov 11, 2025

Code for reproducing the CatAttack pipeline of adversarial triggers for reasoning models

Python 12 1 Updated Oct 3, 2025

An IntelliJ Pig Language Plugin

Graphviz (DOT) 9 2 Updated Apr 10, 2013

Generate high-fidelity, realistic simulation rollouts for AI Agents training and testing

8 3 Updated Oct 22, 2025

Dang. (NOTE: This project has moved to https://github.com/getcloudless/cloudless)

Python 5 Updated Oct 18, 2019

Chef repository for my computing environment

Ruby 4 4 Updated Nov 5, 2023

A Scala productivity framework for Hadoop.

Scala 3 Updated Dec 20, 2012
Next