Skip to content
View srowen's full-sized avatar
🤠
🤠

Organizations

@apache @OryxProject

Block or report srowen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 2,783 236 Updated Dec 19, 2025

Curate better data for LLMs

Python 1,066 103 Updated Mar 19, 2024

A Data Streaming Library for Efficient Neural Network Training

Python 1,433 181 Updated Oct 27, 2025

The Stockfish testing framework

Python 324 139 Updated Dec 19, 2025

Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

Python 10,802 1,152 Updated Jun 30, 2023

This repo provides a customizable stack for starting new ML projects on Databricks that follow production best-practices out of the box.

Python 631 234 Updated Dec 12, 2025

The open source developer platform to build AI agents and models with confidence. Enhance your AI applications with end-to-end tracking, observability, and evaluations, all in one integrated platform.

Python 23,398 5,086 Updated Dec 21, 2025

XML data source for Spark SQL and DataFrames

Scala 513 227 Updated Aug 11, 2024

Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning

Java 1,784 404 Updated Aug 16, 2021

Code to accompany Advanced Analytics with Spark from O'Reilly Media

Scala 1,531 1,021 Updated Sep 25, 2024

ZXing ("Zebra Crossing") barcode scanning library for Java, Android

Java 33,766 9,427 Updated Dec 1, 2025