Skip to content
View srowen's full-sized avatar
🤠
🤠

Organizations

@apache @OryxProject

Block or report srowen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 255 10 Updated Apr 17, 2026

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 3,115 273 Updated May 26, 2026

Curate better data for LLMs

Python 1,071 105 Updated Mar 19, 2024

A Data Streaming Library for Efficient Neural Network Training

Python 1,525 199 Updated Feb 2, 2026

The Stockfish testing framework

Python 340 154 Updated Jun 21, 2026

Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

Python 10,797 1,139 Updated Jun 30, 2023

This repo provides a customizable stack for starting new ML projects on Databricks that follow production best-practices out of the box.

Python 690 257 Updated Jun 12, 2026

The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while control…

Python 26,682 5,887 Updated Jun 22, 2026

XML data source for Spark SQL and DataFrames

Scala 512 223 Updated Aug 11, 2024

Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning

Java 1,783 401 Updated Aug 16, 2021

Code to accompany Advanced Analytics with Spark from O'Reilly Media

Scala 1,523 1,014 Updated Sep 25, 2024

ZXing ("Zebra Crossing") barcode scanning library for Java, Android

Java 33,991 9,433 Updated Jun 22, 2026