Skip to content
View dragonH's full-sized avatar
🎯
Focusing 🎉
🎯
Focusing 🎉

Block or report dragonH

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
10 stars written in Scala
Clear filter

Source code for the X Recommendation Algorithm

Scala 67,755 12,627 Updated Sep 8, 2025

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 8,386 1,951 Updated Nov 11, 2025

State of the Art Natural Language Processing

Scala 4,067 733 Updated Nov 10, 2025

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

Scala 3,537 573 Updated Nov 4, 2025

Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.

Scala 2,268 971 Updated Nov 8, 2025

Mirror of Apache griffin

Scala 1,176 590 Updated Aug 3, 2025

A Spark plugin for reading and writing Excel files

Scala 515 162 Updated Nov 3, 2025

Apache Spark Connector for SQL Server and Azure SQL

Scala 286 132 Updated Feb 27, 2025

Plug-and-play implementation of an Apache Spark custom data source for AWS DynamoDB.

Scala 176 93 Updated Mar 6, 2021

Performant Redshift data source for Apache Spark

Scala 140 65 Updated Oct 15, 2025