Skip to content
View pariksheet's full-sized avatar

Block or report pariksheet

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Pythonic Programming Framework to orchestrate jobs in Databricks Workflow

Python 222 62 Updated Dec 3, 2025

Python framework for building efficient data pipelines. It promotes modularity and collaboration, enabling the creation of complex pipelines from simple, reusable components.

Python 650 37 Updated Dec 17, 2025

Oxford Deep NLP 2017 course

15,864 3,581 Updated Jul 2, 2023

📖 A curated list of resources dedicated to Natural Language Processing (NLP)

18,054 2,726 Updated Sep 13, 2025

A curated list of awesome Deep Learning (DL) for Natural Language Processing (NLP) resources

1,303 255 Updated Jan 5, 2023

A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning

19,815 2,490 Updated Dec 20, 2025

Machine Learning and Agentic AI Resources, Practice and Research

Python 4,544 1,654 Updated Nov 2, 2025

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 8,477 1,975 Updated Dec 20, 2025

Alluxio, data orchestration for analytics and machine learning in the cloud

Java 7,132 2,956 Updated Apr 29, 2025

VSCode extension to work with Databricks

TypeScript 131 27 Updated Dec 12, 2025

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

Scala 3,558 576 Updated Nov 4, 2025

Mirror of Apache griffin

Scala 1,175 590 Updated Aug 3, 2025

Apache TinkerPop - a graph computing framework

Java 2,092 844 Updated Dec 19, 2025

Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics

C++ 16,295 3,955 Updated Dec 22, 2025

Mirror of Apache Toree (Incubating)

Scala 749 227 Updated Nov 27, 2025