Skip to content
View pariksheet's full-sized avatar

Block or report pariksheet

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Pythonic Programming Framework to orchestrate jobs in Databricks Workflow

Python 227 66 Updated Apr 20, 2026

Python framework for building efficient data pipelines. It promotes modularity and collaboration, enabling the creation of complex pipelines from simple, reusable components.

Python 651 39 Updated Apr 1, 2026

Oxford Deep NLP 2017 course

15,856 3,562 Updated Jul 2, 2023

📖 A curated list of resources dedicated to Natural Language Processing (NLP)

18,476 2,792 Updated Apr 6, 2026

A curated list of awesome Deep Learning (DL) for Natural Language Processing (NLP) resources

1,305 255 Updated Jan 24, 2026

A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning

20,476 2,568 Updated Apr 24, 2026

Machine Learning and Agentic AI Resources, Practice and Research

Python 4,738 1,701 Updated Apr 20, 2026

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 8,768 2,084 Updated Apr 30, 2026

Alluxio, data orchestration for analytics and machine learning in the cloud

Java 7,190 2,944 Updated Apr 29, 2025

VSCode extension to work with Databricks

TypeScript 134 27 Updated Mar 31, 2026

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

Scala 3,614 583 Updated Apr 30, 2026

Mirror of Apache griffin

Scala 1,170 586 Updated Aug 3, 2025

Apache TinkerPop - a graph computing framework

Java 2,120 849 Updated Apr 30, 2026

Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics

C++ 16,704 4,089 Updated Apr 30, 2026

Mirror of Apache Toree (Incubating)

Scala 750 226 Updated Apr 2, 2026