Skip to content
View sonalgoyal's full-sized avatar

Organizations

@zinggAI

Block or report sonalgoyal

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Scalable identity resolution, entity resolution, data mastering and deduplication using ML

Java 1,190 162 Updated Apr 30, 2026

A Business Catalog for Unity Catalog

Python 161 43 Updated Apr 28, 2026

AssetOpsBench - Industry 4.0

Python 1,402 225 Updated Apr 27, 2026

In-memory Java DataFrame library

Java 314 30 Updated Apr 28, 2026
Python 1 5 Updated Nov 13, 2025

Moving data tables from one account to another

Python 5 1 Updated Jan 21, 2025

Example project using Zingg on Databricks

Jupyter Notebook 3 Updated Jan 2, 2025

Run TUIs and terminals in your browser

Python 1,395 37 Updated Aug 30, 2024

Non-Metric Space Library (NMSLIB): An efficient similarity search library and a toolkit for evaluation of k-NN methods for generic non-metric spaces.

C++ 3,579 462 Updated Apr 13, 2026

JVector: the most advanced embedded vector search engine

Java 1,710 153 Updated Apr 27, 2026

Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and knowledge-based reasoning tasks.

Python 347 26 Updated Jun 16, 2024

Semantic Matrix Operations

Roff 1 Updated Mar 5, 2025

An example of SparkConnect extension.

Java 15 3 Updated Mar 5, 2024

Zingg fuzzy matching for products using metadata and images

Python 9 Updated May 20, 2024

Snowflake Snowpark Java & Scala API

Scala 23 23 Updated Apr 24, 2026

An End-to-End Evaluation Framework for Entity Resolution Systems

Python 36 11 Updated Dec 3, 2023

What's in your data? Extract schema, statistics and entities from datasets

Python 1,554 186 Updated Apr 7, 2026

Schema modelling framework for decentralised domain-driven ownership of data.

Java 261 17 Updated Dec 5, 2023

Translating text attributes (like name, address, phone number) into quantifiable numerical representations Training ML models to determine if these numerical labels form a match Scoring the confide…

Python 32 8 Updated Mar 4, 2024

lakeFS - Data version control for your data lake | Git for data

Go 5,265 446 Updated Apr 25, 2026

A collection of research papers and software related to explainability in graph machine learning.

1,986 136 Updated Apr 4, 2022

Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets

Python 4,952 483 Updated Apr 27, 2026

A completely-from-scratch hobby operating system: bootloader, kernel, drivers, C library, and userspace including a composited graphical UI, dynamic linker, syntax-highlighting text editor, network…

C 6,690 541 Updated Apr 27, 2026

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…

TypeScript 13,734 2,069 Updated Apr 30, 2026

A system for quickly generating training data with weak supervision

Python 5,957 854 Updated Apr 10, 2026

Examples showing real-life use cases for fal + dbt

Jupyter Notebook 22 4 Updated Apr 27, 2022

do more with dbt. dbt-fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies and build machine learning models.

Python 856 76 Updated Apr 5, 2024

Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such …

JavaScript 809 55 Updated Aug 10, 2022

DuckDB is an analytical in-process SQL database management system

C++ 37,849 3,192 Updated Apr 30, 2026
Next