Skip to content
View sonalgoyal's full-sized avatar

Organizations

@zinggAI

Block or report sonalgoyal

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
69 results for source starred repositories
Clear filter

Scalable identity resolution, entity resolution, data mastering and deduplication using ML

Java 1,146 155 Updated Feb 1, 2026

Business Semantics for Unity Catalog

Python 112 24 Updated Feb 4, 2026

AssetOpsBench - Industry 4.0

HTML 927 139 Updated Feb 4, 2026

In-memory Java DataFrame library

Java 305 29 Updated Feb 1, 2026
Python 1 5 Updated Nov 13, 2025

Moving data tables from one account to another

Python 5 1 Updated Jan 21, 2025

Example project using Zingg on Databricks

Jupyter Notebook 3 Updated Jan 2, 2025

Run TUIs and terminals in your browser

Python 1,334 33 Updated Aug 30, 2024

Non-Metric Space Library (NMSLIB): An efficient similarity search library and a toolkit for evaluation of k-NN methods for generic non-metric spaces.

C++ 3,568 464 Updated Jan 12, 2026

JVector: the most advanced embedded vector search engine

Java 1,681 146 Updated Feb 3, 2026

Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and knowledge-based reasoning tasks.

Python 346 27 Updated Jun 16, 2024

An example of SparkConnect extension.

Java 15 3 Updated Mar 5, 2024

Zingg fuzzy matching for products using metadata and images

Python 9 Updated May 20, 2024

Snowflake Snowpark Java & Scala API

Scala 23 22 Updated Feb 4, 2026

An End-to-End Evaluation Framework for Entity Resolution Systems

Python 36 11 Updated Dec 3, 2023

What's in your data? Extract schema, statistics and entities from datasets

Python 1,541 181 Updated Sep 26, 2025

Schema modelling framework for decentralised domain-driven ownership of data.

Java 261 17 Updated Dec 5, 2023

Translating text attributes (like name, address, phone number) into quantifiable numerical representations Training ML models to determine if these numerical labels form a match Scoring the confide…

Python 30 9 Updated Mar 4, 2024

lakeFS - Data version control for your data lake | Git for data

Go 5,132 427 Updated Feb 4, 2026

A collection of research papers and software related to explainability in graph machine learning.

1,985 135 Updated Apr 4, 2022

Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets

Python 4,844 474 Updated Jan 26, 2026

A completely-from-scratch hobby operating system: bootloader, kernel, drivers, C library, and userspace including a composited graphical UI, dynamic linker, syntax-highlighting text editor, network…

C 6,610 534 Updated Feb 4, 2026

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…

TypeScript 8,624 1,617 Updated Feb 4, 2026

A system for quickly generating training data with weak supervision

Python 5,937 855 Updated May 2, 2024

Examples showing real-life use cases for fal + dbt

Jupyter Notebook 22 4 Updated Apr 27, 2022

Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such …

JavaScript 805 55 Updated Aug 10, 2022

DuckDB is an analytical in-process SQL database management system

C++ 35,881 2,900 Updated Feb 4, 2026

SPEAR: Programmatically label and build training data quickly.

Jupyter Notebook 109 22 Updated Jun 27, 2024

🌊 Continuously synchronize the systems where your data lives, to the systems where you _want_ it to live, with Estuary Flow. 🌊

C++ 873 88 Updated Feb 4, 2026
Next