- https://github.com/zinggAI/zingg
- India
- @sonalgoyal
Stars
The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data 📊
Apache Spark - A unified analytics engine for large-scale data processing
DuckDB is an analytical in-process SQL database management system
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
ANTLR (ANother Tool for Language Recognition) is a powerful parser generator for reading, processing, executing, or translating structured text or binary files.
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Always know what to expect from your data.
Distributed and fault-tolerant realtime computation: stream processing, continuous computation, distributed RPC, and more
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…
A completely-from-scratch hobby operating system: bootloader, kernel, drivers, C library, and userspace including a composited graphical UI, dynamic linker, syntax-highlighting text editor, network…
A system for quickly generating training data with weak supervision
lakeFS - Data version control for your data lake | Git for data
A scalable, distributed Time Series Database.
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
An online drag-and-drop editor to easily build READMEs
Non-Metric Space Library (NMSLIB): An efficient similarity search library and a toolkit for evaluation of k-NN methods for generic non-metric spaces.
A distributed, fault-tolerant graph database
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Querybook is a Big Data Querying UI, combining collocated table metadata and a simple notebook interface.
🚀 Free resources you may use to promote your next startup
Collect, aggregate, and visualize a data ecosystem's metadata
A collection of research papers and software related to explainability in graph machine learning.
JVector: the most advanced embedded vector search engine
What's in your data? Extract schema, statistics and entities from datasets