Stars
Research code accompanying AlphaGenome
Helper tool for configuring routed IPTV on the UniFi Dream Machine (Pro)
Real-time Go struct memory layout visualization and optimization for VS Code. Analyze padding, alignment, and cache performance with one-click field reordering.
A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.
Open source implementation of the SOQL parser for Go
GoMLX: An Accelerated Machine Learning Framework For Go
NASA JPL Ephemeris Reader is a Go library designed for reading and processing JPL (Jet Propulsion Laboratory) ephemeris files.
Merlion: A Machine Learning Framework for Time Series Intelligence
Logica is a logic programming language that compiles to SQL. It runs on DuckDB, Google BigQuery, PostgreSQL and SQLite.
Curated list of resources about Apache Airflow
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀
Golang port of simdjson: parsing gigabytes of JSON per second
Portuguese pre-trained BERT models
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable,…
Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.
High-level tools to simplify visualization in Python.
Fast vectorization, topic modeling, distances and GloVe word embeddings in R.
A small Python library for validating data with pandas