- The Netherlands
-
17:01
(UTC -12:00)
Starred repositories
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Python packaging and dependency management made easy
DSPy: The framework for programming—not prompting—language models
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
A MNIST-like fashion product database. Benchmark 👇
q - Run SQL directly on delimited files and multi-file sqlite databases
Pure bash script to test and wait on the availability of a TCP host and port
A modular SQL linter and auto-formatter with support for multiple dialects and templated code.
A system for quickly generating training data with weak supervision
Benchmarks of approximate nearest neighbor libraries in Python
Stream Framework is a Python library, which allows you to build news feed, activity streams and notification systems using Cassandra and/or Redis. The authors of Stream-Framework also provide a clo…
Representation learning on large graphs using stochastic graph convolutions.
Scalable and efficient data transformation framework - backwards compatible with dbt.
Python 3 API wrapper for Garmin Connect to get statistics and set activities
MetricFlow allows you to define, build, and maintain metrics in code.
One framework to develop, deploy and operate data workflows with Python and SQL.
Repository of sample applications for https://vespa.ai, the open big data serving engine
On the Dimensionality of Word Embedding
Metadata driven Spark Declarative Pipelines framework for bronze/silver pipelines
Algorithms to categorize products and do named entity recognition on words in product descriptions
Gaussian node embeddings. Implementation of "Deep Gaussian Embedding of Graphs: Unsupervised Inductive Learning via Ranking".
StarRocks MCP (Model Context Protocol) Server
Text classification using Naive Bayes and Elasticsearch
Dynamic Network Embedding by Modeling Triadic Closure Process
🌊 FluRS: A Python library for streaming recommendation algorithms
An implementation of the minimum description length principal expert binning algorithm by Usama Fayyad